Fix: calculate Cooks distances with few samples #44

BorisMuzellec · 2023-01-12T10:17:10Z

This PR aims to fix the bug described in Issue #43.

The issue seemed to be that the summation was done on the wrong axis in the trimmed_variance function, called through robust_method_of_moments_disp in the calculate_cooks method of the DeseqDataSet class.

ghost

Very high level review, but is there any way to add a unit test to check the behaviour that lead to this bug? This way, we make sure the codebase does not forget about it. Also, it's a good way to ensure this PR really fixes the related issue.

BorisMuzellec · 2023-01-12T12:49:26Z

Very high level review, but is there any way to add a unit test to check the behaviour that lead to this bug?

I guess we could add a test where the pipeline is run on a dataset with few samples, as the piece of code responsible for the bug only is run only when no cohort has 3 or more samples. Not sure what to test for though, would error-free execution be enough?

ghost · 2023-01-12T12:52:30Z

Not sure what to test for though, would error-free execution be enough?

Yes I think that simply adding this corner case and ensuring it runs without error is a good starting point!

…wise

…ples <= 2) cohorts

ghost

LGTM, thanks @BorisMuzellec

BorisMuzellec requested review from arthurPignetOwkin, maikia and a user January 12, 2023 10:17

BorisMuzellec mentioned this pull request Jan 12, 2023

Issue with Cooks Refitting if Less Than Three Replicates #43

Closed

BorisMuzellec added the bug Something isn't working label Jan 12, 2023

ghost reviewed Jan 12, 2023

View reviewed changes

fix: change default axis in trimmedVariance from sample-wise to gene-…

580f216

…wise

BorisMuzellec force-pushed the fix_small_sample_cooks branch from 449442a to 580f216 Compare January 12, 2023 13:59

BorisMuzellec added 2 commits January 12, 2023 15:11

docs: update docstring

8254a68

ci: add test to check that pydeseq2 runs error-free with small (n_sam…

5c3c5da

…ples <= 2) cohorts

BorisMuzellec requested a review from a user January 12, 2023 14:15

ghost approved these changes Jan 13, 2023

View reviewed changes

refactor: set "refit_cooks=True" explicitly in "test_few_samples"

c11ee78

BorisMuzellec merged commit 56a37f3 into main Jan 13, 2023

BorisMuzellec deleted the fix_small_sample_cooks branch January 13, 2023 08:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: calculate Cooks distances with few samples #44

Fix: calculate Cooks distances with few samples #44

BorisMuzellec commented Jan 12, 2023

ghost left a comment •

edited by ghost

Loading

BorisMuzellec commented Jan 12, 2023

ghost commented Jan 12, 2023

ghost left a comment

Fix: calculate Cooks distances with few samples #44

Fix: calculate Cooks distances with few samples #44

Conversation

BorisMuzellec commented Jan 12, 2023

ghost left a comment • edited by ghost Loading

Choose a reason for hiding this comment

BorisMuzellec commented Jan 12, 2023

ghost commented Jan 12, 2023

ghost left a comment

Choose a reason for hiding this comment

ghost left a comment •

edited by ghost

Loading