how do singleton samples contribute to dispersion estimate in DESeq2?
Entering edit mode
Victor ▴ 10
Last seen 5 weeks ago
United States

Suppose I have a design like this, with a large number of control samples, and a single test sample

design = ~ condition

control treatmentA

50 1

When I estimate the dispersions and run differential expression testing, I would have expected that all the information about the dispersion comes from the control samples, since the treatment sample is perfectly fit by the linear model and does not add any degrees of freedom.

But in practice that's not what happens -- if I exclude the treatment sample and set design = ~ 1 or if I include an additional single sample with, say, "treatmentB", I get quite different estimates for the dispersion.

Is there an intuitive explanation of why?

DESeq2 • 125 views
Entering edit mode
Last seen 2 days ago
United States

For that type of design, they do not really just much information about the dispersion. But the dispersion estimates are MLE and not based on closed formula like the pooled variance estimates for t-tests.


Login before adding your answer.

Traffic: 192 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6