DESeq2 - dispersion estimation when there is none
1
0
Entering edit mode
WardDeb • 0
@901be5b0
Last seen 3 days ago
Germany

Hi,

I'm trying to better grasp what is happening under the hood during dispersion estimation, and have a bit of a naive question. When using the makeExampleDESeqDataSet function, I can set the dispersions to zero by just specifying a 'null function', i.e.:

dds_pois <- makeExampleDESeqDataSet(
  n=5000,
  m=10,
  dispMeanRel = function(x) 0
)

and indeed the dispersions here are all zero, and thus our counts should follow Poisson (mean == var).

mcols(dds_pois) %>% head()

DataFrame with 6 rows and 3 columns
      trueIntercept  trueBeta  trueDisp
          <numeric> <numeric> <numeric>
gene1      5.016721         0         0
gene2      0.226108         0         0
.....

If I now run DESeq and plot the estimated dispersion:

dds_pois <- DESeq(dds_pois)
estimating size factors
estimating dispersions
gene-wise dispersion estimates
mean-dispersion relationship
-- note: fitType='parametric', but the dispersion trend was not well captured by the
   function: y = a/x + b, and a local regression fit was automatically substituted.
   specify fitType='local' or 'mean' to avoid this message next time.
final dispersion estimates
fitting model and testing
plotDispEsts(dds_pois, CV=TRUE)

I see a negative linear relationship between the counts and the estimated dispersion. I would expect the estimates to be a) all very close to the minimum (i.e. 1e-8) and b) not have any specific trend at all to begin with. Am I missing something in my interpretation ?

est_disps

Thanks !

DESeq2 • 214 views
ADD COMMENT
2
Entering edit mode
@mikelove
Last seen 1 hour ago
United States

This is discussed in the paper, that you will still have some non-zero estimates even when the true value of log dispersion is negative infinity.

For data consistent with a Poisson, you can use a Poisson GLM. In practice, we typically see non-zero dispersion for most genes when the data are not technical replicates.

ADD COMMENT
0
Entering edit mode

Thank you for the pointers, I'll revert back again to the method section of the paper. As for the negative relationship of the estimated dispersions, is it correct to assume that this is a result of the assumed mean-dependent prior for the dispersions (i.e. the alpha_tr in the manuscript) ?

ADD REPLY
1
Entering edit mode

The red line does assume that shape but the fact the black points show that pattern is discussed in the 2014 paper.

Some caution is warranted to disentangle true underlying dependence from effects of estimation bias that can create a perceived dependence of the dispersion on the mean.

ADD REPLY

Login before adding your answer.

Traffic: 854 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6