Question

Testing moderated fold changes in DESeq2 version >= 1.16.0

1

Entering edit mode

david.watson ▴ 30

@davidwatson-9545

Last seen 5.2 years ago

Queen Mary University of London

I'm trying to run a straightforward Wald test in DESeq2 and was a little surprised to see that fold changes and standard errors are no longer shrunken by default. I know I can still compute moderated values using the lfcShrink() function, but I see no straightforward way to:

a) calculate the corresponding test statistics and p-values; or

b) implement the gene filtering, LFC thresholding, hypothesis testing, and/or p-value adjustment options implemented in results() to these shrunken estimates.

It's especially weird that test statistics and p-values are calculated from raw rather than moderated values even when running lfcShrink(). For instance:

dds <- makeExampleDESeqDataset()
dds <- DESeq(dds)
res <- results(dds)
shrunk <- lfcShrink(dds)

# Fold changes and SEs are different
identical(res$log2FoldChange, shrunk$log2FoldChange)
# [1] FALSE
identical(res$lfcSE, shrunk$lfcSE)
# [1] FALSE
# But test statistics and p-values are the same
identical(res$stat, shrunk$stat)
# [1] TRUE
identical(res$pvalue, shrunk$pvalue)
# [1] TRUE

Of course it's fairly simple to calculate moderated z-scores (z <- shrunk$log2FoldChange / shrunk$lfcSE) and two-sided p-values (p <- 2 * (1 - pnorm(abs(z)))) from the lfcShrink() output, but it's confusing that the function doesn't do so by default or include an option to do so automatically.

Moreover, it would be nice to incorporate some of the helpful functionalities of results() into lfcShrink(). I'm thinking specifically about the ability to specify filter methods and LFC thresholds, perform one-sided tests, etc. Again, it's not impossible to do all this oneself, but I don't see why it shouldn't be part of the package anymore. Is there some simple way to recombine the two functions or at least extend the same options to both? Or are there good reasons why these options were removed that I'm just failing to appreciate?

deseq2 • 969 views

ADD COMMENT • link updated 6.3 years ago by Michael Love 41k • written 6.3 years ago by david.watson ▴ 30

score 0 · Answer 1 · 2017-12-15

hi David,

Here's a post from a few weeks ago that maybe answers some of these questions:

DESeq2 threshold-based Wald tests, before or after lfcShrink?

There's another thread here where we discussed what vignette updates are needed for the new functions (these will go in to the April release):

A: lfcShrink and factors with multiple levels

In short though:

We purposefully want to separate shrunken LFC and the standard DESeq2 null hypothesis tests. p-values and FDR sets are calculated on the MLE LFCs. Going forward we will be adding more statistical inference for the shrunken LFC, it's just that we've been doing a lot of work and internal testing on these estimates themselves, and haven't had time to push everything out to users!

In version 1.18, released in October, we added two new shrinkage estimators (one we built ourselves called 'apeglm', the other leverages the 'ashr' package). In the latest release, you can compute s-values (aggregate false sign rate statistics) on the shrunken LFCs using these new estimation packages, ... but we haven't written it up what these mean and what they are useful for, etc. in the vignette, because we didn't have time before the Bioconductor release, and the new estimation package was only added to Bioconductor in September. So the functionality exists but we didn't advertise it widely yet. We also want to compute posterior areas for arbitrary thresholds, as in lfcThreshold, but didn't get a chance to do this either. That is just sitting as a comment in the code right now.

Did I miss anything? I appreciate the feedback. Hopefully the new vignette documentation for the next release (April) will be able to clear things up.