nestedF in decideTests: an analogue of ANOVA with post-hoc t-tests?
@mikhaelmanurung-17423
Last seen 22 months ago
Netherlands

Dear all,

I would like to extend the discussion from my earlier post, "Design and Contrast Matrix for Linear Model. To Combine or not to Combine?"

Briefly, I have data from two groups that are followed longitudinally at three different time points. After going through the limma workflow (lmFit, contrasts.fit, and eBayes), I have now arrived at decideTests.

As the contrasts are closely related, I opted to use 'method=nestedF' instead of 'method=separate', which, if I understand correctly, identifies the contrasts in which a gene is deemed significant after the moderated F-test (and should thereby reduce the number of multiplicity adjustments I have to make).

What is your opinion regarding my approach?

Thank you for your response.

 

limma anova decidetests
@gordon-smyth
Last seen 1 hour ago
WEHI, Melbourne, Australia

No, there's no reason for you to use nestedF. You'd be better off using moderated t-tests and F-tests in the usual limma way, as Aaron and James have previously advised you.

nestedF is really a classification method rather than a testing method (the limma User's Guide explains that it doesn't control the FDR) and it doesn't reduce the number of multiplicity adjustments.

You do not need nestedF or decideTests in order to do F-tests.


Dear Prof. Smyth,

Suppose that I want to test the following null hypotheses:

  • there are no differences between time points (t0, t1, t2) in group A, and
  • there are no differences in pairwise comparisons of time points (see the contrasts defined below).

I find single-factor design easier to understand, so I create a new variable combining group and timepoint.

group_time <- paste(group, timepoint, sep="_")

design <- model.matrix(~ 0 + group_time)

contrast <- makeContrasts(t0t1_inGroupA = groupA_t0 - groupA_t1, 
                          t1t2_inGroupA = groupA_t1 - groupA_t2,
                          t0t2_inGroupA = groupA_t0 - groupA_t2, 
                          levels=design)
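One practical point worth flagging in the code above: model.matrix() prefixes each column name with the variable name (e.g. "group_timegroupA_t0"), so the names referenced in makeContrasts() may not match until the columns are renamed. A minimal sketch, using toy group and timepoint labels standing in for the real vectors (an assumption):

```r
# Toy labels standing in for the poster's group and timepoint vectors
group <- rep(c("groupA", "groupB"), each = 6)
timepoint <- rep(c("t0", "t1", "t2"), times = 4)

group_time <- factor(paste(group, timepoint, sep = "_"))
design <- model.matrix(~ 0 + group_time)
colnames(design) <- levels(group_time)  # strip the "group_time" prefix

colnames(design)  # "groupA_t0" "groupA_t1" ... as used in makeContrasts
```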

Then, I proceed with the workflow until I get an eBayes object named fit2.

Using topTable, is it correct to answer my first hypothesis with:

topTable(fit2, contrast=contrast[,1:3])

and my second hypothesis with:

topTable(fit2, contrast=contrast[,1])
topTable(fit2, contrast=contrast[,2])
topTable(fit2, contrast=contrast[,3])

And if I understand correctly, FDR adjustment for topTable is done as with method=separate, right?

However, as I understand it, the contrasts above are closely related. Is method=separate the right choice here, or method=global?

And my burning question: is the approach above equivalent to an ANOVA followed by post-hoc t-tests?

Thank you for your time, Prof. Smyth. It is very easy to get an output from the software but then it is actually quite challenging to do it right.


1. method is only an argument for decideTests. For topTable, the manner of the correction is unambiguous - you have a set of p-values, and you apply the Benjamini-Hochberg correction across them.

2. As mentioned in 1, method doesn't have a role here. Nonetheless, I will try to answer what seems to be your more conceptual question; should we apply the BH method to the p-values within each contrast or to the pool of p-values across all contrasts? This comes down to the level of FDR control that you want. I would say that controlling the FDR for DE genes within each contrast generally makes most sense for interpretation, i.e., in my set of DE genes, I expect no more than 5% of them to be false positives. That's pretty clear to most people, especially the end-users (i.e., wet-lab people) who use it to make decisions.

However, there are also cases where the FDR across all contrasts may be of interest, in which case applying the BH method globally is more appropriate. This can be the case when you want to make statements across contrasts, e.g., "the comparison between A and B yields twice as many DE genes as the comparison between C and D". By applying the BH method to the pool of p-values, you ensure that the same p-value threshold is used to define DE genes across all contrasts. This avoids inflated differences from the "rich getting richer" behaviour of the BH method, where the presence of some low p-values relaxes the threshold for other genes.
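The difference between the two strategies can be seen with plain p.adjust() on toy p-values (a sketch, not the poster's data; the spiked first column mimics a contrast that happens to contain a few very strong signals):

```r
set.seed(1)
# 1000 genes x 3 contrasts of null (uniform) p-values
p <- matrix(runif(3000), ncol = 3,
            dimnames = list(NULL, c("t0t1", "t1t2", "t0t2")))
p[1:50, 1] <- p[1:50, 1] / 10000  # contrast 1 gets 50 very small p-values

# "separate": BH within each contrast; the spiked contrast relaxes
# its own threshold, while the other two are corrected in isolation
separate <- apply(p, 2, p.adjust, method = "BH")

# "global": BH over the pooled p-values; one common threshold for all
global <- matrix(p.adjust(as.vector(p), method = "BH"), ncol = 3,
                 dimnames = dimnames(p))

colSums(separate < 0.05)
colSums(global < 0.05)
```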

3. No. But why are you worrying about that anyway? It sounds like you want to know which genes are DE between pairs of groups, so just forget about the ANOVA and use the individual contrasts in topTable. There's no point worrying about minimizing the number of multiplicity adjustments - if you need inferences for individual contrasts, then you're going to have to correct for them anyway, ANOVA or otherwise. Even post hoc tests involve some implicit correction for multiple testing across contrasts.

I should also add that the FDR is quite generous when it comes to multiple contrasts. If you get a DE set with 5% false discoveries, and another DE set with 5% false discoveries, and you pool the results, then the pooled set will still have a FDR of 5%. (I'm assuming that you retain a distinction between significant results for same gene but different contrasts; and that at least one true positive is detected in each component set.) So the penalty is less severe than you might think.

By comparison, post-hoc tests like Tukey's control the FWER across contrasts, which is a more conservative criterion - especially if you don't intend to perform all pairwise contrasts. Moreover, this is only done for a single gene, so you'll still have to correct for multiple testing across genes. Finally, it's never been clear to me what is the error rate when you only run post hoc tests on the genes that are detected as being significant from ANOVA. It's much easier to interpret the FDR for specific pairwise comparisons - and if you don't care about an interpretable error rate, then what's the point of doing any of this?


Thank you for your elaborate reply, Aaron.

The point that I wanted to make (or confirm) for the first question is that topTable controls the FDR by treating the p-values as a single vector, just as supplying method=separate to decideTests adjusts the FDR separately for each tested contrast. I do know that method is an argument for decideTests only; I guess I was not clear on that before.

For my second question, I want to compare the number of DE genes across the contrasts. Thanks to your explanation, I am now confident to use method=global for my decideTests. Indeed, in several of my contrasts there are a few very low p-values that 'cranked up' other genes, thereby inflating the number of DE genes with method=separate compared to method=global.

P.S. Did I use the correct method to answer my first and second hypotheses?


No, your code is incorrect, and would lead to errors if you tried to run it, because topTable doesn't accept a contrast argument.

The idea of ANOVA followed by post-hoc tests is not a helpful concept here. All the limma tests are conducted post-hoc.
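For reference, topTable selects contrasts with the coef argument: a vector of coefficients gives a joint moderated F-test, a single coefficient gives a moderated t-test. A minimal runnable sketch with simulated data standing in for the poster's pipeline (the matrix y and its dimensions are made up):

```r
library(limma)

## Tiny simulated stand-in for the poster's data (assumption)
set.seed(7)
y <- matrix(rnorm(100 * 9), 100, 9)  # 100 genes, 9 samples
f <- factor(rep(c("groupA_t0", "groupA_t1", "groupA_t2"), each = 3))
design <- model.matrix(~ 0 + f)
colnames(design) <- levels(f)
contrast <- makeContrasts(groupA_t0 - groupA_t1,
                          groupA_t1 - groupA_t2,
                          groupA_t0 - groupA_t2,
                          levels = design)
fit2 <- eBayes(contrasts.fit(lmFit(y, design), contrast))

## coef, not contrast, picks out the comparisons:
topTable(fit2, coef = 1:3)  # moderated F-test over the three contrasts jointly
topTable(fit2, coef = 1)    # moderated t-test for the first contrast alone
```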


I did not elaborate on that for the sake of brevity, but apparently clarity was sacrificed. I fitted an lmFit object named fit and applied the contrasts as below.

fit.c <- contrasts.fit(fit, contrasts=contrast)
fit2 <- eBayes(fit.c, robust=TRUE)

To get the summary of DE genes in all contrasts, I did the following:

deg <- decideTests(fit2, method="global", p.value=0.05, adjust.method="BH")
summary(deg)

My bad with the code above, as I did not have my script beside me; I will be more careful next time. As I supplied the method="global" argument to decideTests, I used write.fit to get my DE genes for each contrast, not the topTable command.
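For completeness, write.fit can take the decideTests matrix via its results argument, so the -1/0/1 calls are written out next to the statistics. A sketch with simulated data standing in for the real fit2 and deg (the data, labels, and file name are all made up):

```r
library(limma)

## Tiny simulated stand-in for the poster's fit2 (assumption)
set.seed(7)
y <- matrix(rnorm(100 * 9), 100, 9)  # 100 genes, 9 samples
f <- factor(rep(c("t0", "t1", "t2"), each = 3))
design <- model.matrix(~ 0 + f)
colnames(design) <- levels(f)
contrast <- makeContrasts(t0 - t1, t1 - t2, t0 - t2, levels = design)
fit2 <- eBayes(contrasts.fit(lmFit(y, design), contrast))

deg <- decideTests(fit2, method = "global", adjust.method = "BH", p.value = 0.05)

## 'results = deg' appends the -1/0/1 calls from decideTests;
## 'adjust' and 'method' control how the written p-values are adjusted,
## here matching the global BH adjustment used in decideTests
write.fit(fit2, results = deg, file = "de_genes_global.txt",
          adjust = "BH", method = "global")
```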
