Hello!
Me and some colleagues are trying to work out if we need to account for multiple testing when using DESEQ2 or if this is something the package covers already. And also, how to handle global testing rather than pairwise.
Our data structure: RNA-seq gene count data with the variables: tissue type (6 levels), treatment (2 levels control/treated), and replicate number (3 levels).
The types of questions we want to ask: (1) which genes are differentially expressed in each tissue between control and treatment? (2) which genes are differentially expressed from one tissue to the next? (3) which genes are differentially expressed in one tissue compared to the others? (global)
On a previous forum, someone suggested creating another variable "group" which combines tissue type and treatment to allow us to extract certain groups needed for the first question. Therefore "group" variable has 12 levels.
Contrast examples:
TissueA:control versus TissueA:treated
Tissue B:control versus TissueB:treated
etc
Tissue A versus Tissue B
Tissue A versus Tissue C
etc
Tissue A versus all other tissues.
Tissue B versus all other tissues.
etc
I am aware that DESEQ2 using BH adjustment to generate p-adjusted values. According to Michael Love "The adjusted p-value column only correct across genes." but not across different contrasts/comparisons. I have also looked in the vignette and have not found an answer.
Could anyone please advise on whether we need to do any extra steps for DESEQ2 to account for multiple testing problems, and if we do, what?
Thank you for this explanation!
I am taking this to mean you can treat each "contrast" as an independent object.
Could you explain a little more on achieving Bonferroni significance as I believe the vignette/tutorial recommends not using Bonferroni as it is too conservative.
I said that at least one gene has to reach Bonferroni significance. That's different than saying you are using a Bonferroni adjustment. I am not actually 100% correct on saying this though.
The way BH works is that you sort all the p-values, and then apply successively smaller penalties. The smallest p-value is adjusted using Bonferroni (e.g., the adjusted p-value will be min(1, p x n) where n is the number of tests), the next one is 'half Bonferroni' (min(1, p*n/2)), then 'quarter Bonferroni' (I am making these terms up BTW), until the largest p-value which isn't adjusted at all.
Your smallest p-value might not meet Bonferroni, but if the second smallest p-value is < 0.05 after multiplying by n/2, or the third smallest p-value is < 0.05 after multiplying by n/3, (and on and on) then you will get significant FDR values.