Question: Multiple comparisons for DE analysis with DESeq2
9 months ago by
francesco.gandolfi wrote:

Hi guys,

My problem is probably not very complex, but working out the appropriate settings for testing differential expression is often a source of confusion for me. Briefly, I want to test for differentially expressed miRNAs in miRNA-seq data. My dataset consists of 8 samples in total, divided into 4 classes of 2 samples (replicates) each. There are 4 sample classes because the experimental design has two factors with two conditions each.

sample            cell_component   type
sample 1 (rep1)   intracellular    Wt
sample 2 (rep2)   intracellular    Wt
sample 3 (rep1)   intracellular    mut
sample 4 (rep2)   intracellular    mut
sample 5 (rep1)   exosome          Wt
sample 6 (rep2)   exosome          Wt
sample 7 (rep1)   exosome          mut
sample 8 (rep2)   exosome          mut


Now I would like to use the DESeq2 package to test for DE miRNAs in the following comparisons:

Exosome_wt vs Intracellular_wt

Exosome_mut vs Exosome_wt

Intracellular_mut vs Intracellular_wt

Exosome_mut vs Intracellular_mut

Obviously, the 'intracellular' condition refers to intracellular miRNAs and 'exosome' refers to miRNA expression from exosomes.

My main doubt is how to test these contrasts with DESeq2. Initially I planned to create the DESeqDataSet object using both experimental factors:

dds <- DESeqDataSetFromMatrix(countData = ReadCountTable, colData = sampleinfo, design = Cell_component ~ Type)

But then, if I understood correctly, the results function of DESeq2 will extract logFC/pvalue/adj.pval only for comparisons between levels of one factor, for example: 

res <- results(dds, contrast = c("cell_component", "exosome", "intracellular"))

or

res <- results(dds, contrast = c("type", "mut", "wt"))

But in my case, I want to test DE between combinations of factors. One solution I have tried is to create a new column in colData containing, for each sample, the corresponding combination of factors (intracellular_wt, intracellular_wt, intracellular_mut, intracellular_mut, exosome_wt, etc.) and then to use results to extract the output of each comparison on the new column:

for example:

res <- results(dds, contrast = c("new_column", "exosome_wt", "intracellular_wt"))

However, I'm not at all sure this is the correct procedure. Can somebody help me?

Thanks a lot,



9 months ago by
Gavin Kelly (United Kingdom / London / Francis Crick Institute) wrote:

Yes, I think your approach of adding a combination factor (design = ~ new_column) is a correct way to carry out the analysis. You could achieve something similar with an interaction design, design = ~ Cell_component * Type, but it wouldn't be as transparent as the approach you've suggested. (I'm not sure whether your 'design = Cell_component ~ Type' is a typo, but DESeq2 designs are generally specified without a left-hand side to the formula.)
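
A minimal sketch of the combined-factor approach, for illustration only. It assumes the DESeq2 package is installed; the column name group, the level ordering and the names used for the comparisons are assumptions, and the count matrix is simulated placeholder data standing in for the real ReadCountTable.

```r
library(DESeq2)

# Sample table mirroring the layout in the question (column names are assumptions).
sampleinfo <- data.frame(
  cell_component = rep(c("intracellular", "exosome"), each = 4),
  type           = rep(rep(c("wt", "mut"), each = 2), times = 2)
)

# Combined factor: one level per cell_component/type combination.
sampleinfo$group <- factor(
  paste(sampleinfo$cell_component, sampleinfo$type, sep = "_"),
  levels = c("intracellular_wt", "intracellular_mut", "exosome_wt", "exosome_mut")
)

# Placeholder counts only: simulated data, NOT the real miRNA counts.
set.seed(1)
ReadCountTable <- counts(makeExampleDESeqDataSet(n = 500, m = 8))
rownames(sampleinfo) <- colnames(ReadCountTable)

# One-sided formula: no left-hand side in the design.
dds <- DESeqDataSetFromMatrix(countData = ReadCountTable,
                              colData   = sampleinfo,
                              design    = ~ group)
dds <- DESeq(dds)

# The four comparisons from the question, as c(factor, numerator, denominator).
comparisons <- list(
  exosome_wt_vs_intracellular_wt   = c("group", "exosome_wt", "intracellular_wt"),
  exosome_mut_vs_exosome_wt        = c("group", "exosome_mut", "exosome_wt"),
  intracellular_mut_vs_intracellular_wt =
    c("group", "intracellular_mut", "intracellular_wt"),
  exosome_mut_vs_intracellular_mut = c("group", "exosome_mut", "intracellular_mut")
)

# One results table per comparison, all extracted from the same fitted model.
res_list <- lapply(comparisons, function(cn) results(dds, contrast = cn))
head(res_list[["exosome_wt_vs_intracellular_wt"]])
```

Because all four tables come from a single DESeq() fit, the dispersion estimates are shared across the comparisons; only the contrast changes.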

One warning: if you go on to look at set intersections of these gene lists (e.g. mutation-differential in exosome but not in intracellular), you are compounding the potential statistical errors of the separate tests. There are two-way designs that answer similar questions in one pass and so may be more appropriate; a local statistician would be able to advise.
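
As a hedged illustration of such a two-way design, the sketch below fits an interaction model, in which the interaction term tests directly whether the mut vs wt effect differs between exosome and intracellular. The data are simulated placeholders, the column names are assumptions, and the coefficient names depend on the factor levels, so check them with resultsNames(dds) on real data.

```r
library(DESeq2)

# Placeholder data only: simulated counts for 8 samples, NOT real miRNA counts.
set.seed(1)
dds <- makeExampleDESeqDataSet(n = 500, m = 8)

# Two factors with explicit reference levels (intracellular and wt).
dds$cell_component <- factor(rep(c("intracellular", "exosome"), each = 4),
                             levels = c("intracellular", "exosome"))
dds$type <- factor(rep(rep(c("wt", "mut"), each = 2), times = 2),
                   levels = c("wt", "mut"))

# Main effects plus their interaction.
design(dds) <- ~ cell_component * type
dds <- DESeq(dds)

# Inspect the coefficient names before extracting anything.
resultsNames(dds)

# Interaction term: is the mut vs wt effect different in exosome
# compared with intracellular?
res_interaction <- results(dds, name = "cell_componentexosome.typemut")

# mut vs wt within exosome: main effect plus interaction term.
res_mut_in_exosome <- results(
  dds,
  contrast = list(c("type_mut_vs_wt", "cell_componentexosome.typemut"))
)
```

In this parameterization the main effect type_mut_vs_wt is the mut vs wt difference in the reference compartment (intracellular) only, which is part of why the combined-factor design is easier to read.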


Hi Gavin,

OK, for the moment I will try the first approach. Thanks a lot for your help and your suggestions!



written 9 months ago by francesco.gandolfi