Question

DESeq2 with high range of within-group variability

0

Entering edit mode

Tash. • 0

@tash-17343

Last seen 2.8 years ago

United Kingdom

Hi there,

I'm struggling to decide whether I should split up my groups or use the contrast argument of the results function to extract the comparisons after fitting the model (as explained in the vignette). As you can see in the plot linked below, in the LI-P-L and LI-NP groups, there are a few individuals that don't cluster within their groups.

PCA Sample Type

I'm interested in comparing:

1. LI-P-L vs LI-NP
2. LI-P-L vs HV
3. LI-P-L vs LI-P-NL
4. LI-P-NL vs HV

Based on the plot, do you think it makes sense to split these into 4 separate matrices, or simply use the contrast function?

Thanks so much for your help!

DESeq2 • 811 views

ADD COMMENT • link 3.0 years ago • updated 2.8 years ago Tash. • 0

0

Entering edit mode

I can't see the plot, can you?

ADD REPLY • link 3.0 years ago Michael Love 41k

0

Entering edit mode

Hi Michael,

I can see it? It downloads when I click on the link. I'll try again here. !

PCA

ADD REPLY • link 3.0 years ago Tash. • 0

score 0 · Answer 1 · 2021-05-05

0

Entering edit mode

Michael Love 41k

@mikelove

Last seen 4 hours ago

United States

I still can't see this file, it's an unrecognized format on my machine.

I guess, if you are worried about too much heterogeneity, just use the split dataset approach.

ADD COMMENT • link 3.0 years ago Michael Love 41k

0

Entering edit mode

Hi again Michael,

So sorry, don't know why that doesn't work. Here is an image link instead: PCA

Thanks for the advice!

ADD REPLY • link 3.0 years ago Tash. • 0

1

Entering edit mode

Another option would be to estimate a batch variable using SVA or RUV (see the workflow for example code), and then use this as a blocking variable ~sv1 + condition. For this approach, use the full data to best estimate the batch variable and for the DESeq() analysis.

ADD REPLY • link 3.0 years ago Michael Love 41k