Question

Accounting for batch effect in DESseq2

0

Entering edit mode

gtechbio ▴ 10

@gtechbio-13996

Last seen 10 months ago

Spain

Dear all,

I am having a problem with removing/accounting for the batch effect in my RNAseq experiment using DESeq2.

Initially we did 1 big time-course experiment in one batch of human cells growing with fungus.

The colData is the following:

Then I was comparing all time points against 0 using contrasts.

After getting some initial results, we decided to perform another experiment at 3h with a different fungus (lets call it 3D) to compare the difference between 0vs3 and 3vs3D.

After analyzing the data (not only what is mentioned above, but a similar experimental design with 3 other fungal species) I realized that 3D samples have a strong batch effect (I have seen this by removing the batch effect using limma).

For the case above, when I want to add the batch variable to the design, I get the error the model matrix is not full rank, which of course makes sense.

I am wondering is there is any workaround for this problem? For example, can I somehow use the output of limma removeBatchEffect to construct a new model and make DE analysis? Thank you

deseq2 batch_effect • 822 views

ADD COMMENT • link updated 5.1 years ago by Michael Love 41k • written 5.1 years ago by gtechbio ▴ 10

score 1 · Answer 1 · 2019-03-14

1

Entering edit mode

Michael Love 41k

@mikelove

Last seen 14 hours ago

United States

It sounds like you have a confounded design. You should have included some 0's in the new batch if you wanted to learn about the 3 vs 0 difference, but as it is, you cannot make much use of these extra samples. You don't know which is closer to the true level of expression, the mean level of expression from 3 or 3D, all you know is that they are different from each other, separated by a batch effect.

ADD COMMENT • link 5.1 years ago Michael Love 41k

0

Entering edit mode

Hi Michael, Thanks for the reply. If I would have known in advance the genes which are not differentially expressed in 3 and 3D compared to 0 (basically genes which are intact to both types of treatment) without the batch effect, can I use these genes to somehow infer the the strength of batch effect given that with batch effect these genes change expression?

Thank you

ADD REPLY • link 5.1 years ago gtechbio ▴ 10

1

Entering edit mode

I don't really follow, but generally DESeq2 won't make much use of these samples, especially as you can't tell from the data what's batch and what's treatment effect.

ADD REPLY • link 5.1 years ago Michael Love 41k

0

Entering edit mode

Thanks for swift reply Michael! Sorry for not explaining properly, but it seems that what I meant is implemented in RUVg package http://bioconductor.org/packages/release/bioc/vignettes/RUVSeq/inst/doc/RUVSeq.pdf (chapter 2.2), which can be coupled with DESeq2. I am wondering if you have any experience with this package? Maybe I should open a separate question for this. Thanks again for support

ADD REPLY • link 5.1 years ago gtechbio ▴ 10

1

Entering edit mode

Yeah, feel free to post an RUVg question (their answer will be applicable broadly to all the RNA-seq DE packages). But it's outside the scope of DESeq2 methods.

ADD REPLY • link 5.1 years ago Michael Love 41k