Biological replicates consideration in DESeq2 design formula.
2
2
Entering edit mode
Christian ▴ 20
@59b7b1bb
Last seen 3 months ago
Spain

Dear all,

I have a question regarding the consideration of the biological replicates inside the design formula of DESeq2. We performed 2 different approaches, one being (design=~condition) and the other (design= ~replicate + condition) (see Metadata in picture 1) and found out that there were significant variations in number and padj value of DEG detected.

colData

Since PCA plot shows similar tendencies between conditions but also variance between samples from the same group, we were wondering if it would be more accurate to adjust the design to ignore the inherent variance of independent biological replicates and batch effects to maximize differences that are based on our condition of interest. PCA plot

To put into perspective, first approach gives us 44 DEG (filtered by padj<0,05 only) and the second one 1640 (padj< 0,05), with the second one allowing us to introduce stricter filtering criteria such as |log2foldChange| >1.

Thanks in advance for your help

Christian

PhDStudent in Health Sciences

```

DESeq2 RNASeqData RNASeq • 1.3k views
ADD COMMENT
1
Entering edit mode
ATpoint ★ 4.9k
@atpoint-13662
Last seen 3 days ago
Germany

You clearly see that the PCA is dominated by the effect of "replicate", that is what people often call "batch effect", so unwanted technical variation due to timepoint of doing experiment. The formula that includes replicate corrects for this unwanted variation. That is exactly what in the vignette is shown in Quick Start -- including batch to control for unwanted variation. It's the right choice here imo.

ADD COMMENT

Login before adding your answer.

Traffic: 2205 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6