Dispersion Estimates and MA plots
Entering edit mode
Last seen 19 months ago

Hi, I am currently performing DE analysis on fungal gene expression in axenic and in planta conditions. I have 3 biological replicates subjected to the two conditions. When inspecting the Dispersion estimates and MA (LFC shrunk) plots, these are the plots. I don't think these plots are accurate. The dispersion plot seems to has bad fit similarly to the MA plot which does not resemble the typical shape of an MA plot. I am new in DE analysis so I need others opinion on these plots

dispersion plot

MA plot

# MA-plot
resLFC <- lfcShrink(ddsTxi, coef="condition_IN_PLANTA_vs_AXENIC", type="apeglm")
plotMA(resLFC, ylim=c(-8,8))

# dispersion estimates
plotDispEsts(ddsTxi, ylim = c(1e-4, 1e3))
DESeq2 • 2.4k views
Entering edit mode
Last seen 1 day ago
United States

This dataset may require a bit more filtering. Can you start by eliminating very low count genes:

dds <- DESeqDataSetFrom... # dataset creation
keep <- rowSums(counts(dds) >= 10) >= 3
dds <- dds[keep,]
# then perform DESeq()...

Also it would be good to visualize with a PCA plot:

vsd <- vst(dds, blind=FALSE)
Entering edit mode

Thank you for the suggestion!

I pondered upon the idea of filtering before but as I read that DESeq2 performs stricter filtering, I decided on not filtering. This time, I filtered the data based your suggestion (and it should be done for visualization) and so these are the plots

Dispersion plot after filtering MA-plot after filtering

I also produced PCA plot:


Am I right to suggest that:

1) The high variability of data from in planta samples as seen from PCA is causing the dispersed distribution in the dispersion plot

2) Based on the log-fold change in the MA-plot, there is a hint that there is a large difference in the expression of gene across the two treatment

Thank you!

Entering edit mode

The high variability in the PCA is attributable to condition, so does not affect the dispersion. Dispersion estimation takes into account the experimental design.

Yes, there are large differences across condition.

I would recommend pre-filtering this dataset, because the excess number of features with small count seemed to impair dispersion estimation in the first set of plots. The second pair of plots look good to me.

Entering edit mode

Thank you so much for you reply, sir. And thank you for helping other beginners like me throughout the forum!


Login before adding your answer.

Traffic: 266 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6