Dispersion Estimates and MA plots
1
0
Entering edit mode
@hazwanfikri0505-23072
Last seen 2.1 years ago
Malaysia

Hi, I am currently performing DE analysis on fungal gene expression in axenic and in planta conditions. I have 3 biological replicates subjected to the two conditions. When inspecting the Dispersion estimates and MA (LFC shrunk) plots, these are the plots. I don't think these plots are accurate. The dispersion plot seems to has bad fit similarly to the MA plot which does not resemble the typical shape of an MA plot. I am new in DE analysis so I need others opinion on these plots

dispersion plot

MA plot

# MA-plot
resLFC <- lfcShrink(ddsTxi, coef="condition_IN_PLANTA_vs_AXENIC", type="apeglm")
plotMA(resLFC, ylim=c(-8,8))


# dispersion estimates
plotDispEsts(ddsTxi, ylim = c(1e-4, 1e3))
DESeq2 • 2.9k views
ADD COMMENT
3
Entering edit mode
@mikelove
Last seen 17 hours ago
United States

This dataset may require a bit more filtering. Can you start by eliminating very low count genes:

dds <- DESeqDataSetFrom... # dataset creation
keep <- rowSums(counts(dds) >= 10) >= 3
dds <- dds[keep,]
# then perform DESeq()...

Also it would be good to visualize with a PCA plot:

vsd <- vst(dds, blind=FALSE)
plotPCA(vsd)
ADD COMMENT
0
Entering edit mode

Thank you for the suggestion!

I pondered upon the idea of filtering before but as I read that DESeq2 performs stricter filtering, I decided on not filtering. This time, I filtered the data based your suggestion (and it should be done for visualization) and so these are the plots

Dispersion plot after filtering MA-plot after filtering

I also produced PCA plot:

PCA

Am I right to suggest that:

1) The high variability of data from in planta samples as seen from PCA is causing the dispersed distribution in the dispersion plot

2) Based on the log-fold change in the MA-plot, there is a hint that there is a large difference in the expression of gene across the two treatment

Thank you!

ADD REPLY
0
Entering edit mode

The high variability in the PCA is attributable to condition, so does not affect the dispersion. Dispersion estimation takes into account the experimental design.

Yes, there are large differences across condition.

I would recommend pre-filtering this dataset, because the excess number of features with small count seemed to impair dispersion estimation in the first set of plots. The second pair of plots look good to me.

ADD REPLY
2
Entering edit mode

Thank you so much for you reply, sir. And thank you for helping other beginners like me throughout the forum!

ADD REPLY

Login before adding your answer.

Traffic: 640 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6