PCA plot on DE genes do not separate samples. DESeq2.
1
0
Entering edit mode
g.atla ▴ 10
@gatla-9491
Last seen 7.2 years ago

I am running the DE analysis using Deseq2. When I plot the PCA of differentially expressed genes ( ntop=500) using the command below, I do not see a clear difference between in treated vs untreated.

de <- rownames(resdds[ (resdds$padj<0.05) & (!is.na((resdds)$pvalue)) & (!is.na(resdds$padj)),])
data <- plotPCA(vsd, intgroup="treat", returnData=TRUE, ntop=500)
percentVar <- round(100 * attr(data, "percentVar"))
ggplot(data, aes(PC1, PC2, color=treat)) + geom_point(size=3) + geom_text_repel(aes(label=row.names(data))) + xlab(paste0("PC1: ",percentVar[1],"% variance")) + ylab(paste0("PC2: ",percentVar[2],"% variance"))

Are my results are reliable ? Here is the MA Plot and the GO looks god.

Tanks in advance.

 

deseq2 pca • 3.7k views
ADD COMMENT
4
Entering edit mode
@james-w-macdonald-5106
Last seen 1 day ago
United States

I disagree. You get almost complete separation between the groups on the second principal component. Anyway, principal components is looking at the differences in samples based on a linear combination of the top 500 genes, whereas each univariate comparison is based on individual genes. There will always be differences between an aggregation of data and individual observations, and the former doesn't invalidate the latter.

ADD COMMENT

Login before adding your answer.

Traffic: 395 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6