PCA on DESeq2 gene data set
sally_b86 • 0

Hello, 

Just a quick question, please. I plotted a PCA of gene expression data from DESeq2 using the genespca function in the pcaExplorer package. The genes fall into 2 subsets separated along the second dimension (PC2), and on checking the genes I found that the 2 subsets correspond to up- and down-regulated genes. My question is: what does PC1 represent? I was comparing a test condition to a control only, with 8 replicates each.

Thank you in advance.

pcaexplorer genespca deseq2 • 1.1k views
@mikelove

Wikipedia has a paragraph on the intuition behind PCA:

https://en.wikipedia.org/wiki/Principal_component_analysis#Intuition

PC1, the first principal component, is the "direction" or "axis" in the space of the genes that captures the most variance among samples. (By default we look at the 500 genes with the highest variance of transformed counts.)
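
In symbols, this definition can be sketched as follows. The notation is assumed here for illustration and does not come from the post: X is the n × p matrix of transformed counts, with the n samples as rows and the p selected genes as columns, each column centered to mean zero.

```latex
w_1 = \arg\max_{\lVert w \rVert = 1} \operatorname{Var}(Xw)
    = \arg\max_{\lVert w \rVert = 1} \frac{w^\top X^\top X \, w}{n - 1},
\qquad
t_1 = X w_1
```

Under this notation, w_1 holds one loading per gene and t_1 holds one score per sample. The scores are what appear along the PC1 axis of a sample PCA plot.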

So you can imagine: if you have a block of 10 genes that show very large DE changes across conditions, and this difference is large relative to the across-sample variance of the other genes in the experiment, then these 10 genes would contribute heavily to PC1, and PC1 would separate the samples by condition.
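
The scenario above can be tried out on simulated data. The following is a minimal sketch in plain Python, not the code used by DESeq2 or pcaExplorer. Normally distributed toy values stand in for transformed counts, and the effect size, noise level, and the function names `simulate`, `top_variance_genes`, and `first_pc` are all hypothetical choices made for illustration. PC1 is found by power iteration to keep the example dependency-free.

```python
import math
import random

N_PER_GROUP = 8   # 8 control + 8 test samples, as in the question
N_DE = 10         # size of the strongly DE gene block (assumed for the toy data)

def simulate(n_genes=1000, effect=6.0, seed=1):
    """Toy 'transformed counts': rows are genes, columns are samples."""
    rng = random.Random(seed)
    data = []
    for g in range(n_genes):
        shift = effect if g < N_DE else 0.0  # genes 0..9 form the DE block
        control = [rng.gauss(8.0, 0.5) for _ in range(N_PER_GROUP)]
        test = [rng.gauss(8.0 + shift, 0.5) for _ in range(N_PER_GROUP)]
        data.append(control + test)
    return data

def variance(xs):
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

def top_variance_genes(data, ntop=500):
    """Indices of the ntop genes with the highest variance across samples."""
    order = sorted(range(len(data)), key=lambda g: variance(data[g]), reverse=True)
    return order[:ntop]

def sample_scores(centered, w):
    """Project every sample onto the gene-loading vector w."""
    n_samples = len(centered[0])
    return [sum(row[j] * wg for row, wg in zip(centered, w))
            for j in range(n_samples)]

def first_pc(data, genes, n_iter=50, seed=0):
    """PC1 gene loadings and sample scores via power iteration."""
    # Center each selected gene across samples.
    centered = []
    for g in genes:
        m = sum(data[g]) / len(data[g])
        centered.append([x - m for x in data[g]])
    rng = random.Random(seed)
    w = [rng.gauss(0.0, 1.0) for _ in genes]  # random starting direction
    for _ in range(n_iter):
        scores = sample_scores(centered, w)
        # Multiply by X^T X (X = samples x genes), then renormalize.
        w = [sum(c * s for c, s in zip(row, scores)) for row in centered]
        norm = math.sqrt(sum(x * x for x in w))
        w = [x / norm for x in w]
    return w, sample_scores(centered, w)

data = simulate()
genes = top_variance_genes(data)
loadings, scores = first_pc(data, genes)

# Rank genes by absolute contribution to PC1.
ranked = sorted(zip(genes, loadings), key=lambda gl: abs(gl[1]), reverse=True)
top10 = [g for g, _ in ranked[:N_DE]]

print("genes with largest |PC1 loading|:", sorted(top10))
print("control PC1 scores:", [round(s, 1) for s in scores[:N_PER_GROUP]])
print("test PC1 scores:   ", [round(s, 1) for s in scores[N_PER_GROUP:]])
```

In this toy setup the genes with the largest absolute loadings are the simulated DE block, and the control and test samples land on opposite sides of zero along PC1. The sign of a principal component is arbitrary, so only the separation matters, not which group is positive.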
