Question: PCA on deseq2 gene data set
gravatar for sally_b86
10 months ago by
sally_b860 wrote:


Just a quick question please I plotted PCA plot of gene expression data in DESeq2 via the genespca function in pcaexplorer package, I got 2 subsets distributed in the second dimension, once checking the genes I found the 2 subsets represent up and down regulated genes. My question is what does PC1 represent. I was comparing a test to control only with 8 replicates each.

Thank you in advance.

ADD COMMENTlink modified 10 months ago by Michael Love19k • written 10 months ago by sally_b860
gravatar for Michael Love
10 months ago by
Michael Love19k
United States
Michael Love19k wrote:

Wikipedia has a paragraph on the intuition behind PCA:

PC1, or the first component, represents the "direction" or "axis" in the space of the genes (by default we look at the top 500 genes with most variance of transformed counts) that captures most of the variance among samples.

So you can imagine, if you have a block of 10 genes that show very large DE changes across condition, and this is a big difference relative to the variance across samples for other genes in the experiment, then these 10 genes would have a big contribution to PC1, and PC1 would show separation of the samples by condition.

ADD COMMENTlink written 10 months ago by Michael Love19k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.2.0
Traffic: 192 users visited in the last hour