Question: PCA on deseq2 gene data set
8 days ago by
sally_b860
sally_b860 wrote:

Hello, 

Just a quick question, please. I plotted a PCA of gene expression data from DESeq2 using the genespca function in the pcaExplorer package. The genes fall into 2 subsets separated along the second dimension, and on checking the genes I found that the 2 subsets correspond to up- and down-regulated genes. My question is: what does PC1 represent? I was comparing test to control only, with 8 replicates each.

Thank you in advance.

8 days ago by
Michael Love15k
United States
Michael Love15k wrote:

Wikipedia has a paragraph on the intuition behind PCA:

https://en.wikipedia.org/wiki/Principal_component_analysis#Intuition

PC1, or the first component, represents the "direction" or "axis" in the space of the genes (by default we look at the top 500 genes with most variance of transformed counts) that captures most of the variance among samples.

So you can imagine: if you have a block of 10 genes that show very large DE changes across conditions, and these changes are large relative to the variance across samples for the other genes in the experiment, then these 10 genes would contribute heavily to PC1, and PC1 would separate the samples by condition.
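
To make that intuition concrete, here is a small simulation sketch. It is plain Python with NumPy, not DESeq2 or pcaExplorer code, and everything in it is an assumption made for illustration: the data are random numbers standing in for transformed counts, the 2000 genes, the shift of 8 units, and all variable names are invented, and `ntop` is named only to echo the "top 500 genes" default mentioned above. It simulates 8 control and 8 test samples, shifts a block of 10 genes in the test group, keeps the 500 most variable genes, and runs PCA on the samples via SVD.

```python
import numpy as np

rng = np.random.default_rng(0)

n_per_group = 8   # replicates per condition, as in the question
n_genes = 2000    # hypothetical number of genes
n_de = 10         # size of the strongly DE gene block
ntop = 500        # number of most-variable genes kept for the PCA

# Rows are samples, columns are genes; values stand in for transformed counts.
x = rng.normal(loc=0.0, scale=1.0, size=(2 * n_per_group, n_genes))
condition = np.array([0] * n_per_group + [1] * n_per_group)  # 0 = control, 1 = test

# Give the first n_de genes a large shift in the test samples.
x[condition == 1, :n_de] += 8.0

# Keep the ntop genes with the highest variance across samples.
selected = np.argsort(-x.var(axis=0))[:ntop]
x_top = x[:, selected]

# PCA on samples: center each gene, then take the SVD.
centered = x_top - x_top.mean(axis=0)
u, s, vt = np.linalg.svd(centered, full_matrices=False)

scores = u * s                        # sample coordinates on each PC
var_explained = s**2 / np.sum(s**2)   # fraction of variance per PC
pc1_scores = scores[:, 0]
pc1_loadings = vt[0]                  # contribution of each selected gene to PC1

# Genes (original column indices) with the largest absolute PC1 loadings.
top_loading_genes = np.sort(selected[np.argsort(-np.abs(pc1_loadings))[:n_de]])

print("PC1 variance fraction:", round(float(var_explained[0]), 3))
print("PC1 scores, control:", np.round(pc1_scores[condition == 0], 1))
print("PC1 scores, test:   ", np.round(pc1_scores[condition == 1], 1))
print("Top PC1 genes:", top_loading_genes)
```

In this toy setup the PC1 scores split the samples by condition, and the genes with the largest PC1 loadings are the shifted block. If the shift were small relative to the noise in the other genes, PC1 would not necessarily line up with condition.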
