DESeq2, VST, PCA/clustering: Is it not advisable to standardize the data?
owen.whitley ▴ 10
@owenwhitley-15693

Hi,

I noticed that the DESeq2 vignette (https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#data-quality-assessment-by-sample-clustering-and-visualization) does not standardize the VST-transformed data (center each gene and scale it to unit variance) before PCA and/or clustering. Is it not advisable to do so? The VST removes the mean-variance relationship, but some genes still end up with much more variance in the transformed values than most other genes.

 

Thanks,

Owen

deseq2 • 22k views
@mikelove
United States

The genes with high variance are typically those in which the samples differ for biological reasons, e.g., differential expression. If you rescale all the genes to have equal variance, you demote the biological signal and promote the noise. I've never understood why people suggest doing this.
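
To make the point above concrete, here is a minimal sketch on simulated data. It is not DESeq2 output and not the vignette's code: the values only mimic VST-like numbers, and the sample layout (4 vs 4), the gene counts (10 differential genes out of 2000), the effect size, and the helper names are all assumptions chosen for illustration. It computes PC1 by power iteration with and without per-gene scaling, then checks how well PC1 tracks the group labels.

```python
import math
import random

def center(row):
    """Subtract the mean of one gene's values across samples."""
    m = sum(row) / len(row)
    return [v - m for v in row]

def standardize(row):
    """Center one gene and scale it to unit sample variance (z-score)."""
    c = center(row)
    sd = math.sqrt(sum(v * v for v in c) / (len(c) - 1))
    return [v / sd for v in c]

def pc1_scores(rows, n_iter=2000):
    """Approximate PC1 sample scores (up to sign and scale) by power
    iteration on the sample-by-sample Gram matrix of gene-centered data."""
    n = len(rows[0])
    gram = [[sum(r[i] * r[j] for r in rows) for j in range(n)] for i in range(n)]
    vec = [math.sin(i + 1.0) for i in range(n)]  # fixed, non-constant start
    for _ in range(n_iter):
        vec = [sum(gram[i][j] * vec[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(v * v for v in vec))
        vec = [v / norm for v in vec]
    return vec

def abs_corr(x, y):
    """Absolute Pearson correlation between two equal-length sequences."""
    cx, cy = center(x), center(y)
    num = sum(a * b for a, b in zip(cx, cy))
    den = math.sqrt(sum(a * a for a in cx) * sum(b * b for b in cy))
    return abs(num / den)

# Simulated "transformed" expression: genes in rows, samples in columns.
# Assumption: 10 genes differ between the groups, 1990 genes are pure noise.
random.seed(1)
groups = [0, 0, 0, 0, 1, 1, 1, 1]
n_de, n_null = 10, 1990
data = []
for g in range(n_de + n_null):
    shift = 3.0 if g < n_de else 0.0
    data.append([random.gauss(8.0, 0.3) + shift * grp for grp in groups])

# PCA on centered data (no scaling) versus on z-scored genes.
corr_unscaled = abs_corr(pc1_scores([center(r) for r in data]), groups)
corr_scaled = abs_corr(pc1_scores([standardize(r) for r in data]), groups)

print("|cor(PC1, group)| without scaling:", round(corr_unscaled, 3))
print("|cor(PC1, group)| with scaling:   ", round(corr_scaled, 3))
```

In this constructed setting, the few differential genes carry most of the variance, so PC1 of the unscaled data follows the groups closely. After z-scoring, each of the 1990 noise genes weighs as much as a differential gene, and PC1 tracks the groups less well. Real data will differ in the details.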
