Question

PCA rand heatmap of raw data

0

Entering edit mode

Fischer-philipp ▴ 30

@fischer-philipp-18490

Last seen 2.8 years ago

Austria

Hello Everybody, first I created a PCA of unnormalized RNA-Seq count data. From that result I used the top n genes with the highest absolute loading value and created a heatmap of using their count data. I know the advised way is to normalize the data (accounting for sequencing depth, ..) first and then maybe use the rlog or vst methods from DESeq2 to stabilize the variances.

But since the different samples cluster really good together - as good as after e.g. rlog transformation and the resulted top n genes represent the biological meaning actually really good - is it still a no go?

My gut and my logic say yes. But I was thinking what you guys might think. Thank you in advance and all the best

DESeq2 DiffSeq PrincipalComponent rlog heatmaps • 1.1k views

ADD COMMENT • link updated 4.0 years ago by Michael Love 43k • written 4.0 years ago by Fischer-philipp ▴ 30

score 1 · Answer 1 · 2021-03-01

1

Entering edit mode

Michael Love 43k

@mikelove

Last seen 15 minutes ago

United States

We discuss in the workflow why variance stabilization is a good idea, so you can just read that (linked from vignette abstract) on my opinion.

ADD COMMENT • link 4.0 years ago Michael Love 43k