Question

PCA for voom normalized RNAseq data

2

Entering edit mode

Merienne Nicolas ▴ 120

@merienne-nicolas-6729

Last seen 7.1 years ago

Switzerland

Dear all,

I have a very basic question about PCA for RNAseq data. I am currently analysing data normalized with the voom method as described in the Limma user guide. Now I would like to explore the distribution of my samples using PCA with the prcomp function. However, I am not sure of the good way to use the scaling (scale.) function with prcomp. If I understand well, scale. = TRUE is used when we expect that a small number of variables with a high variance will mainly contribute to the total variance of the distribution, which is seen on raw read counts of RNAseq data. But the voom normalization should have reduce this effect. So, intuitively, I could use scale. = FALSE for normalized RNAseq data. Am I correct? If not, could anyone explain me why?

Please, do not hesitate to tell me if my explanations were not clear.

Thank you very much in advance.

Best,

Nico

voom pca • 3.8k views

ADD COMMENT • link updated 9.7 years ago by Aaron Lun ★ 28k • written 9.7 years ago by Merienne Nicolas ▴ 120

score 2 · Answer 1 · 2015-04-13

2

Entering edit mode

Aaron Lun ★ 28k

@alun

Last seen 1 hour ago

The city by the bay

If you want to explore your data, you could just use plotMDS on the EList object produced by voom. This generates a MDS plot that achieves the same thing as a PCA plot, i.e., libraries that cluster together are more similar, those that are far apart are more different.

ADD COMMENT • link 9.7 years ago Aaron Lun ★ 28k

0

Entering edit mode

Hi Aaron,

Thank you for your answer. I tried with plotMDS and this gave me similar results with the unscaled PCA. So I think that due to the normalization and transformation of the data, it should be possible to use both kind of PCA with prcomp function. However, I will rather use plotMDS directly on the EList object.

Thanks.

Nico

ADD REPLY • link 9.7 years ago Merienne Nicolas ▴ 120