Should I use CPM or Voom transformed values for downstream analysis?
1
0
Entering edit mode
@f6f38afd
Last seen 13 months ago
Macao

Hi,

I have used Voom to transform my data with the TMM scaling factor and identified a list of differentially expressed genes for my RNA-seq data. What I want to do next, is to generate a bar plot on some genes of interest and also plot an expression heatmap. My question is, can I use the transformed E values from Voom for this, or should I recalculate the Log2CPM values using the cpm(x, log=T, prior.count = 0.5). I had taken a look at the transformed counts from both functions and their differences are subtle. I'm not sure which one is most ideal. Thanks for the help in advance.

edgeR limma • 1.3k views
ADD COMMENT
1
Entering edit mode
@gordon-smyth
Last seen 8 hours ago
WEHI, Melbourne, Australia

For plotting and descriptive purposes we recommend cpm(x, log=TRUE). Note that you do not need to set prior.count=0.5, just use the default. See for example Section 5.1 of the limma-voom workflow.

The voom E values are designed to be used in conjunction with the voom precision weights. For plotting or downstream analyses that can't use the precision weights, it is better to use cpms with a larger prior.count in order to reduce the variance of the log2CPM values for small counts.

ADD COMMENT
0
Entering edit mode

Hi Gordon, thanks for the guidance. That was very useful.

ADD REPLY

Login before adding your answer.

Traffic: 438 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6