Estimating Fold-Changes of Lowly Expressed Genes
1
0
Entering edit mode
vm ▴ 30
@vm-21340
Last seen 22 months ago
Brazil

I am doing a DGE analysis using RNAseq data to compare three conditions. I am using a standard pipeline (Create DGEList > Filter very lowly expressed genes > TMM normalize > DGE). Since there is a significant variation in library sizes (8-fold), I chose voom rather than limma-trend for DGE.

However, some of my top DEG are genes of low expression (e.g. Gene #1 : Voom-CPM = -5.0 in group A vs Voom-CPM = -1.0 in group B, hence contrast B versus A logFC =4 with adjusted-P = 10^-20). I am concerned that such fold-changes may be exaggerated. I am aware that voom calculates CPM using a prior.count of only 0.5, whereas the cpm() function uses a default prior.count of 2, leading to FC shrinkage.

Which approach would you recommend to estimate FC in these cases?

limma • 908 views
ADD COMMENT
2
Entering edit mode
@gordon-smyth
Last seen 1 hour ago
WEHI, Melbourne, Australia

voom has already shrunk the logFC so they are not exaggerated. The raw fold-changes would be much larger.

If you are using voom on small counts, and if you have genes that are completely absent in one or more of the treatment groups, then I recommend edgeR::voomLmFit() instead of limma::voom().

Have you filtered low expressed genes using filterByExpr() as recommended in the limma workflows? Filtering is important for voom() although less necessary for voomLmFit().

ADD COMMENT
0
Entering edit mode

Thank you very much for the clarification. Yes, I have used filterByExpr() before running voom. In my case, gene counts are almost absent in one of the conditions under study, so I'll follow your suggestion and try edgeR::voomLmFit(). Thank you once again.

ADD REPLY

Login before adding your answer.

Traffic: 903 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6