treating RNA-seq gene expression data in cpm for edgeR
2
0
Entering edit mode
ychoi18 • 0
@ychoi18-23425
Last seen 4.0 years ago

Hi, I have a gene expression data in cpm not raw count and I want to do DE analysis using edgeR. In this case, is it okay to follow the overall workflow shown in the edgeR user guide because it's not raw counts in integer...? I'm wondering if it's okay to do "calcNormFactors(y)" where y is the DGEList of the data in cpm.

x is my data that contains gene expression values in cpm.

x <- read.table (file~) : this is cpm values of each gene in each samples.`
y <- DGEList(x)
y <- calcNormFactors(y)

thanks in advance.

edger • 706 views
ADD COMMENT
1
Entering edit mode
@steve-lianoglou-2771
Last seen 14 months ago
United States

No, it’s not. If you can’t get the raw counts, then the only thing you can really do is to add a small prior count, log2 transform the data, then use the limma- trend pipeline, ie. use eBayes(fit, trend=TRUE)

You can find more details in the limma help pages, as well as searching these forums.

ADD COMMENT
0
Entering edit mode
Aaron Lun ★ 28k
@alun
Last seen 2 hours ago
The city by the bay

It is, in fact, so wrong that we have a specific section of the edgeR user's guide (2.7.6) warning against it.

ADD COMMENT

Login before adding your answer.

Traffic: 450 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6