TMM normalization of cancer RNA seq
1
0
Entering edit mode
marak • 0
@marak-18210
Last seen 5.9 years ago

i have RNA seq counts data when i apply TMM normalization why do i also get negative values beside positive one my source code is below:

 

library(edgeR)
RNAseq2 <-read.delim("C:\\Users\\hp folio\\Desktop\\BRCA.tsv",header = TRUE)
rnames <-RNAseq2[,1]

MA <- data.matrix(RNAseq2[,2:60])
MAA  <- (2^MA)- 1
head(MAA)
dge <- DGEList(MAA)
dim(dge)
cal <- calcNormFactors(dge,method = "TMM")
RR <- cpm(cal,normalized.lib.sizes=TRUE ,log = TRUE,prior.count = 1)
hist(RR)
row.names(RR)<- rnames
write.table(RR,"C:\\Users\\hp folio\\Desktop\\TMM3.tsv",sep='\t',row.names=TRUE,col.names = TRUE)

 

 

pls:my data was log2(count+1) transformed so i have also reversed the transformation.i will be very thankful if anyone help me.

 

cancer normalization • 1.1k views
ADD COMMENT
0
Entering edit mode
@steve-lianoglou-2771
Last seen 22 months ago
United States

You are getting negative values because you are calling cpm(...) with log = TRUE.

Some of your raw/unlogged counts-per-million values are falling between 0 and 1, and therefore their log2 values are negative.

ADD COMMENT
0
Entering edit mode

thanks a lot . i removed log=true and it worked.Can you please guide me that histogram of TMM normalized values must be in normal distribution form?

ADD REPLY
0
Entering edit mode

There is no reason to expect that the logCPM values would be normally distributed. Why do you think that they should? And do note that many normal distributions include negative values (for example the standard normal distribution), so if you were expecting your 'normalized' data to be normally distributed it shouldn't have bothered you that some values were negative.

ADD REPLY

Login before adding your answer.

Traffic: 775 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6