EdgeR and libsize normalization
1
0
Entering edit mode
@francois-richard-5410
Last seen 9.6 years ago
Dear all, I am using EdgeR on RNA-seq data for differential analysis. I would like to see the impact of the double normalizations (TMM + libsize) on the counts. Correct me if I am wrong but to have the counts after TMM I am doing : TMM_counts = raw_counts / ( libsize * norm.factor ) But how to get the counts after TMM and lib.size normalization ? Calling equalizedLibSizes(d) give me a common libsize (N value) But I am not sure how to rescale the normalise factor obtain on the raw counts. Can someone help me? Thanks a lot Fran?ois
Normalization edgeR Normalization edgeR • 2.4k views
ADD COMMENT
0
Entering edit mode
Mark Robinson ▴ 880
@mark-robinson-4908
Last seen 5.4 years ago
Hi Francois, I'm a little confused as to what you are asking. You asked a similar question last week: https://stat.ethz.ch/pipermail/bioconductor/2012-July/047057.html > Correct me if I am wrong but to have the counts after TMM I am doing : > TMM_counts = raw_counts / ( libsize * norm.factor ) These are normalized "values", but they are, of course, no longer counts. You could multiply by 1e6 and have a (normalized) counts per million interpretation. > But how to get the counts after TMM and lib.size normalization ? The edgeR user's guide says: "The edgeR methodology needs to work with the original digital expression counts, so these should not be transformed in any way by users prior to analysis." And, in your own words: "[TMM] gives a normalization factor that will correspond to an offset in the model that will test for differential expressed genes." So, what do you actually mean be "counts after [?] normalization"? The normalization doesn't actually change the raw counts; it changes the offset in the model. > Calling equalizedLibSizes(d) give me a common libsize (N value) > But I am not sure how to rescale the normalise factor obtain on the raw counts. equalizeLibSizes() gets used only for the purpose of estimating the dispersion parameter, and generally does not need to be called directly. Do you have a reason for calling it directly? Best, Mark ---------- Prof. Dr. Mark Robinson Bioinformatics Institute of Molecular Life Sciences University of Zurich Winterthurerstrasse 190 8057 Zurich Switzerland v: +41 44 635 4848 f: +41 44 635 6898 e: mark.robinson at imls.uzh.ch o: Y11-J-16 w: http://tiny.cc/mrobin ---------- http://www.fgcz.ch/Bioconductor2012 http://www.eccb12.org/t5 On 24.07.2012, at 10:58, Fran?ois RICHARD wrote: > Dear all, > I am using EdgeR on RNA-seq data for differential analysis. > > I would like to see the impact of the double normalizations (TMM + > libsize) on the counts. > > Correct me if I am wrong but to have the counts after TMM I am doing : > TMM_counts = raw_counts / ( libsize * norm.factor ) > > But how to get the counts after TMM and lib.size normalization ? > Calling equalizedLibSizes(d) give me a common libsize (N value) > But I am not sure how to rescale the normalise factor obtain on the raw counts. > > Can someone help me? > > Thanks a lot > > Fran?ois > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
ADD COMMENT

Login before adding your answer.

Traffic: 666 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6