Search
Question: EdgeR and libsize normalization
0
gravatar for François RICHARD
5.3 years ago by
François RICHARD20 wrote:
Dear all, I am using EdgeR on RNA-seq data for differential analysis. I would like to see the impact of the double normalizations (TMM + libsize) on the counts. Correct me if I am wrong but to have the counts after TMM I am doing : TMM_counts = raw_counts / ( libsize * norm.factor ) But how to get the counts after TMM and lib.size normalization ? Calling equalizedLibSizes(d) give me a common libsize (N value) But I am not sure how to rescale the normalise factor obtain on the raw counts. Can someone help me? Thanks a lot Fran?ois
ADD COMMENTlink modified 5.3 years ago by Mark Robinson870 • written 5.3 years ago by François RICHARD20
0
gravatar for Mark Robinson
5.3 years ago by
Mark Robinson870
Mark Robinson870 wrote:
Hi Francois, I'm a little confused as to what you are asking. You asked a similar question last week: https://stat.ethz.ch/pipermail/bioconductor/2012-July/047057.html > Correct me if I am wrong but to have the counts after TMM I am doing : > TMM_counts = raw_counts / ( libsize * norm.factor ) These are normalized "values", but they are, of course, no longer counts. You could multiply by 1e6 and have a (normalized) counts per million interpretation. > But how to get the counts after TMM and lib.size normalization ? The edgeR user's guide says: "The edgeR methodology needs to work with the original digital expression counts, so these should not be transformed in any way by users prior to analysis." And, in your own words: "[TMM] gives a normalization factor that will correspond to an offset in the model that will test for differential expressed genes." So, what do you actually mean be "counts after [?] normalization"? The normalization doesn't actually change the raw counts; it changes the offset in the model. > Calling equalizedLibSizes(d) give me a common libsize (N value) > But I am not sure how to rescale the normalise factor obtain on the raw counts. equalizeLibSizes() gets used only for the purpose of estimating the dispersion parameter, and generally does not need to be called directly. Do you have a reason for calling it directly? Best, Mark ---------- Prof. Dr. Mark Robinson Bioinformatics Institute of Molecular Life Sciences University of Zurich Winterthurerstrasse 190 8057 Zurich Switzerland v: +41 44 635 4848 f: +41 44 635 6898 e: mark.robinson at imls.uzh.ch o: Y11-J-16 w: http://tiny.cc/mrobin ---------- http://www.fgcz.ch/Bioconductor2012 http://www.eccb12.org/t5 On 24.07.2012, at 10:58, Fran?ois RICHARD wrote: > Dear all, > I am using EdgeR on RNA-seq data for differential analysis. > > I would like to see the impact of the double normalizations (TMM + > libsize) on the counts. > > Correct me if I am wrong but to have the counts after TMM I am doing : > TMM_counts = raw_counts / ( libsize * norm.factor ) > > But how to get the counts after TMM and lib.size normalization ? > Calling equalizedLibSizes(d) give me a common libsize (N value) > But I am not sure how to rescale the normalise factor obtain on the raw counts. > > Can someone help me? > > Thanks a lot > > Fran?ois > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
ADD COMMENTlink written 5.3 years ago by Mark Robinson870
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.2.0
Traffic: 314 users visited in the last hour