Different Normalizations
2
0
Entering edit mode
@julia-engelmann-559
Last seen 9.6 years ago
Hi, I am comparing different normalizations of Affymetrix microarrays: 1. expresso with rma, vsn, pmonly and medianpolish 2. calling rma and vsn with: eset=rma(Data); Set=vsn(eset); I know that medianpolish does log2 transformation and vsn does log transformation (base e), but when I compare the results, the difference between the two datasets is huge- not a matter of log-base. Shouldn't the results be very similar? Any help would be much appreciated. Thanks, Julia
vsn vsn • 1.5k views
ADD COMMENT
0
Entering edit mode
Susan Holmes ▴ 120
@susan-holmes-158
Last seen 9.6 years ago
On Fri, 6 Feb 2004, Julia Engelmann wrote: > > Hi, > I am comparing different normalizations of Affymetrix microarrays: > 1. expresso with rma, vsn, pmonly and medianpolish > 2. calling rma and vsn with: eset=rma(Data); Set=vsn(eset); > I know that medianpolish does log2 transformation and vsn does log > transformation (base e), but when I compare the results, the difference > between the two datasets is huge- not a matter of log-base. > Shouldn't the results be very similar? > Julia, vsn doesn't use log (base) but a different trasnformation that stabilizes the variance, I have found the results to be quite different too, they are meant to be especially if there is a strong dependency between variance and expression levels. Best Susan Holmes Associate Professor Statistics Stanford
ADD COMMENT
0
Entering edit mode
@anja-von-heydebreck-625
Last seen 9.6 years ago
Hi Julia, >Hi, >I am comparing different normalizations of Affymetrix microarrays: >1. expresso with rma, vsn, pmonly and medianpolish >2. calling rma and vsn with: eset=rma(Data); Set=vsn(eset); > rma gives you expression levels on the log scale, whereas vsn expects input values on the raw scale. So it doesn't make sense to call vsn after rma like in 2. (apart from the question why you should normalize your data twice - I guess in 1. you used rma only for background correction; but calling rma like in 2. involves also quantile normalization and medianpolish). >I know that medianpolish does log2 transformation and vsn does log >transformation (base e), but when I compare the results, the difference >between the two datasets is huge- not a matter of log-base. >Shouldn't the results be very similar? > >Any help would be much appreciated. > >Thanks, >Julia > > Best, Anja
ADD COMMENT
0
Entering edit mode
On Sat, 7 Feb 2004, Anja von Heydebreck wrote: > Hi Julia, > > >Hi, > >I am comparing different normalizations of Affymetrix microarrays: > >1. expresso with rma, vsn, pmonly and medianpolish > >2. calling rma and vsn with: eset=rma(Data); Set=vsn(eset); > > > rma gives you expression levels on the log scale, whereas > vsn expects input values on the raw scale. So it doesn't make sense > to call vsn after rma like in 2. (apart from the question why you > should normalize your data twice - I guess in 1. you used rma only for > background correction; but calling rma like in 2. involves also > quantile normalization and medianpolish). of cource you could do: exprs(eset) <- 2^exprs(eset) eset = vsn(eset) exprs(eset) <- log2(exprs(eset)) this should make things look more alike. > > >I know that medianpolish does log2 transformation and vsn does log > >transformation (base e), but when I compare the results, the difference > >between the two datasets is huge- not a matter of log-base. > >Shouldn't the results be very similar? > > > >Any help would be much appreciated. > > > >Thanks, > >Julia > > > > > Best, > Anja > > _______________________________________________ > Bioconductor mailing list > Bioconductor@stat.math.ethz.ch > https://www.stat.math.ethz.ch/mailman/listinfo/bioconductor >
ADD REPLY
0
Entering edit mode
Rafael A. Irizarry wrote: >On Sat, 7 Feb 2004, Anja von Heydebreck wrote: > > >>Hi Julia, >> >> >>>Hi, >>>I am comparing different normalizations of Affymetrix microarrays: >>>1. expresso with rma, vsn, pmonly and medianpolish >>>2. calling rma and vsn with: eset=rma(Data); Set=vsn(eset); >>> >>> >>> >>rma gives you expression levels on the log scale, whereas >>vsn expects input values on the raw scale. So it doesn't make sense >>to call vsn after rma like in 2. (apart from the question why you >>should normalize your data twice - I guess in 1. you used rma only for >>background correction; but calling rma like in 2. involves also >>quantile normalization and medianpolish). >> >> > >of cource you could do: > >exprs(eset) <- 2^exprs(eset) >eset = vsn(eset) > But the output of vsn is - at least approximately - on the natural log scale. >exprs(eset) <- log2(exprs(eset)) > Therefore, to obtain log2-values, I would rather do exprs(eset) <- log2(exp(exprs(eset))), or exprs(eset) <- log2(exp(1))*exprs(eset) Best, Anja -- Dr. Anja von Heydebreck Max Planck Institute for Molecular Genetics Dept. Computational Molecular Biology Ihnestr. 73 14195 Berlin, Germany heydebre@molgen.mpg.de phone: +49-30-8413-1168 http://www.molgen.mpg.de/~heydebre fax: +49-30-8413-1152
ADD REPLY
0
Entering edit mode
Dear all, I am working in R a program for cross validation. I have a vector of 20 observations and I want to find all the possible combinations (20,4) and store them as data. I wonder if someone can help me. Thanks in advance. Makis ---------------------- E Motakis, Mathematics E.Motakis@bristol.ac.uk
ADD REPLY
0
Entering edit mode
wrong place for this question, should go to r-help@r-project.org anyway, i am not aware any R-function that gives all subsets of size k. i will use matlab to create it and input it back to R if you don't want to stick with matlab for the rest. if you have some time to spare, you can create your own function by implement the algorithms that generates all subsets of size k. a good source for such algorithms is http://www.cs.sunysb.edu/~algorith/ good luck Kenny Kenny Ye Assistant Professor Department of Applied Math and Statistics SUNY at Stony Brook Stony Brook, New York 11794-3600 Phone (631)632-9344 Fax (631)632-8490 On Mon, 9 Feb 2004, E Motakis, Mathematics wrote: > Dear all, > > I am working in R a program for cross validation. I have a vector of 20 > observations and I want to find all the possible combinations (20,4) and > store them as data. I wonder if someone can help me. > > Thanks in advance. > Makis > > ---------------------- > E Motakis, Mathematics > E.Motakis@bristol.ac.uk > > _______________________________________________ > Bioconductor mailing list > Bioconductor@stat.math.ethz.ch > https://www.stat.math.ethz.ch/mailman/listinfo/bioconductor >
ADD REPLY
0
Entering edit mode
Hi E, have a look at the function "nchoosek" from package vsn. Best wishes Wolfgang E Motakis, Mathematics wrote: > Dear all, > > I am working in R a program for cross validation. I have a vector of 20 > observations and I want to find all the possible combinations (20,4) and > store them as data. I wonder if someone can help me. > > Thanks in advance. > Makis > > ---------------------- > E Motakis, Mathematics > E.Motakis@bristol.ac.uk > > _______________________________________________ > Bioconductor mailing list > Bioconductor@stat.math.ethz.ch > https://www.stat.math.ethz.ch/mailman/listinfo/bioconductor -- ------------------------------------- Wolfgang Huber Division of Molecular Genome Analysis German Cancer Research Center Heidelberg, Germany Phone: +49 6221 424709 Fax: +49 6221 42524709 Http: www.dkfz.de/abt0840/whuber
ADD REPLY

Login before adding your answer.

Traffic: 860 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6