median polish vs mas/PLIER info

0

Entering edit mode

Anthony Bosco ▴ 500

@anthony-bosco-517

Last seen 9.6 years ago

Just some info on PLIER (info from webtalk available on Affy website) incase you are interested. I have got hold of a beta test version of new GREX software (any Affymetrix customer can get hold of this through Affy distributors), and new GREX software gives several options. 1) Standard mas5 algorithm 2) Quantile normalisation 3) bg subtraction, none, percentage background, or MM estimates bg. 4) PP-MM, PM only, or MM treated as PM values. 5) Modelling of probe affinities and target concentration using M-estimator (PLIER bit) With PLIER, M vs A plots are much cleaner at low end in comparison to mas5, but they are still more noisy than rma plots. I am currently looking at comparing qRT-PCR results with the different algorithms to see which algortihm most accurately predicts the actual gene expression level (since rma doesn't adjust for probe affinities I don't expect rma to gove the best correlation with PCR but not sure yet). I can send some M vs A plots with my own data just for interest if you like. Regards Anthony X-Original-To: anthonyb@ichr.uwa.edu.au Delivered-To: anthonyb@ichr.uwa.edu.au Date: Mon, 17 May 2004 12:25:13 -0400 From: "James MacDonald" <jmacdon@med.umich.edu> To: <bioconductor@stat.math.ethz.ch>, <naomi@stat.psu.edu> Subject: Re: [BioC] median polish vs mas X-Virus-Scanned: by amavisd-new Cc: wliu@hes.hmc.psu.edu X-BeenThere: bioconductor@stat.math.ethz.ch X-Mailman-Version: 2.1.4 List-Id: The Bioconductor Project Mailing List <bioconductor.stat.math.ethz.ch> List-Unsubscribe: <https: www.stat.math.ethz.ch="" mailman="" listinfo="" bioconductor="">, <mailto:bioconductor- request@stat.math.ethz.ch?subject="unsubscribe"> List-Archive: <https: www.stat.math.ethz.ch="" pipermail="" bioconductor=""> List-Post: <mailto:bioconductor@stat.math.ethz.ch> List-Help: <mailto:bioconductor- request@stat.math.ethz.ch?subject="help"> List-Subscribe: <https: www.stat.math.ethz.ch="" mailman="" listinfo="" bioconductor="">, <mailto:bioconductor- request@stat.math.ethz.ch?subject="subscribe"> Sender: bioconductor-bounces@stat.math.ethz.ch X-Spam-Checker-Version: SpamAssassin 2.63 (2004-01-11) on poohbear.ichr.uwa.edu.au X-Spam-Status: No, hits=-2.8 required=5.0 tests=AWL,BAYES_10 autolearn=no version=2.63 X-Spam-Level: Dear Naomi, I think we are talking about two different things here. Your question appears to be whether or not rma is a reasonable method for computing expression values, and you appear not to distinguish between justRMA and rma. My statement is directed towards the purpose of justRMA. To answer your question, I personally like rma, and I am not convinced that there is any over-normalization occuring by doing a quantile normalization followed by medianpolish. I have tried pretty much everything out there, and I have yet to find a method for computing expression values that I think does a better job in general use. This is based primarily on how well a given method works with the affy spike- in and GeneLogic dilution data sets (I have had arguments with other statisticians who think that rma only works as well as it does with these data sets because it has been specifically 'tuned' for them. If so, my hat is off to Rafael and Ben for their ability to come up with an algorithm that can magically pick the 16 spiked-in genes out of the other 18,000 or so other genes...). For a variety of reasons, not the least of which is the fact that rma 'beats' most other methods, rma has sort of become the canonical method for computing expression values for Affy data. It has been implemented in other non-BioC packages such as GeneSpring, etc, and although I haven't seen anything concrete, I would bet dollars to donuts that the Affy PLIER algorithm is simply rma by another name. I think this is why your reviewer wants to know why you are doing quantile normalization followed by Tukey's biweight instead of what he/she would consider to be the 'usual' method. Now to the point I was originally trying to make. One of the problems that people encounter with rma is the fact that you first have to create an AffyBatch with all of your chips, and then compute expression values which are stored in an exprSet. This can take a huge amount of RAM, and people with maybe 512 Mb of RAM (which is plenty for the vast majority of things you will ever do on a computer) were running out of memory with a relatively small number of chips. Rafael noted that a modification could be made to rma that would use much less memory, and with his help I wrote the original justRMA. This function was designed for one purpose only; to allow people with less RAM to be able to do rma. The decision to use medianpolish wasn't arbitrary at all; justRMA is designed to give the exact same results as rma (which of course uses medianpolish to compute expression values), so by default I had to use medianpolish. Best, Jim James W. MacDonald Affymetrix and cDNA Microarray Core University of Michigan Cancer Center 1500 E. Medical Center Drive 7410 CCGC Ann Arbor MI 48109 734-647-5623 >>> Naomi Altman <naomi@stat.psu.edu> 05/17/04 11:33AM >>> Dear Jim, The reason I ask is that I have been using expresso with "mas". But I recently had a paper returned with the comment that median polish was "known to be better". If so, I should probably use it. The reviewer appears to have based his/her remarks on the fact (mentioned in the review) that median polish is the "default". If the decision to use median polish in justRMA was arbitrary, I would like to know this, since I am currently in the process of redoing all of the statistical analyses and tables in the paper (which is pretty time-consuming). The main reason we are redoing everything, rather than defending our decision to use "mas" is that I certainly have no evidence that Tukey's biweight is "better" except for the heuristic about over-normalization, and I figured in the long run we will have fewer arguments with reviewers if we use the default. I should not have said that median polish is the "default" in justRMA, since it is the only method available, but I do think that its use in justRMA is an endorsement meaning that anyone doing anything besides Affy-type MAS5 or justRMA or justGCRMA (if this is available) is going to be asked to justify what they are doing with more stringency. --Naomi At 10:49 AM 5/17/2004, James MacDonald wrote: The default (and only) option for justRMA is medianpolish because justRMA is designed to *just* do *RMA*, which is a quantile normalization followed by medianpolish. The only reason justRMA exists is to allow people with less RAM to be able to do rma. If you think a quantile normalization followed by Tukey's biweight will do better than rma, you can certainly do that using the expresso() function. Best, Jim James W. MacDonald Affymetrix and cDNA Microarray Core University of Michigan Cancer Center 1500 E. Medical Center Drive 7410 CCGC Ann Arbor MI 48109 734-647-5623 >>> Naomi Altman <naomi@stat.psu.edu> 05/17/04 10:15AM >>> I have been wondering why the default in justRMA is summary.method="medianpolish" instead of "mas" which is Tukey's biweight. Since we are already doing quantile normalization, doesn't the extra between array step imposed by median polish give the possibility of masking differential expression? Naomi S. Altman 814-865-3791 (voice) Associate Professor Bioinformatics Consulting Center Dept. of Statistics 814-863-7114 (fax) Penn State University 814-865-1348 (Statistics) University Park, PA 16802-2111 _______________________________________________ Bioconductor mailing list Bioconductor@stat.math.ethz.ch https://www.stat.math.ethz.ch/mailman/listinfo/bioconductor _______________________________________________ Bioconductor mailing list Bioconductor@stat.math.ethz.ch https://www.stat.math.ethz.ch/mailman/listinfo/bioconductor Naomi S. Altman 814-865-3791 (voice) Associate Professor Bioinformatics Consulting Center Dept. of Statistics 814-863-7114 (fax) Penn State University 814-865-1348 (Statistics) University Park, PA 16802-2111 _______________________________________________ Bioconductor mailing list Bioconductor@stat.math.ethz.ch https://www.stat.math.ethz.ch/mailman/listinfo/bioconductor -- ______________________________________________ Anthony Bosco - PhD Student Institute for Child Health Research (Company Limited by Guarantee ACN 009 278 755) Subiaco, Western Australia, 6008 Ph 61 8 9489 , Fax 61 8 9489 7700 email anthonyb@ichr.uwa.edu.au

Microarray Normalization Cancer probe affy GeneSpring PROcess plier Microarray Cancer • 1.1k views

ADD COMMENT • link 19.9 years ago Anthony Bosco ▴ 500

Login before adding your answer.