residuals.MArrayLM

0

Entering edit mode

Gordon Smyth 52k

@gordon-smyth

Last seen 3 hours ago

WEHI, Melbourne, Australia

Dear Tiandao, There is no good way to plot residuals for microarray experiments that I know of, so I am not at all sure what you have in mind here. You cannot sensibly do one big plot of all the residuals because the data is so heteroscedastic across genes, as well as being dependent. The residuals() method for MArrayLM objects doesn't work when ndups>1. Do you want residuals before or after averaging over dups? If the later, you might try MAav <- avedups(MA, ndups=4) fitav <- lmFit(MAav, design) res <- residuals(fitav, MAav) Best wishes Gordon >Date: Sat, 13 Oct 2007 14:55:24 -0500 (CDT) >From: Tiandao Li <tiandao.li at="" usm.edu=""> >Subject: [BioC] residuals.MArrayLM >To: bioconductor at stat.math.ethz.ch >Message-ID: <pine.lnx.4.64.0710131419210.30552 at="" orca.st.usm.edu=""> >Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed > >Dear List, > >I am using limma to analyze 2-color microarray data. After reading data >from gpr files, normalize with print-tip loess, then build linear model to >find the differentially expressed gene list. However, before jump to any >conclusions, I want to check the mode first, whether the linear model is a >good fit of the real data. The common check is residual plot to valid the >model assumption. However, one residuals from stats package and one >residuals.MArrayLM from limma package, basically the usages are kind of >similar, residuals(object,y,..). Now I would like to extract residuals >from model. I searched the BioC and R archives, and also google the web, I >knew how to extract residuals using lm from R, from there I knew how to >work around to extract residuals from limma model. However, there must be >a easy way to exract the residuals from limma linear model. > >Please forgive my simple question, any comments are welcome! > >Best wishes, > >Tiandao > >MA <- normalizeWithinArrays(RG) ># correlation between duplicates >design <- modelMatrix(targets,ref="REF") >corfit <- duplicateCorrelation(MA,design,ndups=4) >fit <- lmFit(MA,design,ndups=4,correlation=corfit$consensus,method="ls") > > > sessionInfo() >R version 2.5.1 (2007-06-27) >i386-pc-mingw32 > >locale: >LC_COLLATE=English_United States.1252;LC_CTYPE=English_United >States.1252;LC_MONETARY=English_United >States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252 > >attached base packages: >[1] "stats" "graphics" "grDevices" "utils" "datasets" "methods" >[7] "base" > >other attached packages: > MASS statmod limma >"7.2-34" "1.3.0" "2.10.5"

Microarray limma Microarray limma • 2.7k views

ADD COMMENT • link updated 17.1 years ago by Tiandao Li ▴ 260 • written 17.1 years ago by Gordon Smyth 52k

0

Entering edit mode

Tiandao Li ▴ 260

@tiandao-li-2372

Last seen 10.2 years ago

Dear Dr. Symth, Thank you so much for your valuable help. This is what I was thinking of. May I ask one more question since I searched for days, how do we objectively assess lmFit result(s) using least squares or robust methods? Regards, Tiandao On Mon, 15 Oct 2007, Gordon Smyth wrote: Dear Tiandao, There is no good way to plot residuals for microarray experiments that I know of, so I am not at all sure what you have in mind here. You cannot sensibly do one big plot of all the residuals because the data is so heteroscedastic across genes, as well as being dependent. The residuals() method for MArrayLM objects doesn't work when ndups>1. Do you want residuals before or after averaging over dups? If the later, you might try MAav <- avedups(MA, ndups=4) fitav <- lmFit(MAav, design) res <- residuals(fitav, MAav) Best wishes Gordon >Date: Sat, 13 Oct 2007 14:55:24 -0500 (CDT) >From: Tiandao Li <tiandao.li at="" usm.edu=""> >Subject: [BioC] residuals.MArrayLM >To: bioconductor at stat.math.ethz.ch >Message-ID: <pine.lnx.4.64.0710131419210.30552 at="" orca.st.usm.edu=""> >Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed > >Dear List, > >I am using limma to analyze 2-color microarray data. After reading data >from gpr files, normalize with print-tip loess, then build linear model to >find the differentially expressed gene list. However, before jump to any >conclusions, I want to check the mode first, whether the linear model is a >good fit of the real data. The common check is residual plot to valid the >model assumption. However, one residuals from stats package and one >residuals.MArrayLM from limma package, basically the usages are kind of >similar, residuals(object,y,..). Now I would like to extract residuals >from model. I searched the BioC and R archives, and also google the web, I >knew how to extract residuals using lm from R, from there I knew how to >work around to extract residuals from limma model. However, there must be >a easy way to exract the residuals from limma linear model. > >Please forgive my simple question, any comments are welcome! > >Best wishes, > >Tiandao > >MA <- normalizeWithinArrays(RG) ># correlation between duplicates >design <- modelMatrix(targets,ref="REF") >corfit <- duplicateCorrelation(MA,design,ndups=4) >fit <- lmFit(MA,design,ndups=4,correlation=corfit$consensus,method="ls") > > > sessionInfo() >R version 2.5.1 (2007-06-27) >i386-pc-mingw32 > >locale: >LC_COLLATE=English_United States.1252;LC_CTYPE=English_United >States.1252;LC_MONETARY=English_United >States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252 > >attached base packages: >[1] "stats" "graphics" "grDevices" "utils" "datasets" "methods" >[7] "base" > >other attached packages: > MASS statmod limma >"7.2-34" "1.3.0" "2.10.5" _______________________________________________ Bioconductor mailing list Bioconductor at stat.math.ethz.ch https://stat.ethz.ch/mailman/listinfo/bioconductor Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor

ADD COMMENT • link 17.1 years ago Tiandao Li ▴ 260

0

Entering edit mode

At 05:36 PM 15/10/2007, Tiandao Li wrote: >Dear Dr. Symth, Smyth! >Thank you so much for your valuable help. This is what I was thinking of. >May I ask one more question since I searched for days, how do we >objectively assess lmFit result(s) using least squares or robust methods? I don't understand your question, or rather, it could mean a lot of things. Can you please try to ask more specifically what it is that you want to know. Best wishes Gordon >Regards, >Tiandao

ADD REPLY • link 17.1 years ago Gordon Smyth 52k

0

Entering edit mode

Dear Dr. Smyth, Double check. Last time it was my typo. :-) My question is: from a statistical point, how do we know the linear model generated by lmFit is a good analogy of the real data? Tiandao On Tue, 16 Oct 2007, Gordon Smyth wrote: At 05:36 PM 15/10/2007, Tiandao Li wrote: >Dear Dr. Symth, Smyth! >Thank you so much for your valuable help. This is what I was thinking of. >May I ask one more question since I searched for days, how do we >objectively assess lmFit result(s) using least squares or robust methods? I don't understand your question, or rather, it could mean a lot of things. Can you please try to ask more specifically what it is that you want to know. Best wishes Gordon >Regards, >Tiandao _______________________________________________ Bioconductor mailing list Bioconductor at stat.math.ethz.ch https://stat.ethz.ch/mailman/listinfo/bioconductor Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor

ADD REPLY • link 17.1 years ago Tiandao Li ▴ 260

0

Entering edit mode

Dear Tiandao, I don't have a neat answer to that question. A model always tries to be good enough for the purpose, and you have to very precise about what the purpose is before you can even properly address the question. For example, the p-values may still behave well for ranking genes even if the data are far from normal. It isn't possible to fully put a model to the test on the basis of an individual microarray data set. There are many research questions here. I personally think that with microarray data it is most fruitful to examine data quality at the array level, rather than attempting an microarray equivalent of the sort of residual analysis you might do for a univariate regression problem. Best wishes Gordon At 12:43 PM 16/10/2007, Tiandao Li wrote: >Dear Dr. Smyth, > >Double check. Last time it was my typo. :-) > >My question is: from a statistical point, how do we know the linear model >generated by lmFit is a good analogy of the real data? > >Tiandao > > >On Tue, 16 Oct 2007, Gordon Smyth wrote: > >At 05:36 PM 15/10/2007, Tiandao Li wrote: > >Dear Dr. Symth, > >Smyth! > > >Thank you so much for your valuable help. This is what I was thinking of. > >May I ask one more question since I searched for days, how do we > >objectively assess lmFit result(s) using least squares or robust methods? > >I don't understand your question, or rather, it could mean a lot of >things. Can you please try to ask more specifically what it is that >you want to know. > >Best wishes >Gordon > > >Regards, > >Tiandao

ADD REPLY • link 17.1 years ago Gordon Smyth 52k

Login before adding your answer.