DEGraph demo exprData question
2
0
Entering edit mode
Dick Beyer ★ 1.4k
@dick-beyer-26
Last seen 10.2 years ago
Hi, Does anyone know how the values in the DEGraph demo exprData matrix were calculated? I have read the DEGraph documentation, as well as the source documents for the data (GSE6532), and I can't figure out how to go from the RMA values (range 0-16) to the values in the exprData matrix (-5 to 7.5 about). It doesn't seem to be z-scores. If anyone can help, I'd love to hear. Just trying to understand the details of DEGraph Thanks very much, Dick ********************************************************************** ********* Richard P. Beyer, Ph.D. University of Washington Tel.:(206) 616 7378 Env. & Occ. Health Sci. , Box 354695 Fax: (206) 685 4696 4225 Roosevelt Way NE, # 100 Seattle, WA 98105-6099 http://depts.washington.edu/ceeh/members_fc_bioinfo.html http://staff.washington.edu/~dbeyer
GO DEGraph GO DEGraph • 1.4k views
ADD COMMENT
0
Entering edit mode
@pierre-neuvial-4269
Last seen 10.2 years ago
Hi Dick, > > > Hi, > > Does anyone know how the values in the DEGraph demo exprData matrix were > calculated? ?I have read the DEGraph documentation, as well as the source > documents for the data (GSE6532), and I can't figure out how to go from the > RMA values (range 0-16) to the values in the exprData matrix (-5 to 7.5 > about). ?It doesn't seem to be z-scores. > > If anyone can help, I'd love to hear. ?Just trying to understand the details > of DEGraph Thanks for your interest in DEGraph. The (probe set) expression values from the demo were taken from the "LUMINAL.RData.gz" file in the GSE6532 GEO archive: ftp://ftp.ncbi.nih.gov/pub/geo/DATA/supplementary/series/GSE6532/LUMIN AL.RData.gz These files were processed by RMA, and *median-centered*, as explained in the corresponding README file (ftp://ftp.ncbi.nih.gov/pub/geo/DATA/supplementary/series/GSE6532/LUMI NAL_README.txt.gz): "RMA normalization was performed separately for each population ("series" in demo) and all the probesets were median centered." Note that 'population' refers to the fact that the data comes from three different institutions. This median-centering explains the range of expression values in "LUMINAL.RData", and therefore the range of the corresponding values in the demo data set of the DEGraph package. In case you'd like more information on what exactly is in DEGraph's 'exprData': the data in GSE6532 is at the *probe set* level and the data in DEGraph is at the *gene* level. We defined the expression level of a gene as the expression level of the probe set with largest alignment score among all probe sets mapping to this gene according to the annotation in GSE6532. When the largest alignment score was achieved by several probe sets, we took the median expression level of those probe sets. I've just posted the script we have been using to do this here: http://stat.genopole.cnrs.fr/_media/Members/pneuvial/Loi2008_geneLevel Data.R I hope this helps, Pierre > > Thanks very much, > Dick > ******************************************************************** *********** > Richard P. Beyer, Ph.D. University of Washington > Tel.:(206) 616 7378 ? ? Env. & Occ. Health Sci. , Box 354695 > Fax: (206) 685 4696 ? ? 4225 Roosevelt Way NE, # 100 > ? ? ? ? ? ? ? ? ? ? ? ?Seattle, WA 98105-6099 > http://depts.washington.edu/ceeh/members_fc_bioinfo.html > http://staff.washington.edu/~dbeyer > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor
ADD COMMENT
0
Entering edit mode
Dick Beyer ★ 1.4k
@dick-beyer-26
Last seen 10.2 years ago
Hi Pierre, Perfect! Thanks very much for your thorough explanation. Cheers, Dick ********************************************************************** ********* Richard P. Beyer, Ph.D. University of Washington Tel.:(206) 616 7378 Env. & Occ. Health Sci. , Box 354695 Fax: (206) 685 4696 4225 Roosevelt Way NE, # 100 Seattle, WA 98105-6099 http://depts.washington.edu/ceeh/members_fc_bioinfo.html http://staff.washington.edu/~dbeyer
ADD COMMENT
0
Entering edit mode
On 03/11/2011 07:48 AM, Dick Beyer wrote: > Hi Pierre, > > Perfect! Thanks very much for your thorough explanation. Probably a good idea to add the description to the appropriate man page in the package! Martin > > Cheers, > Dick > ******************************************************************** *********** > Richard P. Beyer, Ph.D. University of Washington > Tel.:(206) 616 7378 Env. & Occ. Health Sci. , Box 354695 > Fax: (206) 685 4696 4225 Roosevelt Way NE, # 100 > Seattle, WA 98105-6099 > http://depts.washington.edu/ceeh/members_fc_bioinfo.html > http://staff.washington.edu/~dbeyer > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor -- Computational Biology Fred Hutchinson Cancer Research Center 1100 Fairview Ave. N. PO Box 19024 Seattle, WA 98109 Location: M1-B861 Telephone: 206 667-2793
ADD REPLY
0
Entering edit mode
Hi Martin, 2011/3/11 Martin Morgan <mtmorgan@fhcrc.org> > > Probably a good idea to add the description to the appropriate man page > in the package! Martin > Pierre already updated the man page on our svn, it will be in the next bioC release. Best, Laurent -- Laurent Jacob Department of Statistics UC Berkeley http://cbio.ensmp.fr/~ljacob [[alternative HTML version deleted]]
ADD REPLY

Login before adding your answer.

Traffic: 581 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6