GOstats - defining the gene universe
1
0
Entering edit mode
@rachael-mcbride-2414
Last seen 11.2 years ago
Hi, I have a quick question that I can't seem to find an answer to by searching the BioC lists. I want to use GOstats on a gene list. I've read the vignette and understand that defining the gene universe is an important step. The vignette outlines various non-specific filtering steps that can be done on an expression set in order to define the gene universe. My question is are the non-specific filtering steps done on a normalized or un-normalized expression set. Thanks, Rachael
GOstats GOstats • 1.4k views
ADD COMMENT
0
Entering edit mode
@james-w-macdonald-5106
Last seen 8 hours ago
United States
Hi Rachel, Rachael McBride wrote: > Hi, > > I have a quick question that I can't seem to find an answer to by > searching the BioC lists. I want to use GOstats on a gene list. I've > read the vignette and understand that defining the gene universe is an > important step. The vignette outlines various non-specific filtering > steps that can be done on an expression set in order to define the gene > universe. My question is are the non-specific filtering steps done on a > normalized or un-normalized expression set. You would almost always want to use normalized expression data. The vignette actually includes some steps that by all rights would have occurred earlier in the analysis (namely the part where low-variance genes are removed). Usually the analysis proceeds something like this: Preprocess - normalize, background correct, etc. Filter 'uninteresting' genes to reduce multiplicity Make comparisons Do hypergeometric on the sets from the comparison step. In this case the universe you would start with would be the data you used to make the comparisons, which already lacks the genes you filtered out because they were uninteresting by some measure. At this point you simply want to remove any duplicates, genes lacking Entrez Gene IDs, and genes lacking GO terms. Best, Jim > > Thanks, > Rachael > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor -- James W. MacDonald, M.S. Biostatistician Affymetrix and cDNA Microarray Core University of Michigan Cancer Center 1500 E. Medical Center Drive 7410 CCGC Ann Arbor MI 48109 734-647-5623
ADD COMMENT
0
Entering edit mode
James W. MacDonald wrote: > Hi Rachel, > > Rachael McBride wrote: >> Hi, >> >> I have a quick question that I can't seem to find an answer to by >> searching the BioC lists. I want to use GOstats on a gene list. I've >> read the vignette and understand that defining the gene universe is an >> important step. The vignette outlines various non-specific filtering >> steps that can be done on an expression set in order to define the >> gene universe. My question is are the non-specific filtering steps >> done on a normalized or un-normalized expression set. > > You would almost always want to use normalized expression data. > > The vignette actually includes some steps that by all rights would have > occurred earlier in the analysis (namely the part where low-variance > genes are removed). > > Usually the analysis proceeds something like this: > > Preprocess - normalize, background correct, etc. > Filter 'uninteresting' genes to reduce multiplicity > Make comparisons > Do hypergeometric on the sets from the comparison step. > > In this case the universe you would start with would be the data you > used to make the comparisons, which already lacks the genes you filtered > out because they were uninteresting by some measure. At this point you > simply want to remove any duplicates, genes lacking Entrez Gene IDs, and > genes lacking GO terms. > > Best, > > Jim > > > >> >> Thanks, >> Rachael >> >> _______________________________________________ >> Bioconductor mailing list >> Bioconductor at stat.math.ethz.ch >> https://stat.ethz.ch/mailman/listinfo/bioconductor >> Search the archives: >> http://news.gmane.org/gmane.science.biology.informatics.conductor > Hi Jim, Thanks for the clarification, Rachael.
ADD REPLY

Login before adding your answer.

Traffic: 861 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6