GEOquery and platforms splitting
2
0
Entering edit mode
@bosotti-roberta-nervianoms-1085
Last seen 9.6 years ago
An embedded and charset-unspecified text was scrubbed... Name: not available Url: https://stat.ethz.ch/pipermail/bioconductor/attachments/20060518/ 607b1533/attachment.pl
• 1.0k views
0
Entering edit mode
Peter ▴ 170
@peter-1556
Last seen 9.6 years ago
Bosotti, Roberta [Nervianoms] wrote: > Hi all, > > I downloaded a GSE file from GEO using GEOquery. The GSM file contain > two "platforms": GPL96 and GPL97. I need to make two separate exprset > from the two (the GSMs are not contiguous for the two platforms, but > are mixed). Do you have any suggestion on how I could make it? > > Thanks in advance, Roberta Interesting - what was the GSE number? That would be very helpful to try and reproduce the problem. Peter
ADD COMMENT
0
Entering edit mode
This is not a "problem" per se but several datasets can have data from multiple platforms. In fact, Sean Davis wrote in the vignette for GEOquery that "the GSE is the most confusing of the GEO entries ... [it] can represent an arbitrary number of samples run on a arbitrary number of platforms." One potential solution is to use the GEOquery package. It will automatically download GSE files and produce a list of GSM objects and a list of GPL objects. These can then be used to produce a datatable for a single exprset or to construct different datatables for each platform. I haven't tested this functionality extensively. From my limited testing, GEOquery works great with GDS files, which are hand-curated files. However I had mixed success with GSE files. So I would suggest that GEOquery is your best bet. -Christos -----Original Message----- From: bioconductor-bounces@stat.math.ethz.ch [mailto:bioconductor-bounces at stat.math.ethz.ch] On Behalf Of Peter (BioC) Sent: Thursday, May 18, 2006 10:42 AM To: bioconductor at stat.math.ethz.ch Subject: Re: [BioC] GEOquery and platforms splitting Bosotti, Roberta [Nervianoms] wrote: > Hi all, > > I downloaded a GSE file from GEO using GEOquery. The GSM file contain > two "platforms": GPL96 and GPL97. I need to make two separate exprset > from the two (the GSMs are not contiguous for the two platforms, but > are mixed). Do you have any suggestion on how I could make it? > > Thanks in advance, Roberta Interesting - what was the GSE number? That would be very helpful to try and reproduce the problem. Peter _______________________________________________ Bioconductor mailing list Bioconductor at stat.math.ethz.ch https://stat.ethz.ch/mailman/listinfo/bioconductor Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
ADD REPLY
0
Entering edit mode
On 5/18/06 1:23 PM, "Christos Hatzis" <christos at="" silicoinsights.com=""> wrote: > This is not a "problem" per se but several datasets can have data from > multiple platforms. In fact, Sean Davis wrote in the vignette for GEOquery > that "the GSE is the most confusing of the GEO entries ... [it] can > represent an arbitrary number of samples run on a arbitrary number of > platforms." > > One potential solution is to use the GEOquery package. It will > automatically download GSE files and produce a list of GSM objects and a > list of GPL objects. These can then be used to produce a datatable for a > single exprset or to construct different datatables for each platform. Hi, Roberta. Christos has already done a good job of explaining the situation--GEO does allow multiple platforms and their associated samples (GSMs) to be lumped together into a single GSE. In the GEOquery vignette, there is an example of converting a GSE to an exprSet. It shows how to do it with one platform in the GSE. Doing so for two platforms is the same except that you need to extract the GPLs individually. Also, you need to extract the GSMs associated with each platform separately. The GPLList(gse) function will get you the list of GPLs that you can operate on. The GSMList will get you a list of GSMs; to extract only those that are associated with a given platform, the example in the vignette shows how to extract the platform information from each GSM in the GSMList. You can use this information to pull out the GSMs associated with each GPL. I admit that this process is not a simple one-line command like for GDSs, but GEO series are by their very nature complicated and do not lend themselves very well to a "standard, one-size-fits-all" approach. Sean > -Christos > > -----Original Message----- > From: bioconductor-bounces at stat.math.ethz.ch > [mailto:bioconductor-bounces at stat.math.ethz.ch] On Behalf Of Peter (BioC) > Sent: Thursday, May 18, 2006 10:42 AM > To: bioconductor at stat.math.ethz.ch > Subject: Re: [BioC] GEOquery and platforms splitting > > Bosotti, Roberta [Nervianoms] wrote: >> Hi all, >> >> I downloaded a GSE file from GEO using GEOquery. The GSM file contain >> two "platforms": GPL96 and GPL97. I need to make two separate exprset >> from the two (the GSMs are not contiguous for the two platforms, but >> are mixed). Do you have any suggestion on how I could make it? >> >> Thanks in advance, Roberta > > Interesting - what was the GSE number? That would be very helpful to try > and reproduce the problem. > > Peter > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor
ADD REPLY
0
Entering edit mode
I agree with you .....i have used GEOquery for GSE files and it works fine.... You must pay attention when NCBI GEO changed the location of the GSE files! Sometimes it appens but Sean has changed GEOquery very rapidly (very good work). Best regards Dr Alberto Goldoni Medical Genetics Unit S. Orsola-Malpighi Hospital Via Massarenti n.9, Pad 11 40100 Bologna, Italy alberto.goldoni at eurogene.org www.eurogene.org www.lagem.it -----Messaggio originale----- Da: bioconductor-bounces at stat.math.ethz.ch [mailto:bioconductor-bounces at stat.math.ethz.ch] Per conto di Christos Hatzis Inviato: gioved? 18 maggio 2006 19.24 A: bioconductor at stat.math.ethz.ch Oggetto: Re: [BioC] GEOquery and platforms splitting This is not a "problem" per se but several datasets can have data from multiple platforms. In fact, Sean Davis wrote in the vignette for GEOquery that "the GSE is the most confusing of the GEO entries ... [it] can represent an arbitrary number of samples run on a arbitrary number of platforms." One potential solution is to use the GEOquery package. It will automatically download GSE files and produce a list of GSM objects and a list of GPL objects. These can then be used to produce a datatable for a single exprset or to construct different datatables for each platform. I haven't tested this functionality extensively. From my limited testing, GEOquery works great with GDS files, which are hand-curated files. However I had mixed success with GSE files. So I would suggest that GEOquery is your best bet. -Christos -----Original Message----- From: bioconductor-bounces@stat.math.ethz.ch [mailto:bioconductor-bounces at stat.math.ethz.ch] On Behalf Of Peter (BioC) Sent: Thursday, May 18, 2006 10:42 AM To: bioconductor at stat.math.ethz.ch Subject: Re: [BioC] GEOquery and platforms splitting Bosotti, Roberta [Nervianoms] wrote: > Hi all, > > I downloaded a GSE file from GEO using GEOquery. The GSM file contain > two "platforms": GPL96 and GPL97. I need to make two separate exprset > from the two (the GSMs are not contiguous for the two platforms, but > are mixed). Do you have any suggestion on how I could make it? > > Thanks in advance, Roberta Interesting - what was the GSE number? That would be very helpful to try and reproduce the problem. Peter _______________________________________________ Bioconductor mailing list Bioconductor at stat.math.ethz.ch https://stat.ethz.ch/mailman/listinfo/bioconductor Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor _______________________________________________ Bioconductor mailing list Bioconductor at stat.math.ethz.ch https://stat.ethz.ch/mailman/listinfo/bioconductor Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
ADD REPLY
0
Entering edit mode
@bosotti-roberta-nervianoms-1085
Last seen 9.6 years ago
An embedded and charset-unspecified text was scrubbed... Name: not available Url: https://stat.ethz.ch/pipermail/bioconductor/attachments/20060519/ 69e25575/attachment.pl

Login before adding your answer.

Traffic: 737 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6