Finding all samples with specific gene
1
0
Entering edit mode
weil485 • 0
@weil485-13180
Last seen 6.9 years ago

In regards to using GEO, is it possible to find all samples/series using a specific gene? Better yet would it be possible to find all samples/series with a specific gene and a specific range of p values?

 

Just asking if something like that would be feasible but if anyone knows a way to do it, any information would be appreciated. Thanks in advance.

geo2r lima geoquery • 658 views
ADD COMMENT
0
Entering edit mode
@james-w-macdonald-5106
Last seen 17 hours ago
United States

It's hypothetically feasible, to a certain extent. The GEOmetadb package will allow you to find all arrays by species. Given that list you could use GEOquery to download the GPL file for each of those arrays and search for a gene of interest (or if you really want to hang with the cool kids, you could dump the GPL data into a SQLite DB, so you can make arbitrary searches in future).

Once you know all the GPL IDs that contain the gene of interest, you could use the gse_gpl table in GEOmeta.db to get all the series that are on the arrays you care about, and then use gse_gsm to map to the actual samples.

You won't be able to do anything with a range of p-values, as GEO is intended as a repository of data, not results. Submissions only contain information about the underlying experiment and the data, but no statistics.

ADD COMMENT

Login before adding your answer.

Traffic: 831 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6