GEOmetadb package is missing "Organism" metadata field in all GSE accession IDs
1
1
Entering edit mode
@bohdan-khomtchouk-9617
Last seen 6.5 years ago
Stanford University

Hello, I ran into a bug in GEOmetadb:


library(GEOmetadb)
if(!file.exists('GEOmetadb.sqlite')) getSQLiteFile()
con <- dbConnect(SQLite(),'GEOmetadb.sqlite')

dbListFields(con,'gse')
##  [1] "ID"                   "title"             
##  [3] "gse"                  "status"            
##  [5] "submission_date"      "last_update_date"  
##  [7] "pubmed_id"            "summary"           
##  [9] "type"                 "contributor"       
## [11] "web_link"             "overall_design"    
## [13] "repeats"              "repeats_sample_list"
## [15] "variable"             "variable_description"
## [17] "contact"              "supplementary_file"

Clearly, there should also be an "Organism" metadata field, e.g.: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE4032

Clearly, there should also be an "Organism" metadata field, e.g.: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE4032

Clearly, there should also be an "Organism" metadata field, e.g.: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE4032

Clearly, there should also be an "Organism" metadata field, e.g.: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE4032

Clearly, there should also be an "Organism" metadata field, e.g.: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE4032

Please advise.  Thank you!

geometadb • 1.2k views
ADD COMMENT
1
Entering edit mode
@vincent-j-carey-jr-4
Last seen 7 weeks ago
United States

This was cross-posted to bioc-devel.  The answer that I provided there was:

You can get the information through the merged tables:

> dbListTables(con)
 [1] "gds"               "gds_subset"        "geoConvert"       
 [4] "geodb_column_desc" "gpl"               "gse"              
 [7] "gse_gpl"           "gse_gsm"           "gsm"              
[10] "metaInfo"          "sMatrix"          
> cdesc = dbReadTable(con, "geodb_column_desc")
> poten = grep("gani", cdesc[,2])
> cdesc[poten,]
   TableName         FieldName
26       gpl          organism
49       gsm      organism_ch1
57       gsm      organism_ch2
77       gds platform_organism
80       gds   sample_organism
                                                  Description
26 [Values separated by ';tab', if more than one] Organism(s)
49 organism(s) from which the biological material was derived
57                                       same contents as ch1
77                                   organism of the platform
80                                    organism of the sample
ADD COMMENT

Login before adding your answer.

Traffic: 579 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6