Entering edit mode
How can I load in the data from GSE108497 into R?
I tried using GEOquery and the expression data is empty:
gse108497<- getGEO('GSE108497',GSEMatrix=T)
show(gse108497)
$GSE108497_series_matrix.txt.gz
ExpressionSet (storageMode: lockedEnvironment)
assayData: 0 features, 512 samples
element names: exprs
protocolData: none
phenoData
sampleNames: GSM2901826 GSM2901827 ... GSM2902337 (512 total)
varLabels: title geo_accession ... tp:ch1 (74 total)
varMetadata: labelDescription
featureData
featureNames:
fvarLabels: ID Species ... GB_ACC (30 total)
fvarMetadata: Column Description labelDescription
experimentData: use 'experimentData(object)'
Annotation: GPL10558
I've also tried using getGEOSuppFiles(), but could not find a solution. Thank you.
The submitters did not actually submit quantification results for the samples as part of the sample records. Therefore, GEOquery cannot load them automatically. You'll have to use getGEOSuppFiles() and then merge the raw data with the annotation data that you got with getGEO. Unfortunately, there is no standard approach for merging the supplied raw data with the metadata from GEOquery; each dataset will vary somewhat.
Thank you for your reply. I don't think the submitters submitted anything at all.
The supplementary files (if you want to look programmatically, see
getGEOSuppFiles
) contain what are described as "non-normalized" and "normalized" data that you can read using standard R tab-delimited text file readers. You'll then have to manipulate those data into a form that you can use in R/Bioconductor. But the data are submitted.When I read in "GSE108497_normalized_data.txt", I get this:
However, I can't find the file that will help me correspond the sample ID (i.e., GSMxxxxx to ILMN_xxxxxxx). Do you know where I can find this?
Note that R adds the "X" to the column names in the .txt file, but if you look at the "description", they match up with the exception of the "X". And, yes, you have to just poke around. This approach works for this dataset, but the next one will be different (columns, naming, etc).
Thank you so much!! I've been working on this for days. Much appreciated.