Retrieving data from NCBI GEO Problem and RNA-Seq Data Analysis
2
1
Entering edit mode
hkarakurt ▴ 20
@hkarakurt-12988
Last seen 8 weeks ago
Turkey

Hello, I am new at RNA-Seq data analysis and I want to analyze the data and do some analyses such as finding differentially expressed genes. My data set is from NCBI GEO and coded as GSE80336. Link is here:

https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE80336

I see data is not normalized. I used getGEO() command. When I used exprs() command, I have this message:

Error in (function (classes, fdef, mtable)  :
unable to find an inherited method for function ‘exprs’ for signature ‘"list"’

What is the real problem here? How can I find non-normalized expression matrix?

I also wanted to RAW data but there is not any RAW data file in supplementary section.

How can I reach RAW data, normalize it and find significantly changed genes. I really need help I am stuck.

Also there is a file called "Counts.txt" in supplementary. What is this file actually and can I use it?

Thank you.

rnaseq rna-seq limma geoquery edger • 2.6k views
1
Entering edit mode
Aaron Lun ★ 27k
@alun
Last seen 6 hours ago
The city by the bay
2. Apply DE analysis workflows like https://bioconductor.org/help/workflows/RnaSeqGeneEdgeRQL/.
1
Entering edit mode
@sean-davis-490
Last seen 1 day ago
United States

Echoing the answer by Aaron, here is the code to download all supplemental files for a given GEO accession.

library(GEOquery)
sfiles = getGEOSuppFiles('GSE80336')
fnames = rownames(sfiles)
# there is only one supplemental file

0
Entering edit mode

Yes there is only one supplement file and it is the Counts file. Mostly there are RAW_data files in supplemental files but this data is different. Probably I will use counts data for analyses.