Hi,
I am attempting to download data but instead get a file with the extension '.tar.gz'. However, this is not a compressed file. It is a text file with the words:
{
"message": "internal server error"
}
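One way to confirm that the download is a text response and not a real archive is to look at its first two bytes: a gzip file always starts with 0x1f 0x8b. The following is only a minimal sketch in base R. The helper name is_gzip and the file name download.tar.gz are placeholders for illustration and are not part of TCGAbiolinks.

```r
# Hypothetical helper (not a TCGAbiolinks function):
# returns TRUE if the file starts with the gzip magic bytes 0x1f 0x8b.
is_gzip <- function(path) {
  magic <- readBin(path, what = "raw", n = 2)
  length(magic) == 2 && identical(magic, as.raw(c(0x1f, 0x8b)))
}

# Placeholder file name; substitute the archive name reported by GDCdownload.
archive <- "download.tar.gz"

if (file.exists(archive)) {
  if (is_gzip(archive)) {
    message("The file looks like a real gzip archive.")
  } else {
    # Not gzip: print the first few lines, which may contain
    # a JSON error message returned by the server.
    cat(readLines(archive, n = 5, warn = FALSE), sep = "\n")
  }
}
```

If the check reports plain text, the problem is more likely on the server side of the request than in the decompression step.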
Below is the code I ran:
# GDCquery and GDCdownload come from the TCGAbiolinks package
library(TCGAbiolinks)

samples <- c("TCGA-BA-4074", "TCGA-BA-4075")
interesting.genes <- c("TP53", "PIK3CA", "FAT1")
query.exp <- GDCquery(project = "TCGA-HNSC",
                      legacy = TRUE,
                      data.category = "Gene expression",
                      data.type = "Gene expression quantification",
                      platform = "Illumina HiSeq",
                      file.type = "results",
                      experimental.strategy = "RNA-Seq",
                      barcode = samples)
GDCdownload(query.exp)
Thanks for your help
Hi,
I was able to run the code here. Could you please share your session info, and what query.exp$results shows?
Best regards,
Tiago
My output shows this
> GDCdownload(query.exp)
GDCdownload will download 2 files. A total of 3.031802 MB
Downloading as: Fri_Nov_11_09_35_27_2016.tar.gz
Downloading: 1.2 MB
[1] 1
> GDCprepare(query.exp)
|=======================================================================================================================================================| 100%
Downloading genome information. Using: Homo sapiens genes (GRCh37.p13)
Starting to add information to samples
 => Add clinical information to samples
 => Adding subtype information to samples
Subtype information from:doi:10.1038/nature14129
class: RangedSummarizedExperiment
dim: 20330 2
metadata(0):
assays(2): raw_count scaled_estimate
rownames(20330): A1BG|1 A1CF|29974 ... ZZEF1|23140 ZZZ3|26009
rowData names(4): gene_id entrezgene ensembl_gene_id transcript_id.transcript_id_TCGA-BA-4075-01A-01R-1436-07
colnames(2): TCGA-BA-4075-01A-01R-1436-07 TCGA-BA-4074-01A-01R-1436-07
colData names(62): sample patient ... subtype_Copy.Number subtype_PARADIGM
> data <- GDCprepare(query.exp)
|==================================================================| 100%
Downloading genome information. Using: Homo sapiens genes (GRCh37.p13)
Starting to add information to samples
 => Add clinical information to samples
 => Adding subtype information to samples
Subtype information from:doi:10.1038/nature14129
> query.exp$results
[[1]]
     center.code center.name                  center.short_name center.center_id                     center.namespace center.center_type data_type                      updated_datetime
1276 07          University of North Carolina UNC               ee7a85b3-8177-5d60-a10c-51180eb9009c unc.edu          CGCC               Gene expression quantification 2016-09-07T11:17:30.997957-05:00
1425 07          University of North Carolina UNC               ee7a85b3-8177-5d60-a10c-51180eb9009c unc.edu          CGCC               Gene expression quantification 2016-09-07T11:17:30.997957-05:00
     file_name                                                               md5sum                           data_format acl  access platform       state state_comment file_id                              data_category   file_size
1276 unc.edu.85034d8f-c10c-4db2-ade2-f26ea7cf2d95.1507611.rsem.genes.results 654b40396ed647c6ba22c3fbaf963b1b TXT         open open   Illumina HiSeq live  NA            9bbe732f-4592-4681-91ee-d9e00c88ef1c Gene expression 1508723
1425 unc.edu.78a8e33e-fd10-4dcd-b8fd-aad93db18c45.1484374.rsem.genes.results 2faf952fe332870fbec90dbe81b96b2b TXT         open open   Illumina HiSeq live  NA            20a606f9-2aef-489b-a1d6-0044533e96ff Gene expression 1523079
     cases                        submitter_id type tags                    experimental_strategy tissue.definition
1276 TCGA-BA-4075-01A-01R-1436-07 NA           file v2, unnormalized, gene  RNA-Seq               Primary solid Tumor
1425 TCGA-BA-4074-01A-01R-1436-07 NA           file v2, unnormalized, gene  RNA-Seq               Primary solid Tumor