Question: TCGAbiolinks gives error when parsing a txt file (instead of expected xml)
gravatar for andre.verissimo
5 weeks ago by
andre.verissimo0 wrote:

I was trying to download clinical data from the TCGA-KIRC project and it is failing.

I found out it was downloading a TXT file and trying to read it as an XML (id: 64a1b6e7-d037-4502-bbad-0d07849fc32e and file: nationwidechildrens.org_clinical_nte_kirc.txt


>   gdc$clinical        <- GDCprepare_clinic(query$clinical, = 'patient')
  |==                                                                                                          |   2%Error in doc_parse_file(con, encoding = encoding, as_html = as_html, options = options) : 
  Start tag expected, '<' not found [4]

The code to replicate is below, using bioconductor 3.7 and TCGAbiolinks 2.8.1

project <- 'TCGA-KIRC'
query <- list()
query$clinical <- GDCquery(project = project,
                             data.category = "Clinical")

download.out <- GDCdownload(query$clinical, method = 'api')

gdc <- list()
gdc$clinical        <- GDCprepare_clinic(query$clinical, = 'patient')
ADD COMMENTlink modified 5 weeks ago • written 5 weeks ago by andre.verissimo0

I believe the same question is asked in biostars:

ADD REPLYlink written 5 weeks ago by andre.verissimo0
gravatar for tiagochst
5 weeks ago by
Brazil - University of São Paulo/ Los Angeles - Cedars-Sinai Medical Center
tiagochst130 wrote:

Hello I fixed the documentation yesterday.

It seems the parsed TXT were added to the same group as the XML files.

You need to add file.type = "xml" as filter.

 query <- GDCquery(project = 'TCGA-KIRC', data.category = "Clinical",file.type = "xml")

ADD COMMENTlink written 5 weeks ago by tiagochst130
Thanks for the reply, I've been trying to test confirm to mark this question as resolved, but it continues to say that `GDC server down, try to use this package later`. I will comment back when it allows.
ADD REPLYlink written 4 weeks ago by andre.verissimo0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.2.0
Traffic: 240 users visited in the last hour