Using TCGAbiolinks 2.5.9.
It seems that the GDC API might have changed something because
GDCquery("TCGA-BRCA")
Returns:
o GDCquery: Searching in GDC database
Genome of reference: hg38 Warning: 40 parsing failures. row # A tibble: 5 x 5 col row col expected actual file expected actual 1 1 8 columns 71 columns '[https://gdc-api.nci.nih.gov/projects?size=1000&format=tsv](https://gdc-api.nci.nih.gov/projects?size=1000&format=tsv)' file 2 2 8 columns 71 columns '[https://gdc-api.nci.nih.gov/projects?size=1000&format=tsv](https://gdc-api.nci.nih.gov/projects?size=1000&format=tsv)' row 3 3 8 columns 71 columns '[https://gdc-api.nci.nih.gov/projects?size=1000&format=tsv](https://gdc-api.nci.nih.gov/projects?size=1000&format=tsv)' col 4 4 8 columns 71 columns '[https://gdc-api.nci.nih.gov/projects?size=1000&format=tsv](https://gdc-api.nci.nih.gov/projects?size=1000&format=tsv)' expected 5 5 8 columns 71 columns '[https://gdc-api.nci.nih.gov/projects?size=1000&format=tsv](https://gdc-api.nci.nih.gov/projects?size=1000&format=tsv)' ... ................. ... .............................................................................................. ........ .............................................................................................. ...... .................. [... truncated]
|disease_type.2 |disease_type.5 |disease_type.4 |disease_type.6 |
|:---------------------------|:-----------------------------------|:--------------------|:----------------------------|
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|Thymic Epithelial Neoplasms |Complex Mixed and Stromal Neoplasms |Basal Cell Neoplasms |Ductal and Lobular Neoplasms |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
|NA |NA |NA |NA |
Error in checkProjectInput(project) : Please set a valid project argument from the column id above. Project TCGA-BRCA was not found.
In addition: Warning messages:
1: Unnamed col_types should have the same length as `col_names`. Using smaller of the two.
2: In rbind(names(probs), probs_f) : number of columns of result is not a multiple of vector length (arg 1)
3: Unknown or uninitialised column: 'project_id'.
4: Unknown or uninitialised column: 'project_id'.
It seems the problem is in the function
getGDCprojects
in file R/internal.R.Simplifying the code in
getGDCprojects
to download json only seems to solve the issue: