Hi
I've just started using TCGAbiolinks
package and it seems there's a bug in the getGDCprojects
function. I cannot retrieve the
projects from TCGA. When I type getGDCprojects()
, I get:
Warning: 40 parsing failures. row # A tibble: 5 x 5 col row col expected actual expected <int> <chr> <chr> <chr> actual 1 1 <NA> 8 columns 71 columns file 2 2 <NA> 8 columns 71 columns row 3 3 <NA> 8 columns 71 columns col 4 4 <NA> 8 columns 71 columns expected 5 5 <NA> 8 columns 71 columns actual # ... with 1 more variables: file <chr> ... ................. ... .................................. ........ .................................. ...... .................................. .... .................................. ... .................................. ... .................................. ........ .................................. ...... ....................................... See problems(...) for more details. # A tibble: 40 x 8 disease_type.1 disease_type.0 disease_type.3 <chr> <chr> <chr> 1 <NA> Uveal Melanoma <NA> 2 <NA> Stomach Adenocarcinoma <NA> 3 <NA> Lung Squamous Cell Carcinoma <NA> 4 <NA> Uterine Corpus Endometrial Carcinoma <NA> 5 <NA> Cholangiocarcinoma <NA> 6 <NA> Ovarian Serous Cystadenocarcinoma <NA> 7 <NA> Adrenocortical Carcinoma <NA> 8 <NA> Mesothelioma <NA> 9 <NA> Acute Myeloid Leukemia <NA> 10 <NA> Brain Lower Grade Glioma <NA> # ... with 30 more rows, and 5 more variables: disease_type.2 <chr>, # disease_type.5 <chr>, disease_type.4 <chr>, disease_type.7 <chr>, # disease_type.6 <chr> Warning messages: 1: Unnamed `col_types` should have the same length as `col_names`. Using smaller of the two. 2: In rbind(names(probs), probs_f) : number of columns of result is not a multiple of vector length (arg 1) 3: Unknown or uninitialised column: 'project_id'.
EDIT: I've found out getGDCprojects
requires readr
package to be loaded (it should be loaded automatically). Furthermore, the line containing:
projects <- read_tsv("https://gdc-api.nci.nih.gov/projects?size=1000&format=tsv", col_types = "cccccccc")
only works if the col_types
argument is removed.
Please, which version of the package are you using? I don't get that with the last version.
You can also use: `biocLite('BioinformaticsFMRP/TCGAbiolinks')`