Question: R biomaRt : listDatasets issue
gravatar for mamunbabu2001
10 months ago by
mamunbabu20010 wrote:

Hi All,
I struggling to connect to ENSEMBL 93 using R biomaRt package.
listData set returns 68 datasets instead of 98 listed the example page and “hsapiens_gene_ensembl” is not included in the list.

> library("biomaRt")
> listMarts()


$               biomart               version
    1 ENSEMBL_MART_ENSEMBL      Ensembl Genes 93
    2   ENSEMBL_MART_MOUSE      Mouse strains 93
    3     ENSEMBL_MART_SNP  Ensembl Variation 93
    4 ENSEMBL_MART_FUNCGEN Ensembl Regulation 93

>  ensembl=useMart("ensembl")
> dim(listDatasets(ensembl))

> dim(listDatasets(ensembl))
    [1] 68  3

> ensembl = useMart("ensembl",dataset="hsapiens_gene_ensembl")

And when I try to connect using the dataset name I get the following error.

$ Error in checkDataset(dataset = dataset, mart = mart) :
  The given dataset:  hsapiens_gene_ensembl , is not valid.  Correct dataset names can be obtained with the listDatasets() function.

Thanks in advance for help and suggestions.




My R session details


R version 3.4.1 (2017-06-30)
Platform: x86_64-apple-darwin15.6.0 (64-bit)
Running under: OS X El Capitan 10.11.6


[1] en_GB.UTF-8/en_GB.UTF-8/en_GB.UTF-8/C/en_GB.UTF-8/en_GB.UTF-8

attached base packages:
[1] parallel  stats4    stats     graphics  grDevices utils     datasets
[8] methods   base     

other attached packages:
[1] BiocInstaller_1.28.0 biomaRt_2.34.2       annotate_1.56.2     
[4] XML_3.98-1.13        AnnotationDbi_1.40.0 IRanges_2.12.0      
[7] S4Vectors_0.16.0     Biobase_2.38.0       BiocGenerics_0.24.0

loaded via a namespace (and not attached):
 [1] Rcpp_0.12.18    digest_0.6.15   bitops_1.0-6    xtable_1.8-2   
 [5] DBI_1.0.0       RSQLite_2.1.1   pillar_1.3.0    rlang_0.2.1    
 [9] blob_1.1.1      tools_3.4.1     bit64_0.9-7     RCurl_1.95-4.11
[13] bit_1.1-14      compiler_3.4.1  memoise_1.1.0   tibble_1.4.2  

biomart • 204 views
ADD COMMENTlink modified 10 months ago by Mike Smith3.7k • written 10 months ago by mamunbabu20010

This is a known issue on Ensembl's end. I had similar issues last week and was told that they resolved the issue. Have you tried a different ensembl host?

human <- useMart(biomart="ensembl", dataset="hsapiens_gene_ensembl",host = "" )

Works for me

ADD REPLYlink modified 10 months ago • written 10 months ago by stephen.williams10
Answer: R biomaRt : listDatasets issue
gravatar for Mike Smith
10 months ago by
Mike Smith3.7k
EMBL Heidelberg / de.NBI
Mike Smith3.7k wrote:

There was an issue with biomaRt that manifested when Ensembl release 91 introduced datasets with apostrophes in e.g. "Ma's Night Monkey" which would lead to the error you were seeing. See A: biomaRt: drerio_gene_ensembl dataset missing or for more details.

You are currently using old versions of both R and biomaRt. I would suggest updating both, in particular you will need biomaRt version 2.34.1 or newer to handle this correctly.

ADD COMMENTlink written 10 months ago by Mike Smith3.7k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 319 users visited in the last hour