Question: R biomaRt : listDatasets issue
mamunbabu2001 wrote, 4 months ago:

Hi All,
I am struggling to connect to ENSEMBL 93 using the R biomaRt package.
listDatasets() returns 68 datasets instead of the 98 listed on the example page, and “hsapiens_gene_ensembl” is not included in the list.


> library("biomaRt")
> listMarts()

================

$               biomart               version
    1 ENSEMBL_MART_ENSEMBL      Ensembl Genes 93
    2   ENSEMBL_MART_MOUSE      Mouse strains 93
    3     ENSEMBL_MART_SNP  Ensembl Variation 93
    4 ENSEMBL_MART_FUNCGEN Ensembl Regulation 93

>  ensembl=useMart("ensembl")
> dim(listDatasets(ensembl))
    [1] 68  3
     

> ensembl = useMart("ensembl",dataset="hsapiens_gene_ensembl")


When I try to connect using the dataset name, I get the following error:

$ Error in checkDataset(dataset = dataset, mart = mart) :
  The given dataset:  hsapiens_gene_ensembl , is not valid.  Correct dataset names can be obtained with the listDatasets() function.

Thanks in advance for help and suggestions.

Mamun

 

 

My R session details

==================

R version 3.4.1 (2017-06-30)
Platform: x86_64-apple-darwin15.6.0 (64-bit)
Running under: OS X El Capitan 10.11.6

locale:

[1] en_GB.UTF-8/en_GB.UTF-8/en_GB.UTF-8/C/en_GB.UTF-8/en_GB.UTF-8

attached base packages:
[1] parallel  stats4    stats     graphics  grDevices utils     datasets
[8] methods   base     

other attached packages:
[1] BiocInstaller_1.28.0 biomaRt_2.34.2       annotate_1.56.2     
[4] XML_3.98-1.13        AnnotationDbi_1.40.0 IRanges_2.12.0      
[7] S4Vectors_0.16.0     Biobase_2.38.0       BiocGenerics_0.24.0

loaded via a namespace (and not attached):
 [1] Rcpp_0.12.18    digest_0.6.15   bitops_1.0-6    xtable_1.8-2   
 [5] DBI_1.0.0       RSQLite_2.1.1   pillar_1.3.0    rlang_0.2.1    
 [9] blob_1.1.1      tools_3.4.1     bit64_0.9-7     RCurl_1.95-4.11
[13] bit_1.1-14      compiler_3.4.1  memoise_1.1.0   tibble_1.4.2  


This is a known issue on Ensembl's end. I had similar problems last week and was told that they had been resolved. Have you tried a different Ensembl host?

human <- useMart(biomart = "ensembl", dataset = "hsapiens_gene_ensembl", host = "http://useast.ensembl.org/")

Works for me
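
If one host misbehaves, the same call can be retried against several hosts in turn. The following is only a rough sketch: `connect_first_available` is a hypothetical helper, not part of biomaRt, and `http://www.ensembl.org/` is assumed here as a second host to try (only the useast host is taken from the call above).

```r
# Hypothetical helper (not part of biomaRt): try each host in turn and
# return the first connection that succeeds.
connect_first_available <- function(hosts,
                                    dataset = "hsapiens_gene_ensembl",
                                    connect = function(host, dataset) {
                                      # Default: a real biomaRt connection.
                                      biomaRt::useMart(biomart = "ensembl",
                                                       dataset = dataset,
                                                       host = host)
                                    }) {
  for (host in hosts) {
    mart <- tryCatch(
      connect(host, dataset),
      error = function(e) {
        # Report the failure and fall through to the next host.
        message("Failed on ", host, ": ", conditionMessage(e))
        NULL
      }
    )
    if (!is.null(mart)) return(mart)
  }
  stop("No host accepted dataset '", dataset, "'")
}

# Usage (needs biomaRt and network access, so it is left commented out;
# the second host is an assumption):
# human <- connect_first_available(c("http://useast.ensembl.org/",
#                                    "http://www.ensembl.org/"))
```

The `connect` argument exists only so the retry logic can be exercised without a network connection.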

written 4 months ago by stephen.williams
Mike Smith (EMBL Heidelberg / de.NBI) wrote, 4 months ago:

There was an issue with biomaRt that manifested when Ensembl release 91 introduced datasets containing apostrophes, e.g. "Ma's Night Monkey", which would lead to the error you are seeing. See A: biomaRt: drerio_gene_ensembl dataset missing or https://www.biostars.org/p/289654/#289861 for more details.
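
To illustrate how a stray apostrophe can make rows of a tab-separated listing disappear, here is a small base R sketch. It rests on an assumption: the post does not say how biomaRt parsed the listing, and the rows below are made-up stand-ins rather than real BioMart output. With R's default quote set (`"` and `'`), an apostrophe opens a quoted field that can run across line breaks, so several lines may be merged into one record; with quoting disabled, every line is counted on its own.

```r
# Made-up stand-in for a dataset listing: name <TAB> description.
listing <- c(
  "dataset_a\tMa's night monkey genes",
  "dataset_b\tHuman genes",
  "dataset_c\tAnother species' genes",
  "dataset_d\tMouse genes"
)

# Count the fields on each line under a given set of quote characters.
count_with <- function(quote) {
  con <- textConnection(listing)
  on.exit(close(con))
  count.fields(con, sep = "\t", quote = quote)
}

# Default quote set: the apostrophes are treated as quote characters.
naive_counts <- count_with("\"'")

# Quoting disabled: apostrophes are ordinary characters.
safe_counts <- count_with("")

print(naive_counts)
print(safe_counts)
```

Lines that `count.fields` considers part of a multi-line quoted field are typically reported as `NA`, which is the kind of merging that would shrink a parsed table.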

You are currently using old versions of both R and biomaRt. I would suggest updating both; in particular, you will need biomaRt version 2.34.1 or newer to handle this correctly.
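
A quick way to check an installation against that minimum is sketched below. `meets_minimum` is a hypothetical helper introduced only for illustration; the 2.34.1 threshold is the one given above.

```r
# Minimum biomaRt version named above.
required <- package_version("2.34.1")

# Hypothetical helper: TRUE when a version meets the minimum.
meets_minimum <- function(installed, minimum = required) {
  package_version(as.character(installed)) >= minimum
}

# Check the installed copy, if there is one.
if (requireNamespace("biomaRt", quietly = TRUE)) {
  installed <- utils::packageVersion("biomaRt")
  if (meets_minimum(installed)) {
    message("biomaRt ", installed, " meets the minimum of ", required)
  } else {
    message("biomaRt ", installed, " is older than ", required, "; consider updating")
  }
} else {
  message("biomaRt is not installed")
}
```

Comparing `package_version` objects rather than raw strings avoids lexical surprises such as "2.4" sorting after "2.34".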
