Submitting BSgenome package for a new assembly
0
0
Entering edit mode
Manish • 0
@0821d9fe
Last seen 9 months ago
Germany

Hi,

Is it possible to upload BSgenome/GenomicFeatures packages for new genome assemblies of non-model species?

I assembled these genomes and had to create the BSgenome package as it was required for a different tool that I wanted to use. Interestingly, no annotation data was able for this species, which makes me think that it would be helpful for researches working with this species to have these resources.

In case, it is possible to share these resources through bioconductor, could you please share the guidelines/documentation? If it is not possible to share these annotations through bioconductor, then are there other standard strategies for sharing these resources?

Thanks in advance for the help.

Best Manish

bsgenome GenomicFeatures package • 685 views
ADD COMMENT
0
Entering edit mode

Hi Manish,

What assembly is this about? Is it on NCBI? Almost all of the 100+ BSgenome data packages included in Bioconductor were made internally and are maintained by us. With very few exceptions where the package is an external contribution.

FWIW the process of forging a BSgenome data package has been greatly simplified in BioC 3.17. In particular, for an NCBI or UCSC assembly, there's no need to prepare a "seed file" anymore. The new tools are in the new BSgenomeForge package. This should make it easy enough for people to forge a BSgenone data package for their non-model NCBI or UCSC organism.

Best,

H.

ADD REPLY
0
Entering edit mode

Hi Herve,

The assembly is for apricot (Prunus armeniaca). It is not on NCBI yet but will be uploaded there soon. Indeed the process for generating the BSgenome package was easy and I could do that.

I am more curious about the 'sharing' aspect. Based on your reply, am I correct to assume that the recommendation is that users who want to use this assembly generate the BSgenome data package themselves (and individually) using the NCBI/UCSC assembly?

Best Manish

ADD REPLY
1
Entering edit mode

Once the assembly is on NCBI it will be trivial for anybody who wants to use it in Bioconductor to make the BSgenome data package with BSgenomeForge::forgeBSgenomeDataPkgFromNCBI(). It is so easy that there's not much value in adding the package to the Bioconductor repositories.

Note that we would consider doing so if there was something special about it, e.g. the package is hard to make and/or there's a lot of demand for it, but none of this seems to apply in the case of Prunus armeniaca. But hey, I love apricots ;-)

Cheers,

H.

ADD REPLY
0
Entering edit mode

Thanks for the info. This is very helpful. -Manish

ADD REPLY

Login before adding your answer.

Traffic: 788 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6