Add Cannabis sativa genome cs10 to list of BSgenomes
20519572 • 0
@772c3666
Last seen 2.0 years ago
Australia

Hi, I am pretty new to R and I am having trouble adding the Cannabis sativa cs10 genome to the list of BSgenomes. Is there a way to do this?
bsgenomes • 587 views
@herve-pages-1542
Last seen 7 hours ago
Seattle, WA, United States

Hi,

Basically you need to follow the instructions in the "How to forge a BSgenome data package" vignette linked at https://bioconductor.org/packages/BSgenome
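
To give a rough idea of what the vignette asks for, here is a minimal sketch of a seed file written with base R. The field names follow the seed file format as I understand it from the vignette, so check them against the version you have installed. Every value is an assumption for illustration: the package name, the object name, the paths, the file name and the placeholder date and URL are hypothetical and must be replaced with your own.

```r
# Sketch of a BSgenome seed file (DCF format) for cs10.
# All values are illustrative assumptions, not an official seed.
seed <- data.frame(
  Package            = "BSgenome.Csativa.NCBI.cs10",   # hypothetical name
  Title              = "Full genome sequences for Cannabis sativa (NCBI cs10)",
  Description        = "Cannabis sativa genome as provided by NCBI (cs10).",
  Version            = "1.0.0",
  organism           = "Cannabis sativa",
  common_name        = "Hemp",                         # my choice of label
  genome             = "cs10",
  provider           = "NCBI",
  release_date       = "YYYY/MM",                      # placeholder, fill in
  source_url         = "<URL of the cs10 assembly>",   # placeholder, fill in
  organism_biocview  = "Cannabis_sativa",
  BSgenomeObjname    = "Csativa",                      # hypothetical name
  seqnames           = "as.character(1:6)",            # only the six shown by head()
  circ_seqs          = "character(0)",
  seqs_srcdir        = "/path/to/cs10",                # hypothetical path
  seqfile_name       = "cs10.2bit",                    # hypothetical file
  stringsAsFactors   = FALSE
)

# Write the seed file to a temporary location and read it back to check it.
seed_path <- file.path(tempdir(), "BSgenome.Csativa.NCBI.cs10-seed")
write.dcf(seed, file = seed_path)
seed_check <- read.dcf(seed_path)
print(seed_check[1, c("Package", "genome", "provider")])
```

Once the seed file and the sequence file are in place, the vignette describes how to pass the seed file to the forging function and then build and install the resulting package.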

Also you might want to take a look at this issue on GitHub where I provide more specific details about how to forge a BSgenome data package for an NCBI assembly (your case). Note that I registered the cs10 assembly in the GenomeInfoDb package today (GenomeInfoDb 1.31.7, see here), which should facilitate the forging process:

> library(GenomeInfoDb)
> cs10 <- getChromInfoFromNCBI("cs10")
> head(cs10)
  SequenceName       SequenceRole AssignedMolecule GenBankAccn Relationship
1            1 assembled-molecule                1  LR213628.1            =
2            2 assembled-molecule                2  LR213632.1            =
3            3 assembled-molecule                3  LR213629.1            =
4            4 assembled-molecule                4  LR213630.1            =
5            5 assembled-molecule                5  LR213631.1            =
6            6 assembled-molecule                6  LR213634.1            =
   RefSeqAccn     AssemblyUnit SequenceLength UCSCStyleName circular
1 NC_044371.1 Primary Assembly      101209240          <NA>    FALSE
2 NC_044375.1 Primary Assembly       96346938          <NA>    FALSE
3 NC_044372.1 Primary Assembly       94670641          <NA>    FALSE
4 NC_044373.1 Primary Assembly       91913879          <NA>    FALSE
5 NC_044374.1 Primary Assembly       88181582          <NA>    FALSE
6 NC_044377.1 Primary Assembly       79335105          <NA>    FALSE
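
The columns above are the ones you will need when filling in the seed file (sequence names, lengths, circularity). As a small sketch of how to pull them out, the snippet below rebuilds only the six rows printed above by hand so that it runs without a network connection. In practice you would use the full data frame returned by getChromInfoFromNCBI("cs10") instead. The variable names are mine.

```r
# Rebuild the six rows shown by head(cs10); values copied from the output above.
cs10_head <- data.frame(
  SequenceName   = as.character(1:6),
  SequenceRole   = "assembled-molecule",
  RefSeqAccn     = c("NC_044371.1", "NC_044375.1", "NC_044372.1",
                     "NC_044373.1", "NC_044374.1", "NC_044377.1"),
  SequenceLength = c(101209240, 96346938, 94670641,
                     91913879, 88181582, 79335105),
  circular       = FALSE,
  stringsAsFactors = FALSE
)

# Keep the assembled molecules only.
chroms <- cs10_head[cs10_head$SequenceRole == "assembled-molecule", ]

# Named vector of sequence lengths, keyed by sequence name.
seqlens <- setNames(chroms$SequenceLength, chroms$SequenceName)

# Names of circular sequences (none among these six rows).
circ <- chroms$SequenceName[chroms$circular]

# Lookup from RefSeq accession (as used in the NCBI FASTA headers) to sequence name.
accn2name <- setNames(chroms$SequenceName, chroms$RefSeqAccn)

print(seqlens)
print(accn2name)
```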

GenomeInfoDb 1.31.7 will become available in BioC 3.15 via BiocManager::install() in the next couple of days. BioC 3.15 is the current devel version of Bioconductor and it requires R 4.2 (current R devel).
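
If you are not sure whether your installation is recent enough, a quick check along these lines should tell you. This is only a sketch: the variable names are mine, and it simply reports FALSE when GenomeInfoDb is not installed at all.

```r
# Minimum GenomeInfoDb version in which cs10 is registered (see above).
required <- package_version("1.31.7")

# TRUE only if GenomeInfoDb is installed and at least that version.
has_cs10_support <- requireNamespace("GenomeInfoDb", quietly = TRUE) &&
  utils::packageVersion("GenomeInfoDb") >= required

if (has_cs10_support) {
  message("GenomeInfoDb is recent enough to know about cs10.")
} else {
  message("GenomeInfoDb is missing or older than ", as.character(required), ".")
}
```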

Let us know how it goes and don't hesitate to ask again if you have specific questions about the process (in which case you'd need to provide the details of what you've done, what error you're getting, and share your seed file).

Hope this helps,

H.
