I have aligned some transcriptome reads on a custom genome made of hg38 supplemented by viral genomes. Unfortunately, in some downstream analyses (using CAGEr), the reads aligning on the viral sequences are ignored because the chromosome name is not in `seqnames(BSgenome.Hsapiens.UCSC.hg38)`. I am thinking of adding the viral sequences to the BSgenome object at run time, but did not find how. Can somebody give me a hint ?
Many thanks ! After studying the vignette and the source code, I ended up forging a package with the following Rmarkdown template. Comments from you and others are welcome!
Minor edit: added ".1" in HPV reference file names for consistency with our internal projects. To readers finding this through search engines, please note that K02718 and X05015 were downloaded from DDBJ for convenience of the example, but updated HPV16 and 18 sequences are available in PaVE.
Glad to see you discovered you could replace the seed file with an ordinary list. Nicely done!
H.