BLAST-like alignment of a DNAString to a BSgenome?
1
0
Entering edit mode
@charles-plessy-7857
Last seen 4 hours ago
Japan

Hello,

I am looking for a way to align a sequence stored in a DNAString object to a BSgenome object with similar speed as with BLAST or long-read aligners. The sequence might be matching a repeated region multiple times, and checking for that is the reason why I would like to do the alignment. Since I find the sequence of interest during an interactive R session, I would prefer to use Bioconductor functions to run the alignment.

I found the previous question https://support.bioconductor.org/p/53668/ which suggested that there was only matchPattern available, which more precise and therefore limited to cases with at most ~15 mismatches. I have not found alternative, but if I missed something, please let me know !

-- Charles

Alignment Genome • 125 views
0
Entering edit mode
@herve-pages-1542
Last seen 3 days ago
Seattle, WA, United States

Hi Charles,

One alternative I'm aware of is annotate::blastSequences() but I've no experience with it.

Best,

H.

0
Entering edit mode

Thanks Hervé. annotate::blastSequences() sends jobs to the NCBI server, so that is not exactly what I was looking for. I searched again and found that the CNEr package had some functions (CNEr::blatCNE and CNEr::lastz) that utilise the 2bit extdata of BSgenome packages. The functions are devoted to a specific use case, so I can not use them directly, but it may be a good start.