BLAST-like alignment of a DNAString to a BSgenome?
1
0
Entering edit mode
@charles-plessy-7857
Last seen 13 months ago
Japan

Hello,

I am looking for a way to align a sequence stored in a DNAString object to a BSgenome object with similar speed as with BLAST or long-read aligners. The sequence might be matching a repeated region multiple times, and checking for that is the reason why I would like to do the alignment. Since I find the sequence of interest during an interactive R session, I would prefer to use Bioconductor functions to run the alignment.

I found the previous question https://support.bioconductor.org/p/53668/ which suggested that there was only matchPattern available, which more precise and therefore limited to cases with at most ~15 mismatches. I have not found alternative, but if I missed something, please let me know !

-- Charles

Alignment Genome • 1.2k views
ADD COMMENT
0
Entering edit mode
@herve-pages-1542
Last seen 13 hours ago
Seattle, WA, United States

Hi Charles,

One alternative I'm aware of is annotate::blastSequences() but I've no experience with it.

Best,

H.

ADD COMMENT
0
Entering edit mode

Thanks Hervé. annotate::blastSequences() sends jobs to the NCBI server, so that is not exactly what I was looking for. I searched again and found that the CNEr package had some functions (CNEr::blatCNE and CNEr::lastz) that utilise the 2bit extdata of BSgenome packages. The functions are devoted to a specific use case, so I can not use them directly, but it may be a good start.

ADD REPLY

Login before adding your answer.

Traffic: 722 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6