Hello,
I am looking for a way to align a sequence stored in a DNAString
object to a BSgenome
object with similar speed as with BLAST or long-read aligners. The sequence might be matching a repeated region multiple times, and checking for that is the reason why I would like to do the alignment. Since I find the sequence of interest during an interactive R session, I would prefer to use Bioconductor functions to run the alignment.
I found the previous question https://support.bioconductor.org/p/53668/ which suggested that there was only matchPattern
available, which more precise and therefore limited to cases with at most ~15 mismatches. I have not found alternative, but if I missed something, please let me know !
-- Charles
Thanks Hervé.
annotate::blastSequences()
sends jobs to the NCBI server, so that is not exactly what I was looking for. I searched again and found that the CNEr package had some functions (CNEr::blatCNE
andCNEr::lastz
) that utilise the 2bit extdata of BSgenome packages. The functions are devoted to a specific use case, so I can not use them directly, but it may be a good start.