Entering edit mode
Hi,
I'm trying to blast a large list of short sequences (~1500 sequences, each 14 nucleotides long) using the blastSequences function from the annotate R package, and get number of hits found in the human genome + transcript DB.
The function returns a list of 0 elements even though using the web-blast (nucleotide blast suite: https://blast.ncbi.nlm.nih.gov/Blast.cgi) returns many hits for the same sequences.
Here is one example where I get 4 hits online, but none using this function:
>blastSequences(x="CTGTCGGTACGCTA") estimated response time 18 seconds list() > sessionInfo() R version 3.3.2 (2016-10-31) Platform: x86_64-apple-darwin13.4.0 (64-bit) Running under: macOS Sierra 10.12.6 locale: [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8 attached base packages: [1] stats4 parallel stats graphics grDevices utils datasets [8] methods base other attached packages: [1] annotate_1.52.1 XML_3.98-1.5 AnnotationDbi_1.36.2 [4] Biobase_2.34.0 rBLAST_0.99.1 devtools_1.13.3 [7] Biostrings_2.42.1 XVector_0.14.0 IRanges_2.8.1 [10] S4Vectors_0.12.1 BiocGenerics_0.20.0 BiocInstaller_1.24.0 [13] gtools_3.5.0 pracma_2.0.7 reshape2_1.4.2 [16] ggplot2_2.2.1 loaded via a namespace (and not attached): [1] Rcpp_0.12.9 git2r_0.19.0 plyr_1.8.4 bitops_1.0-6 [5] tools_3.3.2 zlibbioc_1.20.0 digest_0.6.12 memoise_1.0.0 [9] RSQLite_1.1-2 tibble_1.2 gtable_0.2.0 DBI_0.5-1 [13] curl_2.3 withr_2.0.0 stringr_1.2.0 httr_1.2.1 [17] grid_3.3.2 R6_2.2.0 magrittr_1.5 scales_0.4.1 [21] assertthat_0.1 colorspace_1.3-2 xtable_1.8-2 labeling_0.3 [25] quadprog_1.5-5 stringi_1.1.2 RCurl_1.95-4.8 lazyeval_0.2.0 [29] munsell_0.4.3
Any help would be greatly appreciated.