Hi,
I'm supposed to do reciprocal best hit using blastp for a long list of proteins for which I'm given only the UniProtKB/TrEMBL identifier. I have to find the ortholog proteins for each of the given proteins. I also have to find orthologs in a given list of species.
Previously, when I had only one Protein, I would do it through online tool in NCBI, there I can give the identifier and also limit the species. However, with more proteins in hand, this is not feasible anymore.
I downloaded Blast, but I cannot download any databases (memory problem, and I need refseq_protein database).
I'm familiar with R and would like to do it in R, but sending online queries to the NCBI blastp.
I searched a lot and was not able to find hint.
I installed orthologr, biomaRt and some other packages, but they only used local databases.
I would appreciate it a lot if somebody could help me how I can send online query through R. I have an R code and somewhere in the code I need to call blastp to find orthologs of my proteins among the given list of species.
Thanks and looking forward.
Maah

Thanks a lot. Is there anyway to give NCBI accession number (eg: ''O48946") as an input instead of the sequence?
Use the accession number(s) as the query
> res = blastSequences("O48946", "nr", program="blastp", as="XML") > sapply(res["//Hit_id"], xmlValue) [1] "gi|15236786|ref|NP_194967.1|" "gi|297798722|ref|XP_002867245.1|" [3] "gi|565449881|ref|XP_006285557.1|" "gi|727512843|ref|XP_010432725.1|" [5] "gi|727549104|ref|XP_010447394.1|" "gi|567217696|ref|XP_006412477.1|" [7] "gi|674237085|gb|KFK29850.1|" "gi|922532521|ref|XP_013597875.1|" [9] "gi|674960897|emb|CDX72357.1|" "gi|674919795|emb|CDY13484.1|" > res[["//BlastOutput_query-ID/text()"]] gi|73917709|sp|O48946.1|CESA1_ARATH > res[["//BlastOutput_query-ID/text()"]] gi|73917709|sp|O48946.1|CESA1_ARATH