> e.coli <- readAAStringSet(filepath = "uniprot-proteome_UP000000625.fasta") > summary(e.coli) Length Class Mode 4306 AAStringSet S4 > some.peptide <- AAString("DYWRALQNRIREGHVEDVYAYRRRQ") > summary(some.peptide) Length Class Mode 25 AAString S4 > x <- matchPDict(pdict = some.peptide, subject = e.coli) Error in matchPDict(pdict = some.peptide, subject = e.coli) : please use vmatchPDict() when 'subject' is an XStringSet object (multiple sequence) > x <- vmatchPDict(pdict = some.peptide, subject = e.coli) Error in .local(pdict, subject, max.mismatch, min.mismatch, with.indels, : vmatchPDict() is not ready yet, sorry
What am I doing wrong? I would like to match a list of peptide sequences (here as AAStringSet) vs. a list of Proteins (also AAStringSet).
Greetings,
Tobi
I followed your suggestion to use a for loop:
But the performance is not really great. For 10 queries I already need 4.3 s. That means for 20k queries it will be around 140 min !!! Is this the best I can get from AAStrings match functions?