Entering edit mode
reubenmcgregor88
•
0
@reubenmcgregor88-13722
Last seen 3.8 years ago
I have recently carried out a stipple sequence alignment using DECIPHER package in R. The code was fairly simple as follows:
bod_aa <- readAAStringSet("BOD_aa.fasta")
aligned_aa <- AlignSeqs(bod_aa)
BrowseSeqs(aligned_aa)
This worked nicely but I would like some information on the alignment, specifically the percent homology between the different proteins? If possible percent homology to one of the sequences in particular. I.e. I really want to align all the sequences to one "query sequence" and get a percent homology to that query sequence.
Thanks
Just to clarify, do you mean that you would like to get a distance matrix?
And then look at the distances to a specific sequence (e.g., #1)?
If the distance Matrix looks like this:
BOD1L2 BOD1L1 BOD1
0.8895349 0.9459459 0.0000000
Does that mean BOD1L1 is 94% homologous to BOD1 and BOD1L2 is 88% homologous to BOD1? Or something similar? Sorry for simplicity of questions
The outputs of DistanceMatrix, including examples, are described in:
Generally, it is a matrix containing the distance between each pair of input sequences. The distance is 1 - the similarity (fraction of sites that are identical).
Homology is not well defined in this context, but I assume you mean similarity.
Ah ok, I will go back to reference material and try and understand more,
Thanks