How to get sequence homology percent from a multiple sequence alignment in DECIPHER?
0
0
Entering edit mode
@reubenmcgregor88-13722
Last seen 6 months ago

I have recently carried out a stipple sequence alignment using DECIPHER package in R. The code was fairly simple as follows:

aligned_aa <- AlignSeqs(bod_aa)

BrowseSeqs(aligned_aa)

This worked nicely but I would like some information on the alignment, specifically the percent homology between the different proteins? If possible percent homology to one of the sequences in particular. I.e. I really want to align all the sequences to one "query sequence" and get a percent homology to that query sequence.

Thanks

decipher r protein alignment • 519 views
1
Entering edit mode

Just to clarify, do you mean that you would like to get a distance matrix?

d <- DistanceMatrix(bod_aa)

And then look at the distances to a specific sequence (e.g., #1)?

d[1,]
0
Entering edit mode

If the distance Matrix looks like this:

BOD1L2    BOD1L1      BOD1
0.8895349 0.9459459 0.0000000

Does that mean BOD1L1 is 94% homologous to BOD1 and BOD1L2 is 88% homologous to BOD1? Or something similar? Sorry for simplicity of questions

1
Entering edit mode

The outputs of DistanceMatrix, including examples, are described in:

?DistanceMatrix

Generally, it is a matrix containing the distance between each pair of input sequences.  The distance is 1 - the similarity (fraction of sites that are identical).

Homology is not well defined in this context, but I assume you mean similarity.

0
Entering edit mode

Ah ok, I will go back to reference material and try and understand more,

Thanks