library VaraintFiltration looking for explanation of pedigree file
2
0
Entering edit mode
@claushultschig-13796
Last seen 5.6 years ago

Hello,

I would like to use the VariantFiltration library of Bioconductor of reduce the number of SNP identified for the phenotype of our interest. We are in the fortunate situation to analyze related individuals, but we have different phenotypes (more than absent & present). I would like to use a pedigree file to describe the relationship for use in VariantFiltration.

I tried to understand the example file CEUtrio.ped. In my download this file does not have a header

$head CEUtrio.ped
FX-1800 NA12891 0 0 2 1
FX-1800 NA12892 0 0 1 1
FX-1800 NA12878 NA12892 NA12891 2 2

It is clear the first column is the family ID, the 2nd col the id of the individual, the third col the id of the father, the 4th col the ID of the mother and the 5th col the sex (I guess: 1= male, 2=female). Probably the 6th col the describes the phenotype.

However, I am not clear on the code. I did find different phenotypes for the this CEU Trio with google. In some examples (like the above?) the daughter seems to have a phenotype - in other examples (e.g.  she did not.

Could someone please explain me the code used for describing the phenotype?

I hope I did not miss in my search not any relevant hits - in case I did, I apologize for this post.

Looking forward to a supporting reply.

Thanks & best wishes

Claus

bioconductor variantfiltering • 1.4k views
ADD COMMENT
0
Entering edit mode
Robert Castelo ★ 3.3k
@rcastelo
Last seen 2 days ago
Barcelona/Universitat Pompeu Fabra

hi Claus,

the example data used in the vignette of the VariantFiltering package is an excerpt from three individuals sequenced as part of the 1000 Genomes Project. They are not in principle affected by any disease and in the PED file we put the given sex but we made up the phenotype column for pure illustrative purposes. I guess in other places were this trio is used, you may find the phenotype column unset or set in a different way.

cheers,

robert.

ADD COMMENT
0
Entering edit mode
@claushultschig-13796
Last seen 5.6 years ago

Dear Robert,

Many thanks for your reply. May I ask for the encoding of the phenotype? Can I annotate for multiple phenotypes? And if yes, how do I do this?

Looking forward to your reply

Best,

Claus
 

ADD COMMENT
0
Entering edit mode

Dear Claus,

you're probably not aware, but your question now is written in the space for answers to your first question. you should have written it using the 'ADD COMMENT' link below my answer. this helps to keep some structure in the conversation and quickly identify relevant answers.

regarding you specific question, although this 6th column is commonly known as the phenotype column it just encodes the so-called "affection status", which means that you only code whether the individual is "affected" or not by the disease, you may use -9 o 0 when this affection status is missing. if you do a google search for "PED file format" you may find pointers to documentation about this format, one of then being this one.

if you want to understand more deeply how VariantFiltering filters with the different inheritance models, which is when this phenotype column comes into play, you may consult the corresponding unit tests from the source code in

VariantFiltering/inst/unitTests/test_inheritance-models.R

cheers,

robert.

ADD REPLY

Login before adding your answer.

Traffic: 860 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6