exonsBy (ENSG -> gene id mapping)
1
0
Entering edit mode
atariw ▴ 10
@atariw-7670
Last seen 2.3 years ago
Italy

I downloaded the human Ensdb AH75011 and the Txdb AH75758.

When I call the "exonsBy(object , 'gene')" method on the Txdb object it automatically converts the ENSG to GeneID. It seems that around 20% of my ENSG ids cannot be converted in GeneIDs !

For example: the exons relative to ENSG00000198840 are not present in my Txdb object after having applied the exonsBy method (even if in Genecards it is mapped to EntrezGene 4537 ).

Am I missing something ?

Thx a lot,

annotation • 662 views
ADD COMMENT
0
Entering edit mode
@james-w-macdonald-5106
Last seen 53 minutes ago
United States

Not to be pedantic, but the mapping is from ENST to Gene ID, not ENSG.

Anyway, this is due to what UCSC has done with their knownGene table. In the past they had their own transcript IDs, and the knownGene table was based on a set of UCSC genes, and presumably those genes were based primarily on NCBI data.

But now they base the knownGene table on GENCODE genes, and therefore they use Ensembl transcript IDs. The mapping we use to generate the TxDb packages relies on data from the knownToLocusLink table (where LocusLink was what they called Entrez Gene back in the day, and what NCBI has now shortened to just the Gene database). And mapping Ensembl transcripts to NCBI Gene IDs is a non-trivial endeavor, as there are any number of disagreements between those two annotation services as to what is a gene, and where they might be, and what transcripts arise from that gene, and which exons, etc.

So it's not surprising at all that there are things getting lost in translation. IMO there are too many gotchas that arise when trying to map from EBI/EMBL annotations to NCBI and back again, so I always try to stay with whatever annotations I started with.

In your case that would mean you should stick with the EnsDb, and forgo the TxDb, and don't try to get NCBI Gene IDs.

ADD COMMENT

Login before adding your answer.

Traffic: 724 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6