Hi all,
I have a list of ENSEMBL ids of human lncRNA's for which I am trying to figure out the gene type (or biotype) eg. lincRNA, processed transcript, antisense, sense_overlapping, etc. Now, I have a GTF file (GENCODE v30 ) which contains the gene_id and gene_type argument in the 9th column. I could somehow try to use a code/script to map my IDs using this GTF file, but I was wondering whether there was an easier way to do it using online tools? I tried biomaRt but the current version of the ensemble release collapses the various types of lncRNAs to a single type ie. lncRNA. I really want to get the subtypes for each lncRNA.
P.S. The last GENCODE version with the lncRNA type 'split up' is the v30.
Thanks in advance :)
```
This information is also available in the
ensembldb
EnsDb
databases. To use the information for Ensembl release 100 you could for example get the mapping gene ID to gene biotype with:Or if you have a set of gene IDs you could query the data like that:
but be aware that the ordering in the result
DataFrame
is not the same as the ordering of the query IDs.hope this helps.
cheers, jo