Hello,
Does anyone know of a way to get a list of ENSEMBL gene identifiers for all known tRNA encoding genes for a given species?
At the moment, I am interested in generating such lists for human and mouse.
For other types of genes (rRNAs, snoRNAs, etc.) I am able to use biomaRt to find all such genes, for example:
library(biomaRt) ensembl_mart = useMart(biomart="ensembl") biomart = useDataset('hsapiens_gene_ensembl', mart=ensembl_mart) biomart_genes = getBM(attributes=c("ensembl_gene_id", "gene_biotype"), mart=biomart) my_genes$type = biomart_genes$gene_biotype[match(my_genes$ENSEMBL, biomart_genes$ensembl_gene_id)]
...where "my_genes" is some dataframe of genes (e.g. a count table) with an id field called "ENSEMBL"
tRNAs do not have their own gene_biotype grouping, and therefor a separate approach is needed to find them.
I stumbled across Marc's FDB.UCSC.tRNAs database package and was able to figure out how to at least get a list of tRNA genes:
library('FDb.UCSC.tRNAs') names(features(FDb.Hsapiens.UCSC.hg19.tRNAs))
However, I'm not sure how to map from these entries back to my ENSEMBL gene ids.
Anyone know of a better way?
Keith
Hi Keith
I was in a similar situation a few months ago, when I was working with the ensembl gtf files (for mouse and human), which only contain the mitochondrial tRNA genes. I approached the ensembl helpdesk, and was adviced to use the Perl API to get them.
Hans-Rudolf
Hi Hans-Rudolf,
Thanks for the suggestion -- I will check out the Perl API.
Did you have any luck using it to query tRNAs?
yes, it worked very well for my needs. The trick was:
$slice->get_all_SimpleFeatures('tRNAscan') }
Hans-Rudolf