Ensembl or Gencode version of TxDb.Hsapiens.UCSC.hg19.knownGene
1
1
Entering edit mode
@adaikalavan-ramasamy-5765
Last seen 10.1 years ago
United Kingdom
Hi there, I am using the R package TxDb.Hsapiens.UCSC.hg19.knownGene to annotate SNPs using the locateVariants() function from the VariantAnnotation package. For reasons of compatibility with other pipelines I developed, I would like to use Ensembl/Gencode annotation. Is there a package for this? Otherwise please advice on how best to generate this myself. Thank you. Regards, Adai [[alternative HTML version deleted]]
Annotation Annotation • 4.9k views
ADD COMMENT
1
Entering edit mode
@james-w-macdonald-5106
Last seen 36 minutes ago
United States
Hi Adai, See makeTranscriptDbFromBiomart() in the GenomicFeatures package. Best, Jim On Friday, October 18, 2013 9:18:46 AM, Adaikalavan Ramasamy wrote: > Hi there, > > I am using the R package TxDb.Hsapiens.UCSC.hg19.knownGene to annotate SNPs > using the locateVariants() function from the VariantAnnotation package. > > For reasons of compatibility with other pipelines I developed, I would like > to use Ensembl/Gencode annotation. Is there a package for this? Otherwise > please advice on how best to generate this myself. > > Thank you. > > Regards, Adai > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor -- James W. MacDonald, M.S. Biostatistician University of Washington Environmental and Occupational Health Sciences 4225 Roosevelt Way NE, # 100 Seattle WA 98105-6099
ADD COMMENT
0
Entering edit mode
Dear James, Thank you. This works brilliantly. However it now returns ENSG ids (previously entrez IDs). When I try to map these into gene symbols using og.Hs.egENSEMBL, about 20% of ENSG ids are not found. Should I use another database/package to do this conversion? On Fri, Oct 18, 2013 at 2:27 PM, James W. MacDonald <jmacdon@uw.edu> wrote: > Hi Adai, > > See makeTranscriptDbFromBiomart() in the GenomicFeatures package. > > Best, > > Jim > > > > > On Friday, October 18, 2013 9:18:46 AM, Adaikalavan Ramasamy wrote: > >> Hi there, >> >> I am using the R package TxDb.Hsapiens.UCSC.hg19.**knownGene to annotate >> SNPs >> using the locateVariants() function from the VariantAnnotation package. >> >> For reasons of compatibility with other pipelines I developed, I would >> like >> to use Ensembl/Gencode annotation. Is there a package for this? Otherwise >> please advice on how best to generate this myself. >> >> Thank you. >> >> Regards, Adai >> >> [[alternative HTML version deleted]] >> >> ______________________________**_________________ >> Bioconductor mailing list >> Bioconductor@r-project.org >> https://stat.ethz.ch/mailman/**listinfo/bioconductor<https: stat.e="" thz.ch="" mailman="" listinfo="" bioconductor=""> >> Search the archives: http://news.gmane.org/gmane.** >> science.biology.informatics.**conductor<http: news.gmane.org="" gmane="" .science.biology.informatics.conductor=""> >> > > -- > James W. MacDonald, M.S. > Biostatistician > University of Washington > Environmental and Occupational Health Sciences > 4225 Roosevelt Way NE, # 100 > Seattle WA 98105-6099 > [[alternative HTML version deleted]]
ADD REPLY
0
Entering edit mode
On Fri, Oct 18, 2013 at 12:37 PM, Adaikalavan Ramasamy <adaikalavan.ramasamy at="" gmail.com=""> wrote: > Dear James, > > Thank you. This works brilliantly. However it now returns ENSG ids > (previously entrez IDs). > > When I try to map these into gene symbols using og.Hs.egENSEMBL, about 20% > of ENSG ids are not found. Should I use another database/package to do this > conversion? Hi, Adai. You might consider using the biomaRt package. ; ) Sean
ADD REPLY
0
Entering edit mode
Dear Sean, Apologies for the late reply. I tried biomaRt and again about 17% of the ENSG ids are missing. In the end, I uploaded the list to http://www.ensembl.org/biomart/ and got all the IDs mapped. It appears that processed clone genes (e.g. AL161908.1, RP11-492E3.2) were not included in the R package biomaRt or og.Hs.egENSEMBL. Not ideal for future automation but OK for now. Thank you. Regards, Adai Regards, Adai On Fri, Oct 18, 2013 at 5:39 PM, Sean Davis <sdavis2@mail.nih.gov> wrote: > On Fri, Oct 18, 2013 at 12:37 PM, Adaikalavan Ramasamy > <adaikalavan.ramasamy@gmail.com> wrote: > > Dear James, > > > > Thank you. This works brilliantly. However it now returns ENSG ids > > (previously entrez IDs). > > > > When I try to map these into gene symbols using og.Hs.egENSEMBL, about > 20% > > of ENSG ids are not found. Should I use another database/package to do > this > > conversion? > > Hi, Adai. > > You might consider using the biomaRt package. ; ) > > Sean > [[alternative HTML version deleted]]
ADD REPLY
0
Entering edit mode
On Sun, Oct 27, 2013 at 6:51 PM, Adaikalavan Ramasamy <adaikalavan.ramasamy at="" gmail.com=""> wrote: > Dear Sean, > > Apologies for the late reply. I tried biomaRt and again about 17% of the > ENSG ids are missing. > > In the end, I uploaded the list to http://www.ensembl.org/biomart/ and got > all the IDs mapped. It appears that processed clone genes (e.g. AL161908.1, > RP11-492E3.2) were not included in the R package biomaRt or og.Hs.egENSEMBL. > Not ideal for future automation but OK for now. Hi, Adai. The R biomaRt package actually queries the ensembl biomart, so they two will give the same results if given the same query against the same dataset. Versions need to match, so that might be something to check. Sean > Thank you. > > Regards, Adai > > > > > > Regards, Adai > > > On Fri, Oct 18, 2013 at 5:39 PM, Sean Davis <sdavis2 at="" mail.nih.gov=""> wrote: > >> On Fri, Oct 18, 2013 at 12:37 PM, Adaikalavan Ramasamy >> <adaikalavan.ramasamy at="" gmail.com=""> wrote: >> > Dear James, >> > >> > Thank you. This works brilliantly. However it now returns ENSG ids >> > (previously entrez IDs). >> > >> > When I try to map these into gene symbols using og.Hs.egENSEMBL, about >> 20% >> > of ENSG ids are not found. Should I use another database/package to do >> this >> > conversion? >> >> Hi, Adai. >> >> You might consider using the biomaRt package. ; ) >> >> Sean >> > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
ADD REPLY

Login before adding your answer.

Traffic: 951 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6