VariantAnnotation: readVcf on nonmodel organism
1
0
Entering edit mode
Yannick Wurm ▴ 40
@yannick-wurm-5519
Last seen 2.7 years ago
United Kingdom
Dear list, I work with a non-model organism (an ant) for which we sequenced the genome. The genome is on NCBI but isn't annotated in the standard R-accessible databases. I'm now trying to load a VCF file into R (with the aim of subsequent plotting with ggbio). > library(VariantAnnotation) > readVcf("mymappings.vcf.gz") This fails because readVcf wants a "genome" argument. I haven't been able to find an example using a non-established genome. How can I generate such a genome for my species? The genome is in the form of 10,000 scaffolds. I'm looking for a quick hack, not a complex submission to another database. FWIW This is on R 2.15.0 with latest bioconductor. Thanks for your help & kind regards, Yannick -------------------------------------- Yannick Wurm http://yannick.poulet.org Ants, Genomes & Evolution ? y.wurm at qmul.ac.uk ? skype:yannickwurm ? +44 207 882 3049 5.03A Fogg ? School of Biological & Chemical Sciences ? Queen Mary, University of London ? Mile End Road ? E1 4NS London ? UK Easy custom BLAST interface: http://www.sequenceserver.com
Organism genomes Organism genomes • 1.7k views
ADD COMMENT
0
Entering edit mode
@valerie-obenchain-4275
Last seen 2.9 years ago
United States
Hi Yannick, The 'genome' argument to readVcf() is a character string specifying the build of the genome the variants are mapped to. It's present use is for your own bookkeeping so there are no "invalid" strings. In this case using the build that is on NCBI would be logical or you could use your own tag, readVcf("mymappings.vcf.gz", "draftgenome") Valerie On 09/26/2012 10:56 AM, Yannick Wurm wrote: > Dear list, > > I work with a non-model organism (an ant) for which we sequenced the genome. The genome is on NCBI but isn't annotated in the standard R-accessible databases. > > I'm now trying to load a VCF file into R (with the aim of subsequent plotting with ggbio). > >> library(VariantAnnotation) >> readVcf("mymappings.vcf.gz") > This fails because readVcf wants a "genome" argument. I haven't been able to find an example using a non-established genome. How can I generate such a genome for my species? The genome is in the form of 10,000 scaffolds. I'm looking for a quick hack, not a complex submission to another database. > > FWIW This is on R 2.15.0 with latest bioconductor. > > Thanks for your help& kind regards, > > Yannick > > > > > > -------------------------------------- > Yannick Wurm http://yannick.poulet.org > Ants, Genomes& Evolution ? y.wurm at qmul.ac.uk ? skype:yannickwurm ? +44 207 882 3049 > 5.03A Fogg ? School of Biological& Chemical Sciences ? Queen Mary, University of London ? Mile End Road ? E1 4NS London ? UK > Easy custom BLAST interface: http://www.sequenceserver.com > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
ADD COMMENT
0
Entering edit mode
Ha that's hilarious! I wouldn't have thunk it could be so easy! thanks Sean & Valerie!! yannick On 26 Sep 2012, at 19:49, Valerie Obenchain wrote: > Hi Yannick, > > The 'genome' argument to readVcf() is a character string specifying the build of the genome the variants are mapped to. It's present use is for your own bookkeeping so there are no "invalid" strings. In this case using the build that is on NCBI would be logical or you could use your own tag, > > readVcf("mymappings.vcf.gz", "draftgenome") > > Valerie > > > On 09/26/2012 10:56 AM, Yannick Wurm wrote: >> Dear list, >> >> I work with a non-model organism (an ant) for which we sequenced the genome. The genome is on NCBI but isn't annotated in the standard R-accessible databases. >> >> I'm now trying to load a VCF file into R (with the aim of subsequent plotting with ggbio). >> >>> library(VariantAnnotation) >>> readVcf("mymappings.vcf.gz") >> This fails because readVcf wants a "genome" argument. I haven't been able to find an example using a non-established genome. How can I generate such a genome for my species? The genome is in the form of 10,000 scaffolds. I'm looking for a quick hack, not a complex submission to another database. >> >> FWIW This is on R 2.15.0 with latest bioconductor. >> >> Thanks for your help& kind regards, >> >> Yannick >> >> >> >> >> >> -------------------------------------- >> Yannick Wurm http://yannick.poulet.org >> Ants, Genomes& Evolution ? y.wurm at qmul.ac.uk ? skype:yannickwurm ? +44 207 882 3049 >> 5.03A Fogg ? School of Biological& Chemical Sciences ? Queen Mary, University of London ? Mile End Road ? E1 4NS London ? UK >> Easy custom BLAST interface: http://www.sequenceserver.com >> _______________________________________________ >> Bioconductor mailing list >> Bioconductor at r-project.org >> https://stat.ethz.ch/mailman/listinfo/bioconductor >> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor >
ADD REPLY

Login before adding your answer.

Traffic: 765 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6