RNA seq analysis of Arabidopsis genome
1
0
Entering edit mode
Divya, D. ▴ 10
@divya-d-4847
Last seen 9.6 years ago
Hi, I want to do the GO analysis for Arabidopsis using GOseq package. From the PDF I understood that the package relies on the UCSC genome browser to extract information regarding gene length, GO category. But UCSC genome browser has information about animal kingdom and I am unable to add Arabidopsis genome information which is available from TAIR. I have used "org.At.tair.db<http: www.bioconductor.org="" packages="" relea="" se="" data="" annotation="" html="" org.at.tair.db.html="">" but this doesn't add information regarding gene length. I also looked at the "genomic Feature" to add gene length but this also relies on UCSC browser. Kindly guide me to analyse RNA seq data for Arabidopsis genome. Waiting in anticipation, Regards, Divya Vashisht [[alternative HTML version deleted]]
GO Category goseq GO Category goseq • 2.2k views
ADD COMMENT
0
Entering edit mode
@matthew-young-4865
Last seen 9.6 years ago
Hi Divya, The organism packages do not keep any information on gene length, so goseq obtains it from the UCSC. As you have noted, this is not available for Arabidopsis. Unfortunately, this means you will have to obtain this information from elsewhere and format it for use with goseq. Information on formatting length data is available in the help for the nullp function and in the vignette, but in short you need to give the bias.data argument a vector the same length as your DEgenes vector which contains the length of each gene. Cheers, Matt On Mon, Sep 12, 2011 at 10:54 AM, Divya, D. <d.divya@uu.nl> wrote: > Hi, > > I want to do the GO analysis for Arabidopsis using GOseq package. From the > PDF I understood that the package relies on the UCSC genome browser to > extract information regarding gene length, GO category. But UCSC genome > browser has information about animal kingdom and I am unable to add > Arabidopsis genome information which is available from TAIR. > > I have used "org.At.tair.db< > http://www.bioconductor.org/packages/release/data/annotation/html/or g.At.tair.db.html>" > but this doesn't add information regarding gene length. I also looked at the > "genomic Feature" to add gene length but this also relies on UCSC browser. > > Kindly guide me to analyse RNA seq data for Arabidopsis genome. > > Waiting in anticipation, > Regards, > Divya Vashisht > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioconductor mailing list > Bioconductor@r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor > [[alternative HTML version deleted]]
ADD COMMENT
0
Entering edit mode
Dear Matt would it be a feasible alternative for Divya to use expression level (estimated e.g. by each gene's average count in the data, or some function thereof) for sampling bias adjustment in bias.data, rather than length? Best wishes Wolfgang Sep/19/11 2:34 PM, Matthew Young scripsit:: > Hi Divya, > > The organism packages do not keep any information on gene length, so goseq > obtains it from the UCSC. As you have noted, this is not available for > Arabidopsis. Unfortunately, this means you will have to obtain this > information from elsewhere and format it for use with goseq. Information on > formatting length data is available in the help for the nullp function and > in the vignette, but in short you need to give the bias.data argument a > vector the same length as your DEgenes vector which contains the length of > each gene. > > Cheers, > > Matt > > On Mon, Sep 12, 2011 at 10:54 AM, Divya, D.<d.divya at="" uu.nl=""> wrote: > >> Hi, >> >> I want to do the GO analysis for Arabidopsis using GOseq package. From the >> PDF I understood that the package relies on the UCSC genome browser to >> extract information regarding gene length, GO category. But UCSC genome >> browser has information about animal kingdom and I am unable to add >> Arabidopsis genome information which is available from TAIR. >> >> I have used "org.At.tair.db< >> http://www.bioconductor.org/packages/release/data/annotation/html/o rg.At.tair.db.html>" >> but this doesn't add information regarding gene length. I also looked at the >> "genomic Feature" to add gene length but this also relies on UCSC browser. >> >> Kindly guide me to analyse RNA seq data for Arabidopsis genome. >> >> Waiting in anticipation, >> Regards, >> Divya Vashisht >> >> [[alternative HTML version deleted]] >> >> _______________________________________________ >> Bioconductor mailing list >> Bioconductor at r-project.org >> https://stat.ethz.ch/mailman/listinfo/bioconductor >> Search the archives: >> http://news.gmane.org/gmane.science.biology.informatics.conductor >> > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor -- Wolfgang Huber EMBL http://www.embl.de/research/units/genome_biology/huber
ADD REPLY

Login before adding your answer.

Traffic: 845 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6