Search
Question: R packages for position anotation?
0
7.5 years ago by
Jinyan Huang190
Jinyan Huang190 wrote:
Dear all, Is there a Bioconductor packages can do like something like this? Giving a position, it can tell where this position locate in the gene (3' UTR, 5' UTR Exon, intron and so on). Thanks.
modified 7.5 years ago by Steve Lianoglou12k • written 7.5 years ago by Jinyan Huang190
0
7.5 years ago by
Denali
Steve Lianoglou12k wrote:
Hi, On Mon, Dec 13, 2010 at 10:59 AM, Jinyan Huang <jhuang.ceph at="" gmail.com=""> wrote: > Dear all, > Is there a Bioconductor packages can do like something like this? > Giving a position, it can tell where this position locate in the gene > (3' UTR, 5' UTR Exon, intron and so on). With a bit of coding, you can use the GenomicFeatures package to get this info. Build your TranscriptDb, then look at the help pages in ?transcriptsBy If you store your query positions in a GRanges object, you can use the results returned from the functions above with findOverlaps/countOverlaps to find your hits. -steve -- Steve Lianoglou Graduate Student: Computational Systems Biology ?| Memorial Sloan-Kettering Cancer Center ?| Weill Medical College of Cornell University Contact Info: http://cbio.mskcc.org/~lianos/contact
Hi Steve, Thanks for your suggestion. What I understand your suggestion is I shoud use exonsBy, intronsByTranscript, fiveUTRsByTranscrip, threeUTRsByTranscript separately. For example, if I want to check whether a position is in exon or not. I do as fellow. But this is just suitable for one position each time. Is it possible to check many positions at the same time? Thanks. >txdb <- loadFeatures(samplefile) >annotGr <- exonsBy(txdb, "tx") >gr <- GRanges( seqnames = c("chr11"), ranges = IRanges(start = c(74994500), end = c(74994500)), strand = c("-")) >OL <- findOverlaps(query = annotGr, subject = gr) >tdata <- annotGr[unique(queryHits(OL)),] >tdata GRangesList of length 2 \$44081 GRanges with 16 ranges and 3 elementMetadata values On Mon, Dec 13, 2010 at 5:23 PM, Steve Lianoglou <mailinglist.honeypot at="" gmail.com=""> wrote: > Hi, > > On Mon, Dec 13, 2010 at 10:59 AM, Jinyan Huang <jhuang.ceph at="" gmail.com=""> wrote: >> Dear all, >> Is there a Bioconductor packages can do like something like this? >> Giving a position, it can tell where this position locate in the gene >> (3' UTR, 5' UTR Exon, intron and so on). > > With a bit of coding, you can use the GenomicFeatures package to get this info. > > Build your TranscriptDb, then look at the help pages in ?transcriptsBy > > If you store your query positions in a GRanges object, you can use the > results returned from the functions above with > findOverlaps/countOverlaps to find your hits. > > -steve > > -- > Steve Lianoglou > Graduate Student: Computational Systems Biology > ?| Memorial Sloan-Kettering Cancer Center > ?| Weill Medical College of Cornell University > Contact Info: http://cbio.mskcc.org/~lianos/contact >
Hi, On Tue, Dec 14, 2010 at 1:37 PM, Jinyan Huang <jhuang.ceph at="" gmail.com=""> wrote: > Hi Steve, > > Thanks for your suggestion. > > What I understand your suggestion is I shoud use exonsBy, > intronsByTranscript, fiveUTRsByTranscrip, threeUTRsByTranscript > separately. For example, if I want to check whether a position is in > exon or not. I do as fellow. But this is just suitable for one > position each time. Is it possible to check many positions at the same > time? Thanks. I'm not sure which direction of "many" you are looking for. If you want to test if multiple positions are in any exon, you can make your gr GRanges object bigger (include those positions), and tease apart your findOverlaps results by which subjectHits are overlapping with which queryHits ... Is that what you're asking? -- Steve Lianoglou Graduate Student: Computational Systems Biology ?| Memorial Sloan-Kettering Cancer Center ?| Weill Medical College of Cornell University Contact Info: http://cbio.mskcc.org/~lianos/contact
0
7.5 years ago by
Denali
Steve Lianoglou12k wrote:
Hi Jinyang, Don't forget to hit "reply all" when replying to bioconductor emails so that all correspondence is on list and, therefore, helpful to other people who might land in the same boat as you. On Tue, Dec 14, 2010 at 4:45 PM, Jinyan Huang <jhuang.ceph at="" gmail.com=""> wrote: > Hi Steve, > > Thanks. Now I know how to use subjectHits to check them. > > I want to check whether are they located in 5'UTR or 3'UTR or exon or > intron or non-gene region. What I think I shoud check these positions > using exonsBy, intronsByTranscript, fiveUTRsByTranscrip, > threeUTRsByTranscript separately. If a position is not in the result > of above, this position should be a non-gene region. Is it right? Well, assuming your annotations are "perfect," what else could it be? -steve -- Steve Lianoglou Graduate Student: Computational Systems Biology ?| Memorial Sloan-Kettering Cancer Center ?| Weill Medical College of Cornell University Contact Info: http://cbio.mskcc.org/~lianos/contact