Search
Question: rtracklayer issue (getTable)
0
gravatar for Mark Robinson
8.2 years ago by
Mark Robinson1.1k
Mark Robinson1.1k wrote:
Hello BioC list. I've encountered a small problem when trying to download some tracks from UCSC with rtracklayer. Specifically, I'm trying to get the "mrna" table. The example below works great for the "refGene" table, but gives an error for the "mrna" table. Any suggestions? Thanks, Mark > library(rtracklayer) Loading required package: RCurl Loading required package: bitops > session <- browserSession("UCSC") > genome(session) <- "hg18" > options(stringsAsFactors=FALSE) > q1 <- ucscTableQuery(session, "refGene",GenomicRanges(197848272, 198064461, "chr3")) > tab1 <- getTable(q1) > head(tab1[,1:5]) bin name chrom strand txStart 1 2094 NM_198565 chr3 + 197851052 2 261 NM_032898 chr3 - 197917544 3 2095 NM_017861 chr3 + 197923642 4 2095 NM_002577 chr3 + 197951124 > q2 <- ucscTableQuery(session, "mrna",GenomicRanges(197848272, 198064461, "chr3")) > tab2 <- getTable(q2) Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, na.strings, : line 1 did not have 2 elements > traceback() 5: scan(file = file, what = what, sep = sep, quote = quote, dec = dec, nmax = nrows, skip = 0, na.strings = na.strings, quiet = TRUE, fill = fill, strip.white = strip.white, blank.lines.skip = blank.lines.skip, multi.line = FALSE, comment.char = comment.char, allowEscapes = allowEscapes, flush = flush, encoding = encoding) 4: read.table(f, sep = "\t", header = TRUE) 3: .local(object, ...) 2: getTable(q2) 1: getTable(q2) > sessionInfo() R version 2.9.1 (2009-06-26) i386-apple-darwin8.11.1 locale: en_CA.UTF-8/en_CA.UTF-8/C/C/en_CA.UTF-8/en_CA.UTF-8 attached base packages: [1] stats graphics grDevices utils datasets methods base other attached packages: [1] rtracklayer_1.4.1 RCurl_0.98-1 bitops_1.0-4.1 loaded via a namespace (and not attached): [1] Biobase_2.4.1 Biostrings_2.12.8 BSgenome_1.12.3 IRanges_1.2.3 [5] XML_2.6-0 --------- library(rtracklayer) session <- browserSession("UCSC") genome(session) <- "hg18" options(stringsAsFactors=FALSE) qry1 <- ucscTableQuery(session, "refGene",GenomicRanges(197848272, 198064461, "chr3")) tab1 <- getTable(qry1) head(tab1[,1:5]) qry2 <- ucscTableQuery(session, "mrna",GenomicRanges(197848272, 198064461, "chr3")) tab2 <- getTable(qry2) traceback() sessionInfo() --------- ------------------------------ Mark Robinson, PhD (Melb) Epigenetics Laboratory, Garvan Bioinformatics Division, WEHI e: m.robinson at garvan.org.au e: mrobinson at wehi.edu.au p: +61 (0)3 9345 2628 f: +61 (0)3 9347 0852
ADD COMMENTlink modified 8.2 years ago by Michael Lawrence620 • written 8.2 years ago by Mark Robinson1.1k
0
gravatar for Michael Lawrence
8.2 years ago by
Michael Lawrence620 wrote:
On Thu, Oct 1, 2009 at 11:47 PM, Mark Robinson <mrobinson@wehi.edu.au>wrote: > Hello BioC list. > > I've encountered a small problem when trying to download some tracks from > UCSC with rtracklayer. Specifically, I'm trying to get the "mrna" table. > The example below works great for the "refGene" table, but gives an error > for the "mrna" table. > > Any suggestions? > > Looked into this and found that the table browser has a slightly different set of tracks from the browser itself. I added a trackNames method (version 1.5.19) on the UCSCTableQuery, so that it's possible to know which tracks are available. >From that, you'll see that 'mrna' needs to be 'all_mrna'. Michael > Thanks, > Mark > > > library(rtracklayer) > Loading required package: RCurl > Loading required package: bitops > > session <- browserSession("UCSC") > > genome(session) <- "hg18" > > options(stringsAsFactors=FALSE) > > q1 <- ucscTableQuery(session, "refGene",GenomicRanges(197848272, > 198064461, "chr3")) > > tab1 <- getTable(q1) > > head(tab1[,1:5]) > bin name chrom strand txStart > 1 2094 NM_198565 chr3 + 197851052 > 2 261 NM_032898 chr3 - 197917544 > 3 2095 NM_017861 chr3 + 197923642 > 4 2095 NM_002577 chr3 + 197951124 > > q2 <- ucscTableQuery(session, "mrna",GenomicRanges(197848272, 198064461, > "chr3")) > > tab2 <- getTable(q2) > Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, na.strings, > : > line 1 did not have 2 elements > > traceback() > 5: scan(file = file, what = what, sep = sep, quote = quote, dec = dec, > nmax = nrows, skip = 0, na.strings = na.strings, quiet = TRUE, > fill = fill, strip.white = strip.white, blank.lines.skip = > blank.lines.skip, > multi.line = FALSE, comment.char = comment.char, allowEscapes = > allowEscapes, > flush = flush, encoding = encoding) > 4: read.table(f, sep = "\t", header = TRUE) > 3: .local(object, ...) > 2: getTable(q2) > 1: getTable(q2) > > sessionInfo() > R version 2.9.1 (2009-06-26) > i386-apple-darwin8.11.1 > > locale: > en_CA.UTF-8/en_CA.UTF-8/C/C/en_CA.UTF-8/en_CA.UTF-8 > > attached base packages: > [1] stats graphics grDevices utils datasets methods base > > other attached packages: > [1] rtracklayer_1.4.1 RCurl_0.98-1 bitops_1.0-4.1 > > loaded via a namespace (and not attached): > [1] Biobase_2.4.1 Biostrings_2.12.8 BSgenome_1.12.3 IRanges_1.2.3 > [5] XML_2.6-0 > > > > > > --------- > library(rtracklayer) > session <- browserSession("UCSC") > genome(session) <- "hg18" > options(stringsAsFactors=FALSE) > qry1 <- ucscTableQuery(session, "refGene",GenomicRanges(197848272, > 198064461, "chr3")) > tab1 <- getTable(qry1) > head(tab1[,1:5]) > qry2 <- ucscTableQuery(session, "mrna",GenomicRanges(197848272, 198064461, > "chr3")) > tab2 <- getTable(qry2) > traceback() > sessionInfo() > --------- > > > ------------------------------ > Mark Robinson, PhD (Melb) > Epigenetics Laboratory, Garvan > Bioinformatics Division, WEHI > e: m.robinson@garvan.org.au > e: mrobinson@wehi.edu.au > p: +61 (0)3 9345 2628 > f: +61 (0)3 9347 0852 > > _______________________________________________ > Bioconductor mailing list > Bioconductor@stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor > [[alternative HTML version deleted]]
ADD COMMENTlink written 8.2 years ago by Michael Lawrence620
Hi Michael. It looks like I'm still getting some unusual behaviour with 1.5.19 (2.10) ... and am unsuccessful with the mrna table on 1.4.1 (2.9). I'm sending the output for both versions below. It seems 1.5.19 is only gathering results for a small region on chrX and 1.4.1 doesn't accept the "all_mrna" table. I'm using fairly old versions of R 2.9 and 2.10, but I'd guess that that is not the issue. Any suggestions? Thanks, Mark [1.4.1 -- 2.9] --------------------------------- > library(rtracklayer) Loading required package: RCurl Loading required package: bitops > session <- browserSession("UCSC") > genome(session) <- "hg18" > q1 <- ucscTableQuery(session, "refGene") > q1 Get track 'refGene' within hg18:*:*-* > refGene <- getTable(q1) > q2 <- ucscTableQuery(session, "all_mrna") Error in normArgTrack(x, track) : Unknown track: all_mrna > sessionInfo() R version 2.9.1 (2009-06-26) i386-apple-darwin8.11.1 locale: en_CA.UTF-8/en_CA.UTF-8/C/C/en_CA.UTF-8/en_CA.UTF-8 attached base packages: [1] stats graphics grDevices utils datasets methods base other attached packages: [1] rtracklayer_1.4.1 RCurl_0.98-1 bitops_1.0-4.1 loaded via a namespace (and not attached): [1] Biobase_2.4.1 Biostrings_2.12.9 BSgenome_1.12.3 IRanges_1.2.3 [5] XML_2.6-0 --------------------------------- [1.5.19 -- 2.10] --------------------------------- > library(rtracklayer) Loading required package: RCurl Loading required package: bitops > session <- browserSession("UCSC") > genome(session) <- "hg18" > q1 <- ucscTableQuery(session, "refGene") > q1 Get track 'refGene' within hg18:chrX:151073054-151383976 > refGene <- getTable(q1) > q2 <- ucscTableQuery(session, "all_mrna") > q2 Get track 'all_mrna' within hg18:chrX:151073054-151383976 > mrna <- getTable(q2) > sessionInfo() R version 2.10.0 Under development (unstable) (2009-05-22 r48594) i386-apple-darwin8.11.1 locale: [1] en_CA.UTF-8/en_CA.UTF-8/C/C/en_CA.UTF-8/en_CA.UTF-8 attached base packages: [1] stats graphics grDevices utils datasets methods base other attached packages: [1] rtracklayer_1.5.19 RCurl_0.98-1 bitops_1.0-4.1 loaded via a namespace (and not attached): [1] Biobase_2.5.8 Biostrings_2.13.48 BSgenome_1.13.16 IRanges_1.3.87 [5] XML_2.6-0 --------------------------------- On 3-Oct-09, at 11:10 AM, Michael Lawrence wrote: > > > On Thu, Oct 1, 2009 at 11:47 PM, Mark Robinson > <mrobinson at="" wehi.edu.au=""> wrote: > Hello BioC list. > > I've encountered a small problem when trying to download some tracks > from UCSC with rtracklayer. Specifically, I'm trying to get the > "mrna" table. The example below works great for the "refGene" > table, but gives an error for the "mrna" table. > > Any suggestions? > > > Looked into this and found that the table browser has a slightly > different set of tracks from the browser itself. I added a > trackNames method (version 1.5.19) on the UCSCTableQuery, so that > it's possible to know which tracks are available. > > From that, you'll see that 'mrna' needs to be 'all_mrna'. > > Michael > > Thanks, > Mark > > > library(rtracklayer) > Loading required package: RCurl > Loading required package: bitops > > session <- browserSession("UCSC") > > genome(session) <- "hg18" > > options(stringsAsFactors=FALSE) > > q1 <- ucscTableQuery(session, "refGene",GenomicRanges(197848272, > 198064461, "chr3")) > > tab1 <- getTable(q1) > > head(tab1[,1:5]) > bin name chrom strand txStart > 1 2094 NM_198565 chr3 + 197851052 > 2 261 NM_032898 chr3 - 197917544 > 3 2095 NM_017861 chr3 + 197923642 > 4 2095 NM_002577 chr3 + 197951124 > > q2 <- ucscTableQuery(session, "mrna",GenomicRanges(197848272, > 198064461, "chr3")) > > tab2 <- getTable(q2) > Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, > na.strings, : > line 1 did not have 2 elements > > traceback() > 5: scan(file = file, what = what, sep = sep, quote = quote, dec = dec, > nmax = nrows, skip = 0, na.strings = na.strings, quiet = TRUE, > fill = fill, strip.white = strip.white, blank.lines.skip = > blank.lines.skip, > multi.line = FALSE, comment.char = comment.char, allowEscapes > = allowEscapes, > flush = flush, encoding = encoding) > 4: read.table(f, sep = "\t", header = TRUE) > 3: .local(object, ...) > 2: getTable(q2) > 1: getTable(q2) > > sessionInfo() > R version 2.9.1 (2009-06-26) > i386-apple-darwin8.11.1 > > locale: > en_CA.UTF-8/en_CA.UTF-8/C/C/en_CA.UTF-8/en_CA.UTF-8 > > attached base packages: > [1] stats graphics grDevices utils datasets methods base > > other attached packages: > [1] rtracklayer_1.4.1 RCurl_0.98-1 bitops_1.0-4.1 > > loaded via a namespace (and not attached): > [1] Biobase_2.4.1 Biostrings_2.12.8 BSgenome_1.12.3 > IRanges_1.2.3 > [5] XML_2.6-0 > > > > > > --------- > library(rtracklayer) > session <- browserSession("UCSC") > genome(session) <- "hg18" > options(stringsAsFactors=FALSE) > qry1 <- ucscTableQuery(session, "refGene",GenomicRanges(197848272, > 198064461, "chr3")) > tab1 <- getTable(qry1) > head(tab1[,1:5]) > qry2 <- ucscTableQuery(session, "mrna",GenomicRanges(197848272, > 198064461, "chr3")) > tab2 <- getTable(qry2) > traceback() > sessionInfo() > --------- > > > ------------------------------ > Mark Robinson, PhD (Melb) > Epigenetics Laboratory, Garvan > Bioinformatics Division, WEHI > e: m.robinson at garvan.org.au > e: mrobinson at wehi.edu.au > p: +61 (0)3 9345 2628 > f: +61 (0)3 9347 0852 > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor > ------------------------------ Mark Robinson, PhD (Melb) Epigenetics Laboratory, Garvan Bioinformatics Division, WEHI e: m.robinson at garvan.org.au e: mrobinson at wehi.edu.au p: +61 (0)3 9345 2628 f: +61 (0)3 9347 0852
ADD REPLYlink written 8.2 years ago by Mark Robinson1.1k
Hi Mark, I was not planning on porting the fix to the released version, as it's not that critical. A new Bioc release is coming soon anyway. As for the devel version, I'm guessing this is more to your liking: > session <- browserSession("UCSC") > q2 <- ucscTableQuery(session, "all_mrna") > q2 Get track 'all_mrna' within hg18:*:*-* That is, you are probably not interested in the (default) chrX but the entire genome. I keep going back and forth on the logic for deriving the ranges. I made the change based on the reasoning that one will be working through a session, with one or more views, and end up at a region of interest. As the query is constructed from the session, the user would expect the default query range to be that of the session. In fact, this is the behavior of the UCSC table browser. To get what you need, you'll want to specify the range when constructing the query, e.g: > q2 <- ucscTableQuery(session, "all_mrna", GenomicRanges(genome = "hg18")) To get the entire genome's worth of data. I just tweaked the logic to make this work in version 1.5.20. Btw, you don't need the genome<- call at the beginning, just do it at query construction time. Michael On Sun, Oct 4, 2009 at 3:41 PM, Mark Robinson <mrobinson@wehi.edu.au> wrote: > Hi Michael. > > It looks like I'm still getting some unusual behaviour with 1.5.19 (2.10) > ... and am unsuccessful with the mrna table on 1.4.1 (2.9). > > I'm sending the output for both versions below. It seems 1.5.19 is only > gathering results for a small region on chrX and 1.4.1 doesn't accept the > "all_mrna" table. I'm using fairly old versions of R 2.9 and 2.10, but I'd > guess that that is not the issue. > > Any suggestions? > > Thanks, > Mark > > > [1.4.1 -- 2.9] > --------------------------------- > > library(rtracklayer) > Loading required package: RCurl > Loading required package: bitops > > session <- browserSession("UCSC") > > genome(session) <- "hg18" > > q1 <- ucscTableQuery(session, "refGene") > > q1 > Get track 'refGene' within hg18:*:*-* > > refGene <- getTable(q1) > > q2 <- ucscTableQuery(session, "all_mrna") > Error in normArgTrack(x, track) : Unknown track: all_mrna > > sessionInfo() > R version 2.9.1 (2009-06-26) > i386-apple-darwin8.11.1 > > locale: > en_CA.UTF-8/en_CA.UTF-8/C/C/en_CA.UTF-8/en_CA.UTF-8 > > attached base packages: > [1] stats graphics grDevices utils datasets methods base > > other attached packages: > [1] rtracklayer_1.4.1 RCurl_0.98-1 bitops_1.0-4.1 > > loaded via a namespace (and not attached): > [1] Biobase_2.4.1 Biostrings_2.12.9 BSgenome_1.12.3 IRanges_1.2.3 > [5] XML_2.6-0 > --------------------------------- > > > > [1.5.19 -- 2.10] > --------------------------------- > > library(rtracklayer) > Loading required package: RCurl > Loading required package: bitops > > session <- browserSession("UCSC") > > genome(session) <- "hg18" > > q1 <- ucscTableQuery(session, "refGene") > > q1 > Get track 'refGene' within hg18:chrX:151073054-151383976 > > refGene <- getTable(q1) > > q2 <- ucscTableQuery(session, "all_mrna") > > q2 > Get track 'all_mrna' within hg18:chrX:151073054-151383976 > > mrna <- getTable(q2) > > sessionInfo() > R version 2.10.0 Under development (unstable) (2009-05-22 r48594) > i386-apple-darwin8.11.1 > > locale: > [1] en_CA.UTF-8/en_CA.UTF-8/C/C/en_CA.UTF-8/en_CA.UTF-8 > > attached base packages: > [1] stats graphics grDevices utils datasets methods base > > other attached packages: > [1] rtracklayer_1.5.19 RCurl_0.98-1 bitops_1.0-4.1 > > loaded via a namespace (and not attached): > [1] Biobase_2.5.8 Biostrings_2.13.48 BSgenome_1.13.16 IRanges_1.3.87 > [5] XML_2.6-0 > --------------------------------- > > On 3-Oct-09, at 11:10 AM, Michael Lawrence wrote: > > >> >> On Thu, Oct 1, 2009 at 11:47 PM, Mark Robinson <mrobinson@wehi.edu.au> >> wrote: >> Hello BioC list. >> >> I've encountered a small problem when trying to download some tracks from >> UCSC with rtracklayer. Specifically, I'm trying to get the "mrna" table. >> The example below works great for the "refGene" table, but gives an error >> for the "mrna" table. >> >> Any suggestions? >> >> >> Looked into this and found that the table browser has a slightly different >> set of tracks from the browser itself. I added a trackNames method (version >> 1.5.19) on the UCSCTableQuery, so that it's possible to know which tracks >> are available. >> >> From that, you'll see that 'mrna' needs to be 'all_mrna'. >> >> Michael >> >> Thanks, >> Mark >> >> > library(rtracklayer) >> Loading required package: RCurl >> Loading required package: bitops >> > session <- browserSession("UCSC") >> > genome(session) <- "hg18" >> > options(stringsAsFactors=FALSE) >> > q1 <- ucscTableQuery(session, "refGene",GenomicRanges(197848272, >> 198064461, "chr3")) >> > tab1 <- getTable(q1) >> > head(tab1[,1:5]) >> bin name chrom strand txStart >> 1 2094 NM_198565 chr3 + 197851052 >> 2 261 NM_032898 chr3 - 197917544 >> 3 2095 NM_017861 chr3 + 197923642 >> 4 2095 NM_002577 chr3 + 197951124 >> > q2 <- ucscTableQuery(session, "mrna",GenomicRanges(197848272, 198064461, >> "chr3")) >> > tab2 <- getTable(q2) >> Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, na.strings, >> : >> line 1 did not have 2 elements >> > traceback() >> 5: scan(file = file, what = what, sep = sep, quote = quote, dec = dec, >> nmax = nrows, skip = 0, na.strings = na.strings, quiet = TRUE, >> fill = fill, strip.white = strip.white, blank.lines.skip = >> blank.lines.skip, >> multi.line = FALSE, comment.char = comment.char, allowEscapes = >> allowEscapes, >> flush = flush, encoding = encoding) >> 4: read.table(f, sep = "\t", header = TRUE) >> 3: .local(object, ...) >> 2: getTable(q2) >> 1: getTable(q2) >> > sessionInfo() >> R version 2.9.1 (2009-06-26) >> i386-apple-darwin8.11.1 >> >> locale: >> en_CA.UTF-8/en_CA.UTF-8/C/C/en_CA.UTF-8/en_CA.UTF-8 >> >> attached base packages: >> [1] stats graphics grDevices utils datasets methods base >> >> other attached packages: >> [1] rtracklayer_1.4.1 RCurl_0.98-1 bitops_1.0-4.1 >> >> loaded via a namespace (and not attached): >> [1] Biobase_2.4.1 Biostrings_2.12.8 BSgenome_1.12.3 IRanges_1.2.3 >> [5] XML_2.6-0 >> >> >> >> >> >> --------- >> library(rtracklayer) >> session <- browserSession("UCSC") >> genome(session) <- "hg18" >> options(stringsAsFactors=FALSE) >> qry1 <- ucscTableQuery(session, "refGene",GenomicRanges(197848272, >> 198064461, "chr3")) >> tab1 <- getTable(qry1) >> head(tab1[,1:5]) >> qry2 <- ucscTableQuery(session, "mrna",GenomicRanges(197848272, 198064461, >> "chr3")) >> tab2 <- getTable(qry2) >> traceback() >> sessionInfo() >> --------- >> >> >> ------------------------------ >> Mark Robinson, PhD (Melb) >> Epigenetics Laboratory, Garvan >> Bioinformatics Division, WEHI >> e: m.robinson@garvan.org.au >> e: mrobinson@wehi.edu.au >> p: +61 (0)3 9345 2628 >> f: +61 (0)3 9347 0852 >> >> _______________________________________________ >> Bioconductor mailing list >> Bioconductor@stat.math.ethz.ch >> https://stat.ethz.ch/mailman/listinfo/bioconductor >> Search the archives: >> http://news.gmane.org/gmane.science.biology.informatics.conductor >> >> > ------------------------------ > Mark Robinson, PhD (Melb) > Epigenetics Laboratory, Garvan > Bioinformatics Division, WEHI > e: m.robinson@garvan.org.au > e: mrobinson@wehi.edu.au > p: +61 (0)3 9345 2628 > f: +61 (0)3 9347 0852 > ------------------------------ > > > > > > [[alternative HTML version deleted]]
ADD REPLYlink written 8.2 years ago by Michael Lawrence620
0
gravatar for Michael Lawrence
8.2 years ago by
Michael Lawrence620 wrote:
On Mon, Oct 5, 2009 at 9:55 AM, Marc Carlson <mcarlson@fhcrc.org> wrote: > Whoa. > > Do you still plan to have the default behavoir be just chromosome X? > Because that would be quite surprising to me... > > Thinking about this some more, this would be surprising to those just wanting to use rtracklayer to download tracks. It seems like this is becoming a more popular use case than controlling the browser from R. Thus, I made ucscTableQuery() only inherit the genome from the session, so that: > session <- browserSession() > q2 <- ucscTableQuery(session, "all_mrna") Get track 'all_mrna' within hg18:*:*-* That's version 1.5.21. Michael Marc > > > > Michael Lawrence wrote: > > Hi Mark, > > > > I was not planning on porting the fix to the released version, as it's > not > > that critical. A new Bioc release is coming soon anyway. > > > > As for the devel version, I'm guessing this is more to your liking: > > > > > >> session <- browserSession("UCSC") > >> q2 <- ucscTableQuery(session, "all_mrna") > >> q2 > >> > > Get track 'all_mrna' within hg18:*:*-* > > > > That is, you are probably not interested in the (default) chrX but the > > entire genome. I keep going back and forth on the logic for deriving the > > ranges. I made the change based on the reasoning that one will be working > > through a session, with one or more views, and end up at a region of > > interest. As the query is constructed from the session, the user would > > expect the default query range to be that of the session. In fact, this > is > > the behavior of the UCSC table browser. > > > > To get what you need, you'll want to specify the range when constructing > the > > query, e.g: > > > > > >> q2 <- ucscTableQuery(session, "all_mrna", GenomicRanges(genome = > "hg18")) > >> > > > > To get the entire genome's worth of data. I just tweaked the logic to > make > > this work in version 1.5.20. Btw, you don't need the genome<- call at the > > beginning, just do it at query construction time. > > > > Michael > > > > On Sun, Oct 4, 2009 at 3:41 PM, Mark Robinson <mrobinson@wehi.edu.au> > wrote: > > > > > >> Hi Michael. > >> > >> It looks like I'm still getting some unusual behaviour with 1.5.19 > (2.10) > >> ... and am unsuccessful with the mrna table on 1.4.1 (2.9). > >> > >> I'm sending the output for both versions below. It seems 1.5.19 is only > >> gathering results for a small region on chrX and 1.4.1 doesn't accept > the > >> "all_mrna" table. I'm using fairly old versions of R 2.9 and 2.10, but > I'd > >> guess that that is not the issue. > >> > >> Any suggestions? > >> > >> Thanks, > >> Mark > >> > >> > >> [1.4.1 -- 2.9] > >> --------------------------------- > >> > >>> library(rtracklayer) > >>> > >> Loading required package: RCurl > >> Loading required package: bitops > >> > >>> session <- browserSession("UCSC") > >>> genome(session) <- "hg18" > >>> q1 <- ucscTableQuery(session, "refGene") > >>> q1 > >>> > >> Get track 'refGene' within hg18:*:*-* > >> > >>> refGene <- getTable(q1) > >>> q2 <- ucscTableQuery(session, "all_mrna") > >>> > >> Error in normArgTrack(x, track) : Unknown track: all_mrna > >> > >>> sessionInfo() > >>> > >> R version 2.9.1 (2009-06-26) > >> i386-apple-darwin8.11.1 > >> > >> locale: > >> en_CA.UTF-8/en_CA.UTF-8/C/C/en_CA.UTF-8/en_CA.UTF-8 > >> > >> attached base packages: > >> [1] stats graphics grDevices utils datasets methods base > >> > >> other attached packages: > >> [1] rtracklayer_1.4.1 RCurl_0.98-1 bitops_1.0-4.1 > >> > >> loaded via a namespace (and not attached): > >> [1] Biobase_2.4.1 Biostrings_2.12.9 BSgenome_1.12.3 IRanges_1.2.3 > >> [5] XML_2.6-0 > >> --------------------------------- > >> > >> > >> > >> [1.5.19 -- 2.10] > >> --------------------------------- > >> > >>> library(rtracklayer) > >>> > >> Loading required package: RCurl > >> Loading required package: bitops > >> > >>> session <- browserSession("UCSC") > >>> genome(session) <- "hg18" > >>> q1 <- ucscTableQuery(session, "refGene") > >>> q1 > >>> > >> Get track 'refGene' within hg18:chrX:151073054-151383976 > >> > >>> refGene <- getTable(q1) > >>> q2 <- ucscTableQuery(session, "all_mrna") > >>> q2 > >>> > >> Get track 'all_mrna' within hg18:chrX:151073054-151383976 > >> > >>> mrna <- getTable(q2) > >>> sessionInfo() > >>> > >> R version 2.10.0 Under development (unstable) (2009-05-22 r48594) > >> i386-apple-darwin8.11.1 > >> > >> locale: > >> [1] en_CA.UTF-8/en_CA.UTF-8/C/C/en_CA.UTF-8/en_CA.UTF-8 > >> > >> attached base packages: > >> [1] stats graphics grDevices utils datasets methods base > >> > >> other attached packages: > >> [1] rtracklayer_1.5.19 RCurl_0.98-1 bitops_1.0-4.1 > >> > >> loaded via a namespace (and not attached): > >> [1] Biobase_2.5.8 Biostrings_2.13.48 BSgenome_1.13.16 > IRanges_1.3.87 > >> [5] XML_2.6-0 > >> --------------------------------- > >> > >> On 3-Oct-09, at 11:10 AM, Michael Lawrence wrote: > >> > >> > >> > >>> On Thu, Oct 1, 2009 at 11:47 PM, Mark Robinson <mrobinson@wehi.edu.au> > >>> wrote: > >>> Hello BioC list. > >>> > >>> I've encountered a small problem when trying to download some tracks > from > >>> UCSC with rtracklayer. Specifically, I'm trying to get the "mrna" > table. > >>> The example below works great for the "refGene" table, but gives an > error > >>> for the "mrna" table. > >>> > >>> Any suggestions? > >>> > >>> > >>> Looked into this and found that the table browser has a slightly > different > >>> set of tracks from the browser itself. I added a trackNames method > (version > >>> 1.5.19) on the UCSCTableQuery, so that it's possible to know which > tracks > >>> are available. > >>> > >>> From that, you'll see that 'mrna' needs to be 'all_mrna'. > >>> > >>> Michael > >>> > >>> Thanks, > >>> Mark > >>> > >>> > >>>> library(rtracklayer) > >>>> > >>> Loading required package: RCurl > >>> Loading required package: bitops > >>> > >>>> session <- browserSession("UCSC") > >>>> genome(session) <- "hg18" > >>>> options(stringsAsFactors=FALSE) > >>>> q1 <- ucscTableQuery(session, "refGene",GenomicRanges(197848272, > >>>> > >>> 198064461, "chr3")) > >>> > >>>> tab1 <- getTable(q1) > >>>> head(tab1[,1:5]) > >>>> > >>> bin name chrom strand txStart > >>> 1 2094 NM_198565 chr3 + 197851052 > >>> 2 261 NM_032898 chr3 - 197917544 > >>> 3 2095 NM_017861 chr3 + 197923642 > >>> 4 2095 NM_002577 chr3 + 197951124 > >>> > >>>> q2 <- ucscTableQuery(session, "mrna",GenomicRanges(197848272, > 198064461, > >>>> > >>> "chr3")) > >>> > >>>> tab2 <- getTable(q2) > >>>> > >>> Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, > na.strings, > >>> : > >>> line 1 did not have 2 elements > >>> > >>>> traceback() > >>>> > >>> 5: scan(file = file, what = what, sep = sep, quote = quote, dec = dec, > >>> nmax = nrows, skip = 0, na.strings = na.strings, quiet = TRUE, > >>> fill = fill, strip.white = strip.white, blank.lines.skip = > >>> blank.lines.skip, > >>> multi.line = FALSE, comment.char = comment.char, allowEscapes = > >>> allowEscapes, > >>> flush = flush, encoding = encoding) > >>> 4: read.table(f, sep = "\t", header = TRUE) > >>> 3: .local(object, ...) > >>> 2: getTable(q2) > >>> 1: getTable(q2) > >>> > >>>> sessionInfo() > >>>> > >>> R version 2.9.1 (2009-06-26) > >>> i386-apple-darwin8.11.1 > >>> > >>> locale: > >>> en_CA.UTF-8/en_CA.UTF-8/C/C/en_CA.UTF-8/en_CA.UTF-8 > >>> > >>> attached base packages: > >>> [1] stats graphics grDevices utils datasets methods base > >>> > >>> other attached packages: > >>> [1] rtracklayer_1.4.1 RCurl_0.98-1 bitops_1.0-4.1 > >>> > >>> loaded via a namespace (and not attached): > >>> [1] Biobase_2.4.1 Biostrings_2.12.8 BSgenome_1.12.3 IRanges_1.2.3 > >>> [5] XML_2.6-0 > >>> > >>> > >>> > >>> > >>> > >>> --------- > >>> library(rtracklayer) > >>> session <- browserSession("UCSC") > >>> genome(session) <- "hg18" > >>> options(stringsAsFactors=FALSE) > >>> qry1 <- ucscTableQuery(session, "refGene",GenomicRanges(197848272, > >>> 198064461, "chr3")) > >>> tab1 <- getTable(qry1) > >>> head(tab1[,1:5]) > >>> qry2 <- ucscTableQuery(session, "mrna",GenomicRanges(197848272, > 198064461, > >>> "chr3")) > >>> tab2 <- getTable(qry2) > >>> traceback() > >>> sessionInfo() > >>> --------- > >>> > >>> > >>> ------------------------------ > >>> Mark Robinson, PhD (Melb) > >>> Epigenetics Laboratory, Garvan > >>> Bioinformatics Division, WEHI > >>> e: m.robinson@garvan.org.au > >>> e: mrobinson@wehi.edu.au > >>> p: +61 (0)3 9345 2628 > >>> f: +61 (0)3 9347 0852 > >>> > >>> _______________________________________________ > >>> Bioconductor mailing list > >>> Bioconductor@stat.math.ethz.ch > >>> https://stat.ethz.ch/mailman/listinfo/bioconductor > >>> Search the archives: > >>> http://news.gmane.org/gmane.science.biology.informatics.conductor > >>> > >>> > >>> > >> ------------------------------ > >> Mark Robinson, PhD (Melb) > >> Epigenetics Laboratory, Garvan > >> Bioinformatics Division, WEHI > >> e: m.robinson@garvan.org.au > >> e: mrobinson@wehi.edu.au > >> p: +61 (0)3 9345 2628 > >> f: +61 (0)3 9347 0852 > >> ------------------------------ > >> > >> > >> > >> > >> > >> > >> > > > > [[alternative HTML version deleted]] > > > > _______________________________________________ > > Bioconductor mailing list > > Bioconductor@stat.math.ethz.ch > > https://stat.ethz.ch/mailman/listinfo/bioconductor > > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor > > > > > > [[alternative HTML version deleted]]
ADD COMMENTlink written 8.2 years ago by Michael Lawrence620
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.2.0
Traffic: 169 users visited in the last hour