Search
Question: R getting "stuck" with RoadMap resource retrieval through AnnotationHub
0
gravatar for vanesa_1230
6 months ago by
vanesa_12300 wrote:

Greetings!

I'm having a hard time trying to retrieve RoadMap resources using AnnotationHub (e.g.  ah[['AH49441']] ). The download does reach a 100%, but then the process will never exit. I don't get any error message, but I left this overnight to see if it would eventually end the process but it just didn´t happen. Is there anything going on with the server? Thanks in advance

ADD COMMENTlink modified 6 months ago • written 6 months ago by vanesa_12300
0
gravatar for James W. MacDonald
6 months ago by
United States
James W. MacDonald45k wrote:

Seems to work for me. Maybe try again?

> hub <- AnnotationHub()
updating metadata: retrieving 1 resource
  |======================================================================| 100%
snapshotDate(): 2017-04-24
> z <- hub[["AH49441"]]
require("rtracklayer")
downloading from 'https://annotationhub.bioconductor.org/fetch/56161'
retrieving 1 resource
  |======================================================================| 100%
> z
BigWigFile object
resource: C:/Users/jmacdon/AppData/Roaming/AppData/.AnnotationHub/56161
> mcols(hub)["AH49441",]
DataFrame with 1 row and 15 columns
                                         title   dataprovider      species
                                   <character>    <character>  <character>
AH49441 E002_RRBS_FractionalMethylation.bigwig BroadInstitute Homo sapiens

> import(z)
GRanges object with 3878598 ranges and 1 metadata column:
            seqnames               ranges strand |             score
               <Rle>            <IRanges>  <Rle> |         <numeric>
        [1]     chr1       [10497, 10497]      * | 0.970000028610229
        [2]     chr1       [10498, 10498]      * | 0.970000028610229
        [3]     chr1       [10525, 10525]      * | 0.970000028610229
        [4]     chr1       [10526, 10526]      * | 0.970000028610229
        [5]     chr1       [10542, 10542]      * |              0.75
        ...      ...                  ...    ... .               ...
  [3878594]     chrY [59033065, 59033065]      * | 0.699999988079071
  [3878595]     chrY [59033091, 59033091]      * | 0.280000001192093
  [3878596]     chrY [59033092, 59033092]      * | 0.280000001192093
  [3878597]     chrY [59033101, 59033101]      * | 0.230000004172325
  [3878598]     chrY [59033102, 59033102]      * | 0.230000004172325
  -------
  seqinfo: 24 sequences from an unspecified genome
>
ADD COMMENTlink written 6 months ago by James W. MacDonald45k
0
gravatar for shepherl
6 months ago by
shepherl ♦♦ 510
shepherl ♦♦ 510 wrote:

I also cannot reproduce the problem.  Could you please provide the exact code you ran as well as the results of sessionInfo(). 

ADD COMMENTlink written 6 months ago by shepherl ♦♦ 510

Thank you for your answers. I have now been able to retrieve 3 different resourses at most before R or RStudio gets "stuck". I actually have to terminate R since it wouldn't allow me to manually stop the process. I'm not sure if the data set I'm trying to retrieve could be too large or there's something going on with the internet connection. This is the code I'm using:

> ah <- AnnotationHub()

snapshotDate(): 2016-10-11

>

> #SUBSET BROAD INSTITUTE DATA

> epiRoad <- subset(ah, species=="Homo sapiens" & dataprovider=="BroadInstitute")

>

> #SELECT CONSOLIDATED GENOMES

> epiRoad_consolidated <- query(epiRoad, c("E{1}[0-9]{3}[^0-9]", "EID_metadata"), pattern.op=`|`)

>

> #SELECT H3K4ME1 NARROW PEAKS

> narrow.h3k4me1 <- query(epiRoad_consolidated, c("h3k4me1", "narrow ChIP-seq"))

>

> data.narrow.h3k4me1 <- lapply(X = names(narrow.h3k4me1), FUN = function(x){epiRoad_consolidated[[x]]})

require(“rtracklayer”)

downloading from ‘https://annotationhub.bioconductor.org/fetch/35299’

retrieving 1 resource

  |==============================================================================| 100%

ADD REPLYlink modified 6 months ago • written 6 months ago by vanesa_12300
1

I'm still having trouble reproducing as I can run your code above with no issues.  What is your output of sessionInfo()? We want to make sure you are using the most updated versions of all the packages for your version of R, you can check by using biocLite().  From the look of the snapshot date of AnnotationHub, I would guess that you are running an older version of R and AnnotationHub;  I would suggest if you continue to have issues to update to the most current release. Also, you might consider checking your system's memory to make sure you have enough space to download.  

ADD REPLYlink modified 6 months ago • written 6 months ago by shepherl ♦♦ 510

I really appreciate all your collaboration. I believe that I finally found out that the issue was the network connection...I feel so ashamed. Eventhough some packages are not the most updated version, the code is being running today successfully without the need of making any update. Today's data transfer rate is still low, but at least it's allowing me to do the download. Sorry for the inconvenience; I didn't check this yesterday since I didn't really believe it could be that bad. Many thanks!!!

 

ADD REPLYlink written 6 months ago by vanesa_12300
1

No problem.  We've all dealt with slow network issues.  Feel free to post more if you find you are having any other issues. 

ADD REPLYlink written 6 months ago by shepherl ♦♦ 510
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.2.0
Traffic: 141 users visited in the last hour