Hey Martin, thanks for the answer!
But now I have a different problem - when running:
AnnotationHub()[['AH47030']]
I get this error:
snapshotDate(): 2015-11-19
downloading from ‘https://annotationhub.bioconductor.org/fetch/52498’
‘https://annotationhub.bioconductor.org/fetch/52499’
retrieving 2 resources
Error: failed to load 'AnnotationHub' resource
name: AH47030
title: clinvar.vcf.gz from dbSNP, GRCh38_b142 assembly
reason: 2 resources failed to download
In addition: There were 15 warnings (use warnings() to see them)
Followed by:
Warning messages:
1: 'AnnotationHub' database may not be current
database: ‘C:/Users/ElhananBY/Documents/AppData/.AnnotationHub/annotationhub.sqlite3’
reason: Problem with the SSL CA cert (path? access rights?)
2: In curl::curl_fetch_disk(url, x$path, handle = handle) :
progress callback must return boolean
...
7: In curl::curl_fetch_disk(url, x$path, handle = handle) :
progress callback must return boolean
8: download failed
hub path: ‘https://annotationhub.bioconductor.org/fetch/52498’
cache path: ‘C:/Users/ElhananBY/Documents/AppData/.AnnotationHub/52498’
reason: Problem with the SSL CA cert (path? access rights?)
9: In curl::curl_fetch_disk(url, x$path, handle = handle) :
progress callback must return boolean
...
14: In curl::curl_fetch_disk(url, x$path, handle = handle) :
progress callback must return boolean
15: download failed
hub path: ‘https://annotationhub.bioconductor.org/fetch/52499’
cache path: ‘C:/Users/ElhananBY/Documents/AppData/.AnnotationHub/52499’
reason: Problem with the SSL CA cert (path? access rights?)
Also, when trying to insert AnnotationHub() to a variable:
> hub = AnnotationHub()
Warning message:
'AnnotationHub' database may not be current
database: ‘C:/Users/ElhananBY/Documents/AppData/.AnnotationHub/annotationhub.sqlite3’
reason: Problem with the SSL CA cert (path? access rights?)
Further, the last two sources don't work for me:
AH47022 ftp://ftp.ncbi.nih.gov/snp/organisms/human_9606_b142_GRCh37p13/VCF/clinical_vcf_set/clinvar.vcf.gz
AH47030 ftp://ftp.ncbi.nih.gov/snp/organisms/human_9606_b142_GRCh38/VCF/clinical_vcf_set/clinvar.vcf.gz
The address seems to be wrong - it shouldn't contain "clinical_vcf_set", but rather just be:
ftp://ftp.ncbi.nih.gov/snp/organisms/human_9606_b142_GRCh38/VCF/clinvar-latest.vcf.gz
So now I just manually downloaded the vcf.gz file and used readvcf to input it, which seems to work.
Thanks
Just a follow up to let you know this is fixed in AnnotationHub 2.3.16 in devel and 2.2.5 in release. Both should be available with biocLite() tomorrow ~noon PST or immediately from svn:
svn co https://hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/AnnotationHub
svn co https://hedgehog.fhcrc.org/bioconductor/branches/RELEASE_3_2/madman/Rpacks/AnnotationHub
Records for the old files (moved, changed, no longer available, etc.) have been removed and replacements for GRCh37 and GRCh38 have been added. The snapshot date you want is 2016-03-09.
This is what you'll see when trying to access a resource that's no longer available:
> hub[["AH47030"]]
Error in .AnnotationHub_get1(x[idx]) :
no records found for the given index
> hub["AH47030"]
AnnotationHub with 0 records
# snapshotDate(): 2016-03-09
Maybe we could make that first message even more informative by adding the date removed (if any).
Valerie
Still a problem for me.
hub = AnnotationHub()
snapshotDate(): 2016-03-09
> gse = query(hub, "GSE62944")[[1]]
Error: failed to load 'AnnotationHub' resource
name: AH28855
title: RNA-Sequencing and clinical data for 7706 tumor samples from The Cancer Genome Atlas
reason: 1 resources failed to download
In addition: Warning message:
download failed
hub path: ‘https://annotationhub.bioconductor.org/fetch/34295’
cache path: ‘C:/LispHome/AppData/.AnnotationHub/34295’
reason: Forbidden (HTTP 403).
Any tips?
Thanks for reporting this. Just to clarify, your original question was about accessing clinvar files; these have been re-generated and are now available. Now you're asking about accessing the GSE62944 TCGA data, this is a different resource unrelated to the original problem. It's helpful (for me) to keep problems with the different resources in separate posts.
The GSE62944 data is in an S3 bucket. For some reason access was restricted (not public). I have fixed that and the resource is now available:
In the future please open a new issue if the data are not the same type as referenced in the original.
Valerie
Thank you so much. Works now