Hi there,
I am very new to Bioconductor and to bioinformatics in general. I have a set of siRNA sequences and need to find their target genes in the human genome.
I was advised to download the genome via AnnotationHub.
I managed to download the file "Homo_sapiens.GRCh37.74.dna.toplevel.fa" via AnnotationHub, though I am not sure that is the right source to use.
I was a bit confused by the error message:
In curl::curl_fetch_disk(url, x$path, handle = handle) : progress callback must return boolean
The data object now looks like this:
> res
class: FaFile
path: /Users/niederle/.AnnotationHub/12356
index: /Users/niederle/.AnnotationHub/16142
isOpen: TRUE
yieldSize: NA
Then, I should convert that into a Biostrings object, but I have not managed to do that.
If I call readDNAStringSet() on the downloaded file, printing the result ends in an error:
> genome <- readDNAStringSet("/Users/xyz/.AnnotationHub/12356")
> genome
  A DNAStringSet instance of length 346
          width seq                                                names
  [1] 249250621 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN...NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 1 dna:chromosome ...
  [2] 135534747 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN...NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 10 dna:chromosome...
  [3] 135006516 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN...NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 11 dna:chromosome...
  [4] 133851895 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN...NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 12 dna:chromosome...
  [5] 115169878 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN...NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 13 dna:chromosome...
  ...       ... ...
[342]         0  /
[343]      1006 AABBCCDDGNGHHHKKMMNNRRSSTTVVWWYYYTKCMAABBBCCDD-GGHHKKMWMNSNNCRR...DDHGGHWHRGKAKMMNNRRSSTTVVWSWKYYWGC.TNYVMA.YST-VSH+D+-W-.T.KC.W
Error in nchar(snippet_name) : invalid multibyte string 1
I don't know how to continue from here, and I also don't know how to extract the gene information. Can anybody give me some useful hints?
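To make the goal concrete, here is roughly the kind of search I have in mind, as a minimal sketch on made-up toy sequences rather than the real genome. The names chrA and chrB and the siRNA sequence are invented for illustration. I am also assuming that vmatchPattern() from Biostrings is a reasonable tool for exact matching, and that the site to look for is the reverse complement of the siRNA guide strand:

```r
library(Biostrings)

# Toy stand-in for the genome: two made-up sequences (hypothetical names).
genome <- DNAStringSet(c(chrA = "ACGTACGTTTGACCAGTACG",
                         chrB = "TTTTGGCCAAGGTTCCAAGG"))

# Made-up siRNA guide strand, written as RNA; swap U for T to get DNA letters.
sirna_rna <- "CUGGUCAA"
sirna_dna <- DNAString(chartr("U", "T", sirna_rna))

# Assumption: the guide strand pairs with its target, so the site to look
# for on the sense strand is the reverse complement of the guide.
target <- reverseComplement(sirna_dna)

# Exact-match search for the target site in every sequence of the set.
hits <- vmatchPattern(target, genome)

# Number of hits per sequence, and the start positions of those hits.
hit_counts <- elementNROWS(hits)
hit_starts <- startIndex(hits)
print(hit_counts)
print(hit_starts)
```

What I am missing is how to run something like this against the real sequences and then map the hit positions to genes.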