Hi there,
I am very new to Bioconductor and also to the field of Bioinformatics. However, I have a bunch of siRNA-sequences and I need to find the target genes (human genome).
I got the advice to download the genome via AnnotationHub.
I managed to download the file "Homo_sapiens.GRCh37.74.dna.toplevel.fa" via AnnotationHub, though I am not sure that is the right source to use.
I was a bit confused by the error message:
In curl::curl_fetch_disk(url, x$path, handle = handle) : progress callback must return boolean
The data object now looks like this:
> res
class: FaFile
path: /Users/niederle/.AnnotationHub/12356
index: /Users/niederle/.AnnotationHub/16142
isOpen: TRUE
yieldSize: NA
Then, I should convert that into BioStrings but I did not manage to do that.
If I call:
> genome <- readDNAStringSet("/Users/xyz/.AnnotationHub/12356")
> genome
> genome
A DNAStringSet instance of length 346
width seq names
[1] 249250621 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN...NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 1 dna:chromosome ...
[2] 135534747 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN...NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 10 dna:chromosome...
[3] 135006516 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN...NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 11 dna:chromosome...
[4] 133851895 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN...NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 12 dna:chromosome...
[5] 115169878 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN...NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 13 dna:chromosome...
... ... ...
[342] 0 /
[343] 1006 AABBCCDDGNGHHHKKMMNNRRSSTTVVWWYYYTKCMAABBBCCDD-GGHHKKMWMNSNNCRR...DDHGGHWHRGKAKMMNNRRSSTTVVWSWKYYWGC.TNYVMA.YST-VSH+D+-W-.T.KC.W
Error in nchar(snippet_name) : invalid multibyte string 1
I don't know how to continue. Can anybody give me some useful hints? Anyhow, I don't know how to extract the gene information.
