Hi,
I have a custom genome assembly contained in .fasta format. Sequences for each of the chromosomes are compiled into a single .fasta file. I'd like to forge a BSgenome object for this genome assembly (not publicly available) I've looked through the vignette and I'm stuck on how to construct the seed object. I know how to generate a BSgenome object using forgeBSgenomeDataPkgFromNCBI(), but I'm stuck on how to construct a BSgenome object from a genome assembly without an accession. Can anyone guide me through the next steps?
Thanks!
setwd("C:\\...\\Desktop\\wd\\dump\\seqs_srcdir")
e122<-readDNAStringSet("E122.fasta", format="fasta",use.names=TRUE)
chroms<-Seqinfo(
seqnames=c("chrA","chrB","chrC","chrD","chrE","chrF","chrMT"),
seqlengths=width(e122),
isCircular=as.logical(c("FALSE","FALSE","FALSE","FALSE","FALSE","FALSE","TRUE"),
genome=c("E122","E122","E122","E122","E122","E122","E122")))
chroms
Seqinfo object with 7 sequences (1 circular) from an unspecified genome:
seqnames seqlengths isCircular genome
chrA 2326463 FALSE <NA>
chrB 3127439 FALSE <NA>
chrC 3366774 FALSE <NA>
chrD 3699297 FALSE <NA>
chrE 4325132 FALSE <NA>
chrF 4126616 FALSE <NA>
chrMT 47890 TRUE <NA>
I've tried the following, but I must be missing something:
This is my seed.dcf file:
You are. The reason I pointed you to
forgeBSgenomeDataPkgFromTwobitFile
is because it's much simpler, doesn't require a seed file, and is the preferred way of making aBSgenome
package. In addition, you don't appear to be setting your seed file up correctly anyway. But since the function I have pointed you to only requires the 2bit file, and is easier to use, I am not going to try to figure out where you went wrong, and instead will point you to the new function instead.