Entering edit mode
m a
▴
10
@m-a-3789
Last seen 10.6 years ago
Hello,
I would like to make simple statistics on a specific DNA sequence. In
order
to do that a need to import a sequence with a fasta extension.
http://www.ncbi.nlm.nih.gov/nuccore/9626243?report=fasta&log$=seqview
After download I run the folliwing code with the package seqinr :
dnafile <- system.file("sequences/seqbac.fasta", package = "seqinr")
cc<-read.fasta(file = dnafile)
cc gives me then the following vector
...
[47764] "t" "c" "c" "c" "t"
......
My problem is I would like now to use that vector to perform basic
statistics eg; GC content analysis, base frequencies . I hardly see
how ?
For instance an histogram on my vector like hist(cc) don't work
My first intention by the way was to use biostring package to import
fasta
file, like readFASTA(" directory",strip.desc=TRUE). But how sould I
know
under which directory I have to put data ? Because I ve tried few
directories but he still do not found my data
Thanks in advance,
Moses
Student in biostatistics
[[alternative HTML version deleted]]