Dear R great helpers,
I'm very naive in RNA-seq analysis, and I'm practicing bias correction for the data with 'seqbias' package.
I'm working with RNA-seq data acquired from several species using various Ensembl genome fasta. The vignette is easy to follow. However, in Sampling topic (page 2), when applying with real genome fasta file, I'm not able to figure the suitable the number of interavals and bases required (due to my limited knowledge). In brief, how can I come up with 'n' and 'm' values in this command line on page 2 of vignette.
> I <- random.intervals( ref_seqs, n = 5, m = 100000 )
Please suggest me,
Kaj
R version 3.1.2 (2014-10-31)
Platform: x86_64-pc-linux-gnu (64-bit)
locale:
[1] LC_CTYPE=en_GB.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_GB.UTF-8 LC_COLLATE=en_GB.UTF-8
[5] LC_MONETARY=en_GB.UTF-8 LC_MESSAGES=en_GB.UTF-8
[7] LC_PAPER=en_GB.UTF-8 LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_GB.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats4 parallel stats graphics grDevices utils datasets
[8] methods base
other attached packages:
[1] Rsamtools_1.18.3 seqbias_1.14.0 Biostrings_2.34.1
[4] XVector_0.6.0 GenomicRanges_1.18.4 GenomeInfoDb_1.2.5
[7] IRanges_2.0.1 S4Vectors_0.4.0 BiocGenerics_0.12.1
loaded via a namespace (and not attached):
[1] bitops_1.0-6 tools_3.1.2 zlibbioc_1.12.0