Entering edit mode
megapode32559
•
0
@megapode32559-23129
Last seen 4.7 years ago
I see this count matrix. But it is not clear what rows correspond to differentially expressed genes. Is there a way to know what genes are set as differentially expressed in the simulation? Thanks.
R> library(DESeq2)
R> dds <- makeExampleDESeqDataSet()
R> head(dds@assays@data@listData$counts)
sample1 sample2 sample3 sample4 sample5 sample6 sample7 sample8
sample9 sample10 sample11 sample12
[1,] 58 92 164 76 54 79 106 104
87 53 45 52
[2,] 8 17 21 6 7 9 28 13
10 15 24 24
[3,] 0 0 0 0 1 0 0 0
2 0 0 6
[4,] 77 112 58 91 42 61 81 45
93 79 98 114
[5,] 34 119 48 29 17 44 67 28
35 26 28 32
[6,] 9 4 0 0 0 3 0 0
1 0 1 1
I am not asking for the results of DESeq(). I am asking in the simulated data without going through DESeq() function. How to know what genes are set to differentially expressed during the simulation?
These are 'random' negative binomial distributed integers, which simulate RNA-seq read counts. As their generation is random, the genes that are statistically significantly differentially expressed is also, therefore, random:
The code:
There is a column in
mcols(dds)
which gives the true effect size / LFC across the condition.Note that
betaSD
can be specified as a vector, so you can use it to generate null NB genes if you want for the first X% of genes, etc.