Hello,
So, I am working with RNA-seq data and voom/limma workflow. After I finished my workflow and get the list of DE genes, I tried to filter by p-value. I choose <=0.1 for my cutoff value. After that, I chekced one of gene and it isn't in my list. I noticed that it maybe because the p-value and it is. The p-value is big, almost 0.2. I tried to find the reason why p-value is big. I check the read count. The raw read count is actually quite big. Below is the readcount:
1 | Cat_1_1 | 2097.070 |
2 | Cat_1_2 | 1866.160 |
3 | Cat_1_3 | 2539.440 |
4 | Cat_1_4 | 2048.650 |
5 | Cat_1_5 | 1628.770 |
6 | Cat_1_7 | 3241.710 |
7 | Cat_2_1 | 807.168 |
8 | Cat_2_2 | 7171.430 |
9 | Cat_2_3 | 8759.580 |
10 | Cat_3_1 | 1213.360 |
11 | Cat_3_2 | 339.301 |
12 | Cat_3_3 | 2096.140 |
13 | Cat_3_4 | 888.941 |
14 | Cat_3_5 | 1381.800 |
15 | Cat_3_6 | 3281.890 |
16 | Cat_3_7 | 2498.580 |
Below is values after I used voom:
row.names | x | |
---|---|---|
1 | Cat_1_1 | 5.383155 |
2 | Cat_1_2 | 5.202185 |
3 | Cat_1_3 | 5.568586 |
4 | Cat_1_4 | 5.500774 |
5 | Cat_1_5 | 5.625384 |
6 | Cat_1_7 | 5.878762 |
7 | Cat_2_1 | 4.356846 |
8 | Cat_2_2 | 7.397590 |
9 | Cat_2_3 | 7.628476 |
10 | Cat_3_1 | 5.568482 |
11 | Cat_3_2 | 4.578878 |
12 | Cat_3_3 | 6.982345 |
13 | Cat_3_4 | 5.806681 |
14 | Cat_3_5 | 6.466003 |
15 | Cat_3_6 | 7.618232 |
16 | Cat_3_7 | 7.357043 |
I checked the Cat_1 vs Cat_3. What is the reason the p-value is big? With that read count, I hope I can get the p-value to be significant and that gene is one of the important gene to check.
What is your experimental design? Based on what you say you're comparing, I assume that this is a one-way layout with three groups - Cat_1, Cat_2 and Cat_3 - is that correct? Also, why are your read counts not integer values?