DESeq2 with uneven group size
1
0
Entering edit mode
@zohaib-anwar-22899
Last seen 5.8 years ago
Denmark

Dear all,

I have analyzed RNA-seq dataset, see the experimental design below.
The two questions we are trying to answer are:

  1. Find DE genes between different conditions. It can be seen that each condition has 4 samples (biological replicates) for which I performed pairwise differential expression using DESeq2. e.g. (A-B), (A-C), (A-D), (A-E), (B-C) .... (D-E). I can see the DE genes.

  2. Find DE genes and fold change difference between 2 Events X and Y as listed in column 3 in table below. In this case the unequal group sizes has raised some concerns by the reviewer of the study.

| Samples   | Condition     | Event     | Replicates    |
|---------- |-----------    |-------    |------------   |
| Sample1   | A             | X         | 1             |
| Sample2   | A             | X         | 2             |
| Sample3   | A             | X         | 3             |
| Sample4   | A             | X         | 4             |
| Sample5   | B             | X         | 1             |
| Sample6   | B             | X         | 2             |
| Sample7   | B             | X         | 3             |
| Sample8   | B             | X         | 4             |
| Sample9   | C             | X         | 1             |
| Sample10  | C             | X         | 2             |
| Sample11  | C             | X         | 3             |
| Sample12  | C             | X         | 4             |
| Sample13  | D             | X         | 1             |
| Sample14  | D             | X         | 2             |
| Sample15  | D             | X         | 3             |
| Sample16  | D             | X         | 4             |
| Sample17  | E             | Y         | 1             |
| Sample18  | E             | Y         | 2             |
| Sample19  | E             | Y         | 3             |
| Sample20  | E             | Y         | 4             |

To my understanding DESeq2 is able to calculate DE for uneven group sizes as it calculates the group means before fold change calculation. Also in an example in DESeq2 vignette, different number of replicates for each condition were used though the difference was not as high as in this study. I would like feedback on this design to improve the statistical analysis.

Best regards, Zohaib

deseq2 • 1.1k views
ADD COMMENT
0
Entering edit mode
@mikelove
Last seen 2 days ago
United States

There's not really a software question here, or if there is I can't see it clearly. DESeq2 can perform inference regardless of whether the sample sizes are balanced or not. The power is limited in unbalanced designs with respect to balanced, but there isn't anything the software can do about that.

If you want to compare E to the average of A-D you can use a numeric contrast, see ?results, and the numeric value would be, e.g. c(-1/4, -1/4, -1/4, -1/4, 1) with a design of ~0 + condition.

ADD COMMENT

Login before adding your answer.

Traffic: 792 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6