Hello All,
I'm dealing with a rather complex RNAseq experiment that follows a nested structure of the tested variables that could be represented like this:
SampleName tissue temp time dev_stage rep
Sample1 crown 21 am ds1 rep1
Sample2 crown 21 am ds1 rep3
Sample3 crown 21 am ds1 rep4
Sample4 crown 21 am ds2 rep1
Sample5 crown 21 am ds2 rep2
Sample6 crown 21 am ds2 rep3
Sample7 crown 21 am ds2 rep4
Sample8 crown 21 am ds3 rep1
Sample9 crown 21 am ds3 rep2
Sample10 crown 21 am ds3 rep3
Sample11 crown 21 am ds3 rep4
From the several related posts related to experiment design on multifactor experiments, I came up with the design formula which looks like this:
exp.dds=DESeqDataSetFromMatrix(countData = counts_noZeros.df,
colData = exp_coldata,
design = ~0+tissue:temp:time:dev_stage)
I'm interested in determining the effect of these 4 variables on gene expression. For which I've 2 questions,
1) If I keep the design as is, is it correct that DESeq will account for a gene's expression considering a nested effect of these variables? In which case, is the model representation correct?
2) If I want to determine expression of only one of these variables should the design formula look something like this?
design = ~0+tissue
Any help is appreciated. With thanks.
Best, Sandeep
Hi Michael,
Sorry I should have explained the exp. design better, the full experiment consists of an expression dataset comprising:
The metadata therefore looks something like this:
The key question is -- can I test the effect of any single variable at one time? In which case, I fear I might be ignoring the confounding effect of the other variables. Or should I use a nested model to the level to which I want to test the effect of a variable. For instance, if I had to check the effect of time, would the correct design be -
or should it be
Let me know if I missed something in either of these models.
Thanks.
Best, Sandeep
I don’t have spare time unfortunately to work out what user’s statistical analysis and designs should be, but I have to limit my time on the support site for software related questions.
Hi Michael,
Sorry I should have explained the exp. design better, the full experiment consists of an expression dataset comprising:
The metadata therefore looks something like this:
The key question is -- can I test the effect of any single variable at one time? In which case, I fear I might be ignoring the confounding effect of the other variables. Or should I use a nested model to the level to which I want to test the effect of a variable. For instance, if I had to check the effect of time, would the correct design be -
or should it be
Let me know if I missed something in either of these models.
Thanks.
Best, Sandeep