I have an RNA-seq dataset with 8 samples (genes as rows, samples as columns):
AA - FP SC T1 (Follicular phase, solvent control, replicate 1)
AB - FP T1 (Follicular phase, treated, replicate 1)
AC - FP SC T2 (Follicular phase, solvent control, replicate 2)
AD - FP T2 (Follicular phase, treated, replicate 2)
AG - LP SC T1 (Luteal phase, solvent control, replicate 1)
AE - LP T1 (Luteal phase, treated, replicate 1)
AF - LP SC T2 (Luteal phase, solvent control, replicate 2)
AH - LP T2 (Luteal phase, treated, replicate 2)
My goal is to perform differential expression analysis between FP and LP, but I want to remove the effect of solvent controls so that I capture only the biological differences between the two phases.
What would be the correct way to handle the solvent controls in DESeq2 for this type of design?
Possible approaches I considered
Subtract SC from treated counts within each replicate before running DESeq2 (e.g., AB-AA = FP_T1_corrected).: This reduces to 4 corrected samples: FP_T1, FP_T2, LP_T1, LP_T2. But here I am concerned about whether the subtraction can create negative counts, and only 2 replicates per group remain.
Model SC directly in DESeq2 using an interaction term: Metadata includes phase (FP/LP), condition (SC/Treated), and replicate (T1/T2).
design ~ replicate + phase + condition + phase:condition
Then use the interaction term (phaseLP.conditionTreated) to test whether LP vs FP differences hold after adjusting for SC.
What is the best practice in this situation?
I have checked DEGs between Solvent Control (SC) of FP and LP and there are noticebale differences in terms of DEGs.
This has identified quite a good number of DEGs.
Next approach,
How do I calculate the res here by negating the SC effect?
res <- results(dds, name = "phaseLP.conditionTreated")
Not sure I understand. You ask how to do something and then show how one would do so.
I would like to clarify whether the command
res <- results(dds, name = "phaseLP.conditionTreated")
is the correct approach to generate differentially expressed genes (DEGs) when aiming to account for or negate the effect of the solvent control.