Note: I do not have biological replicates. I have read the warning messages and am aware that the analysis without replicates probably will not yield any meaning results. I want to make sure that I have the concept down for future reference.
What is the appropriate way of performing differential expression analysis when I have:
- 6 samples from the same cell line
- 3 sample types based on the phenotype (A, B, C)
- 2 times; 0 (C) & 48 (T) hours
In short, the 6 samples are as follows: cell-AC, cell-AT, cell-BC, cell-BT, cell-CC, cell-CT
Q1: If I want to perform differential expression analysis between just the sample types (A vs. B or A vs. C or B vs. C), is it correct to first subset the DESeqDataSet such that samples that belong to types that are being compared are included and set the design to ~type?
If I want to do sample-to-sample comparisons (AC vs. BC, AT vs. BT, ...; all 15 possible comparisons), is it correct to first subset the DESeqDataSet such that we only include samples of interest (e.g. only AC and BC for AC vs. BC) and set the design to ~type+condition?
Edit: I think the second question can be answered with the answer to a previously asked question A: DESEq2 comparison with mulitple cell types under 2 conditions.
Q3: It looks like for my samples, control samples cluster together and treatment samples cluster together. Is it okay to group the samples based on the condition to carry out differential expression analysis (AC & BC & CC vs. AT & BT & CT)? If so, would the design have to be set to ~type+condition or just ~condition?