Starting context: I'm pretty intro-level at bioinformatics. I'm using ATACseq datasets. 4 groups (multiple replicates per group) of the following format:
Group 1 = ConditionA:TreatmentX, Group 2 = ConditionA:TreatmentY, Group 3 = ConditionB:TreatmentX, Group 4 = ConditionB:TreatmentY
First, I want to obtain the set of genomic regions that do NOT change in accessibility between groups 1 and 2 (call this {1⋂2}). Is this possible? The vignettes for DESeq2 and edgeR seem to only describe how to obtain regions that differ between groups.
Second, assuming I now have {1⋂2}, I next need to obtain the genomic regions that differ between {1⋂2} and the combination of genomic regions in groups 3 and 4 (call this {3⋃4}). I think I could do this in a similar fashion to edgeR's example, ((drug.2hr - drug.0hr) - (placebo.2hr - placebo.0hr)). Except in my case, I need the opposite function of (drug.2hr - drug.0hr), and a new function for (placebo.2hr - placebo.0hr) that effectively combines the regions defined in both placebo.2hr and placebo.0hr. Something like ((drug.2hr + drug.0hr) - (placebo.2hr OR placebo.0hr)).
Third, I don't know how to obtain {3⋃4}. Should I simply consider all replicates of group 3 as though they were more replicates of group 4, then generate a consensus peakset from that?
One option I considered was to find a way to combine the resulting datasets of {{1⋂2} - 3} and {{1⋂2} - 4}, but while this skirts my third problem, I still don't know how to describe to edgeR or DESeq2 the logical relationship of ⋂.