In RNA-seq/microarray data, enrichment analysis of gene sets generally takes 2 forms: 1) Identify a set of genes that are significant by some metric (usually FDR < 5%) and evaluate enrichment compared to some background set of genes; 2) use the test statistics directly to avoid an arbitrary cutoff. In limma #2 is implemented in cameraPR(), geneSetTest() and wilcoxGST() functions.
Let's turn to ChIP-seq, ATAC-seq data or any assay that yields genomic intervals. Just like standard differential expression analysis, consider a differential 'binding' test between two sets of samples. After getting the test statistics for each interval, I want to see if these differential intervals are enrichments for some biological function. Test #1, where a subset of intervals are called 'significant', is implemented in GREAT great.stanford.edu). GREAT accounts for false positive enrichments due to overlap of genomic intervals with genes in specific function set even under the null.
How do I perform test #2 with ChiP-seq intervals? The analysis must a) use the test statistics directly, rather than dividing intervals into 'significant' and 'non-significant' sets, and b) use a GREAT-style method to avoid false positive enrichments.
Does a method / implementation like this exist?