Chip-seq analysis with input
Entering edit mode
damian.kao • 0
Last seen 6.1 years ago

I read through the other posts about using RNA-seq packages to analyze Chip-seq data ( It seems like the general consensus is to just ignore the input control samples or build a black-list and look at differential binding between IP samples.

If I do want to incorporate input control into my differential binding, is it valid to include that data as another factor in the design matrix and perform a difference of difference?

So for every library, I would have a "IP" factor with two levels (IP/Input) and also a "sample" factor with two levels for treatment and control. 

I am not very well versed in R, would it be possible to make this kind of contrast? And would this type of contrast even be valid?

chip-seq differential binding analysis • 857 views
Entering edit mode
Aaron Lun ★ 27k
Last seen 13 hours ago
The city by the bay

Can you do it? Yes, because GLMs are very flexible and can easily test for differences of differences.

Should you do it? That's harder to answer, but in most cases, probably not. See A: csaw with negative controls for an explanation of why looking for differences of differences tends to be counterproductive.

Entering edit mode

Thanks. I am trying to understand what you meant when you wrote:

However, this becomes problematic when changes in chromatin state (that affect input coverage) coincide with changes in binding....

Are you saying that for a given genomic region that has signal in the input control, that signal is composed of various factors (mappability, gc content, open chromatin...). One of the factors of input control signal could also be the protein of interest that we are IP-ing. To then try to determine a difference between the protein of interest and the input control (which includes signal from the protein of interest) would not be productive. You would essentially be equalizing out the sample vs input signal.

Entering edit mode

Yes, the risk is that the changes in the input signal (e.g., due to changes in chromatin accessibility) are biologically correlated with genuine changes in ChIP signal (e.g., due to more protein binding in regions of open chromatin). So if you use the former to "correct" the latter, you would weaken or lose the DB effect. Anecdotes suggest that this does indeed happen, which is why I whine about it.


Login before adding your answer.

Traffic: 502 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6