Dear all, I would have a more general question regarding bias correction in ChIP-seq, and i would appreciate having your suggestions.
Let's assume that someone did a ChIP-seq for a protein in 2 conditions (a. CTL, and b. KD), and we compare : 1 -- all the genes in the genome ("all") , vs 2. -- a specific subset ("subset").
After KD, there is some increase in ChIP seq intensity genome-wide (for "all" genes), although the increase on the genes in "subset" is much stronger. more specifically, looking at the MEDIAN values in each experiment, let's assume :
CTL -- "all" genes : 10
KD -- "all" genes : 20
CTL -- "subset" genes : 15
KD -- "subset" genes : 45
The question would be : what type of scaling procedure (beside computing the Z-scores), would you recommend in order to bring CTL--"all genes" and KD--"all genes" on approx the same median value ?
thank you very much,
-- bogdan
yes, just noticed this old post about cpm() : How can I get the normalized read counts from TMM?. And it helps ;)