Question: normalization of CHIP-seq samples (ChIP and INPUT)
gravatar for Bogdan
6 days ago by
Palo Alto, CA, USA
Bogdan480 wrote:

Dear all,

please could I have your advise on the following : shall we have multiple samples from ChIP experiments (where the proteins A, B, C were immuno-precipitated on the chromatin) and INPUT DNA (from the samples A, B, and C), would we normalize them all together (i.e. ChIP + INPUT) in edgeR, for example ?  Many thanks.


-- bogdan


ADD COMMENTlink modified 6 days ago by Aaron Lun17k • written 6 days ago by Bogdan480

If you haven't already, you should check out the csaw package, both the User's Guide and the associated publication.

ADD REPLYlink written 6 days ago by Ryan C. Thompson6.2k

Dear Ryan, thank you for your reply. Here the question is not very much about TMM normalization (implemented in csaw or edgeR); I would be interested to know  whether I could normalize in the same procedure both ChIP samples and INPUT_DNA samples. Sorry if I missed the discussion in the manual. Any suggestions are welcome.

ADD REPLYlink written 6 days ago by Bogdan480
gravatar for Aaron Lun
6 days ago by
Aaron Lun17k
Cambridge, United Kingdom
Aaron Lun17k wrote:

Well, it's fairly easy to normalize the ChIP sample to the input sample for each protein, see Section 4.2 of the csaw user's guide. However, I would question the sensibility of normalizing ChIP samples for different proteins, as this sounds like a precursor to some sort of comparison of binding intensities between different proteins. The scientific value of this analysis is questionable and the results seem hard to interpret. For example, what does differential binding mean if you have different proteins involved? How do you deal with differences in antibody efficiency?

ADD COMMENTlink modified 6 days ago • written 6 days ago by Aaron Lun17k

Dear Aaron, great to hear from you, and thank you for suggestions.

On a side note, thought I shall ask (following some old postings) : can we obtain the normalized counts by using also CPM function (below). Many thanks !

libSizes <- as.vector(colSums(x))  

y <- DGEList(counts=x, group=group, lib.size=libSizes)


y <- calcNormFactors(y)


counts.per.m <- cpm(y, normalized.lib.sizes=TRUE)
ADD REPLYlink written 5 days ago by Bogdan480

Thank you. just found the answer : edgeR TMM values

ADD REPLYlink written 5 days ago by Bogdan480
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.2.0
Traffic: 418 users visited in the last hour