Hi,

I am currently working on 12 ATAC-seq samples using DiffBind. Following the tutorial, I first created a DBA object with minOverlap set to 2, which gave 60k peaks. I then counted the reads in each peak for each sample. However, some peaks have a high count in a sample even though they are not in that sample's input peak list. Can someone explain the reason for this discrepancy? For example, in the count table peak_1 has 120 counts in sample A, but when I go back and check the input peakset of sample A, peak_1 is not there.

library(DiffBind)
dataOb <- dba(sampleSheet = "table.samplesheet.csv", minOverlap=2)
dataOb <- dba.count(dataOb, summits = FALSE, minOverlap = 2,
                    score = DBA_SCORE_NORMALIZED,
                    bUseSummarizeOverlaps = TRUE)
write.csv(dataOb[["binding"]], file="peaks.counts.csv")
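
The check described above, whether a counted peak overlaps anything in one sample's input peakset, can be sketched in base R. This is only a toy illustration and not DiffBind code: all coordinates are invented, and `overlaps_any` is a hypothetical helper name, not a DiffBind function.

```r
# Toy data: every coordinate here is invented for illustration.
# `counted_peak` stands in for peak_1 from the count table.
counted_peak <- data.frame(chr = "chr1", start = 1000, end = 1500)

# Stand-ins for the input peaksets of two samples.
sample_a_peaks <- data.frame(
  chr   = c("chr1", "chr1", "chr2"),
  start = c(100, 5000, 1000),
  end   = c(400, 5600, 1500)
)
sample_b_peaks <- data.frame(
  chr   = c("chr1", "chr3"),
  start = c(1200, 700),
  end   = c(1800, 900)
)

# Hypothetical helper: TRUE if `peak` overlaps at least one interval
# in `peaks` on the same chromosome (closed intervals).
overlaps_any <- function(peak, peaks) {
  any(peaks$chr == peak$chr &
        peaks$start <= peak$end &
        peaks$end >= peak$start)
}

called_in_a <- overlaps_any(counted_peak, sample_a_peaks)
called_in_b <- overlaps_any(counted_peak, sample_b_peaks)

print(c(A = called_in_a, B = called_in_b))
```

In this toy case the peak overlaps an interval in sample B but none in sample A, which is the same situation I see for peak_1 in my real data.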

Many thanks,
Huifang