SICER and Diffbind
5
0
Entering edit mode
@sergioespeso-gil-6997
Last seen 4.8 years ago
New York

Hi I am using Diffbind to report the DBS (I know that SICER has already a tool for doing so, but I have a huge dataset with different treatments and I just want to try with DiffBind)

In any case, have you ever tried it? Even if the results that I get are good and consistent with other peak callers, it is a bit weird to me to do not find any correlation among the samples in the occupancy correlation heatmap plot. I am using the test-W200-G600-FDR.01-island.bed files.

Any guess?

Thanks a lot

Sergio

Sicer diffbind • 2.5k views
ADD COMMENT
1
Entering edit mode
Gord Brown ▴ 670
@gord-brown-5664
Last seen 3.9 years ago
United Kingdom

Hi, Sergio,

It's fairly normal not to see much correlation at the stage of having loaded the sample sheet, before counting reads (and sometimes even after counting).  ChIP-seq can be pretty noisy, and in a straight occupancy analysis, even tiny noise peaks have the same weight as bigger ones.  If you still see weak clustering after the dba.analyze step, then there's a problem.

If you post an example heat map, I can tell you whether it looks similar to what I usually see.

Cheers,

 - Gord

ADD COMMENT
0
Entering edit mode
@sergioespeso-gil-6997
Last seen 4.8 years ago
New York

Ok! thanks. It is only before counting the reads, after is ok (check the occupancy heatmap bellow).  I was a bit concerned because I used other peak callers for the same dataset and I could see some correlation in the occupancy heatmap, but not when I used SICER. I was just wondering if that was because I was using the wrong bed file , but I got more or less the same results in the GO associations, so I guess that it is ok (test-W200-G600-FDR.01-island.bed)

I also thought that it could be because the bed file was a bit different , but it seems that is not the case. With some peak callers I needed to change a bit the output file in order to get a correct input for DiffBind (MUSIC for example) 

Thanks a lot Gordon! 

Cheers
Sergio


ADD COMMENT
1
Entering edit mode

Hi,

There's something very wrong with your heat map.  There's no way every sample could be exactly the same distance apart from every other sample, which is what this shows (unless the data were artificially generated to create this pattern).  I can't really imagine how you could produce this via DiffBind.  Could you post (or email to me, if you prefer) your sample sheet and the top 50 or so lines of each peak file?

 - Gord

ADD REPLY
0
Entering edit mode
@sergioespeso-gil-6997
Last seen 4.8 years ago
New York

Yeap, 

I will write you.  thanks Gord! 

Sergio

ADD COMMENT
0
Entering edit mode
@sergioespeso-gil-6997
Last seen 4.8 years ago
New York

 

Now is working, :-) 

 

I will paste your answer in the support of both Bioconductor and SICER google group, maybe someone else will have the same problem. 

 

Have a nice day!

 

Sergio

On 07 Apr 2015, at 14:36, Gord wrote:

Hi,

 

The 4-column format isn't really "bed" format.  Bed would have 6 columns (usually):

 

chrom start end name score strand

 

If you want to use the 4-column format, set the PeakCaller column to "raw".

 

Cheers,

 

 - Gord

ADD COMMENT
0
Entering edit mode
@sergioespeso-gil-6997
Last seen 4.8 years ago
New York

Just a clarification, I just realised that there is not need to change the MUSIC output bed file. I did because I saw that was a bit different from MACS output, but there is not need.  

ADD COMMENT

Login before adding your answer.

Traffic: 782 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6