Question

DiffBind MA plot

rbronste ▴ 60

@rbronste-12189

Hi,

I just wanted to get some clarification on MA plots and exactly what they represent, including one I made from my differential peak data. What is the darker blue area close to the origin, as well as the one further along the log concentration axis, and what exactly does it represent compared with the lighter blue? Also, in this context, what does the log concentration represent? Thanks for the clarification.

Rob

https://dl.dropboxusercontent.com/u/105647823/Screen%20Shot%202017-02-20%20at%2010.30.58%20PM.png

diffbind ma plot • 2.1k views

updated 7.2 years ago by Rory Stark ★ 5.2k • written 7.2 years ago by rbronste ▴ 60

score 1 · Answer 1 · 2017-02-21

Hello Rob-

Each dot in the plot represents a consensus binding site. On the X axis, "concentration" refers to the mean (normalized) number of reads across all the samples for that binding site. This is reported as a log2 value, so each unit step from left to right corresponds to a doubling of the overall binding affinity (read density). The dark blue area close to the origin represents sites that have very low binding affinity overall (i.e., very few ChIP-seq reads overlap those sites).
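To make the two axes concrete, here is a minimal sketch of the arithmetic for a single site. It is written in plain Python purely for illustration (DiffBind itself is an R package), the read counts are made up, and the function name `ma_coordinates` and the floor of 1 read are assumptions of this sketch, not DiffBind's actual code.

```python
import math

FLOOR = 1.0  # assumed minimum mean read count, so log2 never sees zero


def ma_coordinates(group1_counts, group2_counts):
    """Return (log2 concentration, log2 fold change) for one binding site.

    Inputs are normalized read counts for the samples in each group.
    """
    all_counts = list(group1_counts) + list(group2_counts)
    mean_all = max(sum(all_counts) / len(all_counts), FLOOR)
    mean1 = max(sum(group1_counts) / len(group1_counts), FLOOR)
    mean2 = max(sum(group2_counts) / len(group2_counts), FLOOR)
    concentration = math.log2(mean_all)        # X axis
    fold = math.log2(mean1) - math.log2(mean2)  # Y axis
    return concentration, fold


# Hypothetical site: 64 reads per sample in group 1, 16 in group 2.
conc, fold = ma_coordinates([64.0, 64.0], [16.0, 16.0])
print(f"log2 concentration = {conc:.2f}, log2 fold change = {fold:.2f}")
```

A site with four times as many reads in the first group as in the second lands at a Y value of 2, and doubling every count would move it one unit to the right without changing Y.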

By default this is a "smoothed" plot, so instead of plotting each point (binding site), darker colors are used to show that there are a lot of points in that area. Lighter areas have few or no points. You can get a non-smoothed version of this plot by setting bSmooth=FALSE when calling dba.plotMA(). You can get the same data expressed as an XY plot by setting bXY=TRUE.
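The idea behind the shading can be sketched with a simple grid count: divide the plot into cells and count how many sites fall in each one. This is only an illustration in plain Python with made-up points; a real smoothed scatter plot would typically use a kernel density estimate rather than hard bins, and `density_grid` is a hypothetical helper, not part of DiffBind.

```python
from collections import Counter


def density_grid(points, cell=1.0):
    """Count points per square cell of side `cell`.

    Keys are (column, row) cell indices; a higher count would be
    drawn in a darker color.
    """
    counts = Counter()
    for x, y in points:
        counts[(int(x // cell), int(y // cell))] += 1
    return counts


# Hypothetical (concentration, fold) pairs: four sites near the origin
# and one isolated site further out.
points = [(0.2, 0.1), (0.4, -0.2), (0.7, 0.3), (0.9, -0.4), (6.5, 3.2)]
grid = density_grid(points)
for cell_index, n in sorted(grid.items()):
    print(cell_index, n)
```

Cells holding many sites correspond to the dark blue regions, while the lone site would barely register, which is why sparse areas look pale or white.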

In your plot, the dark spot near the origin is a cluster of sites that have very low read counts and also don't change much. The main dark region shows sites with increasing binding activity (high X values) but not much change between conditions (Y close to 0). Both of the dense blue areas are shifted slightly below a fold change of 0 (Y axis), indicating a tendency to see more reads in the second sample group.

The red points are "significantly differentially bound" sites. The absolute values of their fold changes are greater than 2; since the Y axis is also on a log2 scale, this indicates at least a 4-fold change in binding affinity. The red dots on the outer diagonal lines are usually sites that have no binding in one condition and substantial binding in the other condition.
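A small worked example may help with both points. It is a sketch in plain Python with made-up read counts, and it assumes two equally sized groups and a floor of 1 read for the unbound condition; neither assumption is taken from DiffBind's code. It converts a log2 fold change back to a linear ratio, and shows why sites bound in only one condition line up along a diagonal.

```python
import math

FLOOR = 1.0  # assumed mean read count for the condition with no binding


def log2_to_linear(log2_fold):
    """Convert a log2 fold change to a linear ratio, ignoring direction."""
    return 2.0 ** abs(log2_fold)


print(log2_to_linear(2.0))   # a Y value of 2 is a 4-fold change
print(log2_to_linear(-2.0))  # same magnitude in the other direction

# Sites with `reads` in one group and only the floor in the other.
offsets = []
for reads in (32.0, 128.0, 512.0):
    conc = math.log2((reads + FLOOR) / 2)        # mean over both groups
    fold = math.log2(reads) - math.log2(FLOOR)   # bound vs. unbound
    offsets.append(fold - conc)
    print(f"reads={reads:.0f}  conc={conc:.2f}  fold={fold:.2f}")
```

Under these assumptions the fold change stays just under one unit above the concentration however many reads the bound condition has, so these sites trace a straight line of slope 1 at the edge of the plot (and its mirror image below zero when the binding is in the other group).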

A colleague of mine calls these "fish plots", as many of them (such as yours) remind her of tropical fish.

Cheers-

Rory