Question

Limma Model Matrix Understanding

1

Entering edit mode

andrew.j.skelton73 ▴ 370

@andrewjskelton73-7074

Last seen 5 weeks ago

United Kingdom

Hi,

I'd just like to increase my understanding of my grouped model design.

foo <- as.factor(c("A","A","A","B","B","B"))
mod <- model.matrix(~0 + foo)
mod <- model.matrix(~1 + foo)

For the example above, I know that adding the ~1 or ~0 stops the first term of the groupings from being absorbed into the intercept, but I'm just wondering what the differences are between using 0 and 1. Does anyone have a good explanation of this?

Thanks

limma limma design matrix model matrix • 5.4k views

ADD COMMENT • link updated 9.0 years ago by Aaron Lun ★ 28k • written 9.0 years ago by andrew.j.skelton73 ▴ 370

0

Entering edit mode

Actually adding ~1 to the model doesn't do anything. It just adds an intercept to a model that already has one, and so has no effect. You might have intended to write ~-1, which has the same effect as ~0.

ADD REPLY • link 4.2 years ago Gordon Smyth 50k

Gordon Smyth · Accepted Answer · 2015-05-07

From a practical perspective, the interpretation of your coefficients will change. In the first model with ~0, each coefficient represents the average of the samples at each level of foo, i.e., the first coefficient represents the average for A, while the second represents the average of B. In the second model, the intercept represents the average of the samples for A, while the second coefficient represents the increase in the average of B over that of A. Indeed, the systematic component for the samples in B is the sum of the first and second coefficients, while the systematic component for samples in A is just comprised of the first coefficient.

Both models are valid inputs into limma. The choice between the two depends on which one is easier to interpret. For complicated models, I like to do without the intercept as I can interpret each coefficient by itself, without having to mentally consider its meaning relative to some intercept value. However, for simple models like the one you presented above, the analysis may be easier with the intercept as you can just drop the second coefficient to get the difference between the two groups - there's no need to use contrasts.fit.