Anyone see what's causing edgeR to report "Design matrix not of full rank."?
@nickschurch-9861

Does anyone have any idea why this experimental design is prompting a "Design matrix not of full rank." error?

Sample condition1 conditions2 conditions3 batch lane2 lane3 lane4 lane5 lane6 lane7 lane8 lane9 lane10 lane11 spikein_mix
1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0
2 0 0 1 0 0 0 0 0 0 0 0 0 0 0 1
3 0 0 1 0 1 0 0 0 0 0 0 0 0 0 0
4 0 0 1 0 1 0 0 0 0 0 0 0 0 0 1
5 0 0 1 0 0 1 0 0 0 0 0 0 0 0 0
6 0 0 1 0 0 1 0 0 0 0 0 0 0 0 1
7 0 0 1 0 0 0 1 0 0 0 0 0 0 0 0
8 0 1 0 0 0 0 1 0 0 0 0 0 0 0 1
9 0 1 0 1 0 0 0 1 0 0 0 0 0 0 0
10 0 1 0 1 0 0 0 1 0 0 0 0 0 0 1
11 0 1 0 1 0 0 0 0 1 0 0 0 0 0 0
12 0 1 0 1 0 0 0 0 1 0 0 0 0 0 1
13 0 1 0 1 0 0 0 0 0 1 0 0 0 0 0
14 0 1 0 1 0 0 0 0 0 1 0 0 0 0 1
15 1 0 0 1 0 0 0 0 0 0 1 0 0 0 1
16 1 0 0 1 0 0 0 0 0 0 1 0 0 0 0
17 1 0 0 1 0 0 0 0 0 0 0 1 0 0 1
18 1 0 0 1 0 0 0 0 0 0 0 1 0 0 0
19 1 0 0 1 0 0 0 0 0 0 0 0 1 0 1
20 1 0 0 1 0 0 0 0 0 0 0 0 1 0 0
21 1 0 0 1 0 0 0 0 0 0 0 0 0 1 1

Apparently "lane6" and "lane11" are the problems. lane11 has only a single member, which I thought might be the cause, but assigning that sample to lane10 instead of lane11 doesn't help; it just shifts the problem to "lane7" and "lane10".

Aaron Lun
@alun

Easy. batch is equal to the sum of lane5 to lane11. So, for any estimate of the batch coefficient, you can obtain an equally good GLM fit by increasing batch and decreasing all lane5 to lane11 coefficients by the same amount. To fix this, you should drop batch from your design, as the batch effect is captured by the lane coefficients anyway. Note that if you block on the lane, your last sample is basically useless, because all its information is used to estimate the lane11 coefficient.
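
The dependency described above can be checked numerically. The following is a minimal sketch in plain Python (not edgeR code): `SAMPLES` is a hand transcription of the table in the question, and `build_columns` and `rank` are hypothetical helpers written for this illustration. On the table as posted, the check also turns up a second relation, condition1 = lane8 + lane9 + lane10 + lane11, which would be consistent with two coefficients being reported as not estimable and suggests that dropping batch alone may not be enough.

```python
from fractions import Fraction

# Hand transcription of the posted table (an assumption of this sketch):
# one tuple per sample, (condition, batch, lane, spikein_mix).
# Lane 1 is the baseline and has no column of its own.
SAMPLES = [
    (3, 0, 1, 0), (3, 0, 1, 1), (3, 0, 2, 0), (3, 0, 2, 1),
    (3, 0, 3, 0), (3, 0, 3, 1), (3, 0, 4, 0), (2, 0, 4, 1),
    (2, 1, 5, 0), (2, 1, 5, 1), (2, 1, 6, 0), (2, 1, 6, 1),
    (2, 1, 7, 0), (2, 1, 7, 1), (1, 1, 8, 1), (1, 1, 8, 0),
    (1, 1, 9, 1), (1, 1, 9, 0), (1, 1, 10, 1), (1, 1, 10, 0),
    (1, 1, 11, 1),
]


def build_columns(samples):
    """Return {column name: 0/1 values}, in the column order of the table."""
    cols = {}
    for name, c in zip(("condition1", "conditions2", "conditions3"), (1, 2, 3)):
        cols[name] = [int(s[0] == c) for s in samples]
    cols["batch"] = [s[1] for s in samples]
    for lane in range(2, 12):
        cols[f"lane{lane}"] = [int(s[2] == lane) for s in samples]
    cols["spikein_mix"] = [s[3] for s in samples]
    return cols


def rank(columns):
    """Exact rank by Gaussian elimination over fractions.

    The design columns are treated as rows here, which is fine because a
    matrix and its transpose have the same rank.
    """
    rows = [[Fraction(v) for v in col] for col in columns]
    r = 0
    for j in range(len(rows[0]) if rows else 0):
        pivot = next((i for i in range(r, len(rows)) if rows[i][j] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            if rows[i][j] != 0:
                f = rows[i][j] / rows[r][j]
                rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r


cols = build_columns(SAMPLES)
n = len(SAMPLES)

full_rank = rank(list(cols.values()))
lanes_5_to_11 = [sum(cols[f"lane{k}"][i] for k in range(5, 12)) for i in range(n)]
lanes_8_to_11 = [sum(cols[f"lane{k}"][i] for k in range(8, 12)) for i in range(n)]
no_batch = [v for name, v in cols.items() if name != "batch"]
no_batch_rank = rank(no_batch)

print(len(cols), full_rank)                 # 15 columns, rank 13
print(cols["batch"] == lanes_5_to_11)       # True: batch = lane5 + ... + lane11
print(cols["condition1"] == lanes_8_to_11)  # True: condition1 = lane8 + ... + lane11
print(len(no_batch), no_batch_rank)         # 14 columns, rank 13
```

In R, comparing `qr(design)$rank` with `ncol(design)` gives the same kind of check directly on the design matrix passed to edgeR.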

That said, is it really necessary to block on the individual lanes? For current (Illumina) protocols, I'd be surprised if sequencing lane had a large effect. If the samples don't cluster by lane on an MDS plot, I would forget about blocking on the lanes and just block on the batch effect; you'll get more power with a simpler model. Whether or not spike-in mix is added seems like an unnecessary factor as well.
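
To put rough numbers on the simpler model, here is a sketch in plain Python (again not edgeR code) that compares a design with only the condition and batch columns against one that blocks on lane. `SAMPLES` is a hand transcription of the table in the question, and `indicator` and `rank` are hypothetical helpers for this illustration. Residual degrees of freedom are taken as the number of samples minus the rank of the design.

```python
from collections import Counter
from fractions import Fraction

# Hand transcription of the posted table (an assumption of this sketch):
# one tuple per sample, (condition, batch, lane, spikein_mix).
SAMPLES = [
    (3, 0, 1, 0), (3, 0, 1, 1), (3, 0, 2, 0), (3, 0, 2, 1),
    (3, 0, 3, 0), (3, 0, 3, 1), (3, 0, 4, 0), (2, 0, 4, 1),
    (2, 1, 5, 0), (2, 1, 5, 1), (2, 1, 6, 0), (2, 1, 6, 1),
    (2, 1, 7, 0), (2, 1, 7, 1), (1, 1, 8, 1), (1, 1, 8, 0),
    (1, 1, 9, 1), (1, 1, 9, 0), (1, 1, 10, 1), (1, 1, 10, 0),
    (1, 1, 11, 1),
]


def indicator(samples, index, value):
    """0/1 column marking the samples whose field `index` equals `value`."""
    return [int(s[index] == value) for s in samples]


def rank(columns):
    """Exact rank by Gaussian elimination over fractions (columns used as rows)."""
    rows = [[Fraction(v) for v in col] for col in columns]
    r = 0
    for j in range(len(rows[0]) if rows else 0):
        pivot = next((i for i in range(r, len(rows)) if rows[i][j] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            if rows[i][j] != 0:
                f = rows[i][j] / rows[r][j]
                rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r


conditions = [indicator(SAMPLES, 0, c) for c in (1, 2, 3)]
batch = [s[1] for s in SAMPLES]
lanes = [indicator(SAMPLES, 2, k) for k in range(2, 12)]
spikein = [s[3] for s in SAMPLES]

simple = conditions + [batch]              # conditions + batch only
blocked = conditions + lanes + [spikein]   # lane blocking, batch dropped

n = len(SAMPLES)
simple_rank = rank(simple)
blocked_rank = rank(blocked)
simple_df = n - simple_rank
blocked_df = n - blocked_rank

print(len(simple), simple_rank, simple_df)     # 4 columns, rank 4, 17 residual df
print(len(blocked), blocked_rank, blocked_df)  # 14 columns, rank 13, 8 residual df

# How samples are spread over (condition, batch) in the posted table.
crosstab = Counter((s[0], s[1]) for s in SAMPLES)
print(sorted(crosstab.items()))
# [((1, 1), 7), ((2, 0), 1), ((2, 1), 6), ((3, 0), 7)]
```

The cross-tabulation suggests that, in the table as posted, only condition 2 has samples in both batches, so the batch coefficient would be informed mainly by those samples. That may be worth keeping in mind when interpreting it.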

@nickschurch-9861

OK, I see. I was hoping to be able to probe the impact of batch and lane separately rather than leave one out. I agree that lane probably doesn't have much effect, but I figured I'd try for a full description of the experiment. Thanks.
