about paired-samples in edgeR
2
1
Entering edit mode
Bogdan ▴ 670
@bogdan-2367
Last seen 6 months ago
Palo Alto, CA, USA

Dear all,

would appreciate a suggestion please : considering a replicated experiment comparing Normal vs Tumor :

Normal1 Normal2 Tumor1 Tumor2

when setting up the list of samples for differential analysis in edgeR,

group <- factor(c("normal","normal","tumor", "tumor"))
y <- DGEList(counts=x,group=group)
y <- calcNormFactors(y)
y <- estimateCommonDisp(y)

how could I indicate that Normal1 is paired with Tumor1, and Normal2 is paired with Tumor2 ? thank you !

-- bogdan

edger • 3.0k views
ADD COMMENT
2
Entering edit mode
b.nota ▴ 360
@bnota-7379
Last seen 3.6 years ago
Netherlands

You'll have to add a factor with subjects in the design.

group <- factor(c("normal","normal","tumor", "tumor"))

subject <- factor(1, 2, 1, 2)

design <- model.matrix(~0+group+subject)

etc.

ADD COMMENT
3
Entering edit mode

Note that it should be:

subject <- factor(c(1,2,1,2))
ADD REPLY
2
Entering edit mode

Also, there is no advantage gained by adding "0+" to the model formula. The analysis would be equivalent and slightly simpler without it.

ADD REPLY
0
Entering edit mode
Bogdan ▴ 670
@bogdan-2367
Last seen 6 months ago
Palo Alto, CA, USA

Dear all, thank you all for your comments : shall I just write then :

design <- model.matrix(~group+subject)

Also, please could you let me know if there are major advantages/disadvantages of limma versus edgeR for DE analysis ?

 

ADD COMMENT
1
Entering edit mode

Both packages will do the job when it comes to detecting DE genes in a routine analysis. limma (+ voom) is faster and has access to more methodology (e.g., duplicateCorrelation) compared to edgeR, courtesy of lots of things being easier when you assume normality. However, voom does rely on the presence of a well-fitted mean-variance trend to estimate the precision weights. For some applications (though not usually DE analyses), the spread of abundances is too low to stably fit the trend, and in such cases edgeR will give better performance. edgeR also handles low counts better, which is worth considering if you want to focus on lowly-expressed genes or are dealing with very low-coverage data (e.g., single-cell stuff).

ADD REPLY
0
Entering edit mode

Thank you Aaron ! great to hear from you !

ADD REPLY

Login before adding your answer.

Traffic: 934 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6