Question

Limma Voom duplicateCorrelation Design

0

Entering edit mode

bruce.moran ▴ 30

@brucemoran-8388

Last seen 4.2 years ago

Ireland

Hi,

want to get this straight as my first time using limma-voom and duplicateCorrelation.

My metadata has multiple Tissue (either tumour or normal) across two Diets. Each Individual had multiple biopsies, from either normal, tumour or multiple of each.

conds
    Individual Tissue Diet
S1           1 Normal   NC
S2           1 Tumour   NC
S3           1 Tumour   NC
S4           1 Tumour   NC
S5           2 Normal   NC
S6           3 Normal   NC
S7           4 Tumour   NC
S8           4 Tumour   NC
S9           5 Tumour   NC
S10          5 Tumour   NC
S11          5 Tumour   NC
S12          5 Normal   NC
S13          6 Tumour   NC
S14          6 Tumour   NC
S15          6 Tumour   NC
S16          6 Tumour   NC
S17          7 Tumour   NC
S18          7 Tumour   NC
S19          7 Tumour   NC
S20          7 Normal   NC
S21          8 Normal  HFD
S22         10 Normal  HFD
S23         11 Tumour  HFD
S24         11 Tumour  HFD
S25         11 Tumour  HFD
S26         12 Tumour  HFD
S27         12 Tumour  HFD
S28         12 Tumour  HFD
S29         12 Tumour  HFD
S30         13 Normal  HFD
S31         21 Tumour  HFD
S32         22 Tumour  HFD
S33         22 Tumour  HFD
S34         22 Tumour  HFD
S35         22 Normal  HFD

I believe I can use duplicateCorrelation setting block=Individual to account for correlated gene expression from individuals. I make a combined DietTissue as we believe gene expression in Tissue will be affected by Diet.

DietTissue <- factor(paste(conds$Diet, conds$Tissue, sep="."))
DTdesign <- model.matrix(~0+DietTissue, conds)
colnames(DTdesign)[seq_len(nlevels(DietTissue))] <- levels(DietTissue)
DTkeep <- filterByExpr(dge, design)
DTdge <- dge[keep, keep.lib.sizes=FALSE]
DTdge <- calcNormFactors(DTdge)

##voom
DTdcv <- voom(DTdge, design, plot=FALSE)

##DC: https://support.bioconductor.org/p/94280/#94290; https://support.bioconductor.org/p/59700/
DTcor <- duplicateCorrelation(DTdcv, 
                              block=conds$Individual)
DTdcdcv <- voom(DTdge, 
                DTdesign, 
                correlation=DTcor$consensus.correlation, 
                block=conds$Individual)
DTdcfit <- lmFit(DTdcdcv, 
                 DTdesign, 
                 cor=DTcor$consensus.correlation,
                 block=conds$Individual)
DTdcmc <- makeContrasts(HFD.Tumour_Normal = HFD.Tumour - HFD.Normal,
                        NC.Tumour_Normal = NC.Tumour - NC.Normal,
                        levels=design)
DTdcfitmc <- contrasts.fit(DTdcfit, DTdcmc)
DTdcfitmc <- eBayes(DTdcfitmc)

Any comments with respect to this analysis would be greatly appreciated.

Bruce

limma voom duplicatecorrelation • 2.9k views

ADD COMMENT • link updated 7.2 years ago by Aaron Lun ★ 29k • written 7.2 years ago by bruce.moran ▴ 30

score 6 · Accepted Answer · 2018-11-27

6

Entering edit mode

Aaron Lun ★ 29k

@alun

Last seen 3 minutes ago

The city by the bay

Looks fine to me. I'd do a second duplicateCorrelation after the second voom. I might also consider setting robust=TRUE in eBayes, any benefit depends on whether you have genes with outlier variances.

ADD COMMENT • link 7.2 years ago Aaron Lun ★ 29k

0

Entering edit mode

Hi Aaron,

thanks for the reply, late response as I got no notification mail.

I find very small change after second round of duplicateCorrelation:

DTcor <- duplicateCorrelation(DTdcv, block=conds$Individual)
DTcor$consensus.correlation
[1] 0.4353269

DTdcv2 <- voom(DTdge, DTdesign, correlation=DTcor$consensus.correlation, block=conds$Individual)
DTcor <- duplicateCorrelation(DTdcv2, block=conds$Individual)
DTcor$consensus.correlation
[1] 0.4356611

I would like to perform MDS post duplicateCorrelation. Is there a way to apply a scaling factor accounting for the correlation to counts/CPM on a per-sample/individual level that you know of?

Thanks for the tip on robust=TRUE.

Bruce.

ADD REPLY • link 7.1 years ago bruce.moran ▴ 30

0

Entering edit mode

In general, no. The reason we have to use duplicateCorrelation in the first place is because we don't have enough information in the experimental design to estimate the effect of the blocking factor, which means we can't compute corrected expression values that are free of said effect. At least, not without also removing confounded effects of interest, which is the diet in your case. I don't think there's an obvious visual analogy in the MDS plot for the effect of duplicateCorrelation. The closest I can think of is somehow "squeezing" all samples together in a manner that preserves the differences within each level of the blocking factor.

Long story short, don't worry about it and just make your MDS plot. If the diet effect is strong, it should show up fine regardless of the fact that it's confounded with the individual effect.

ADD REPLY • link 7.1 years ago Aaron Lun ★ 29k

0

Entering edit mode

OK, good to know, and yes, with strong effect (tumour vs. normal) we find an effect as expected, but the diet effect is minimal. Not sure I won't worry about it though=D Thanks for your advice, and your continued support of these kinds of questions, it is incredibly helpful.

ADD REPLY • link 7.1 years ago bruce.moran ▴ 30