Question

Adjusting for known covariates before coexpression analysis with WGCNA

1

Entering edit mode

mikhael.manurung ▴ 280

@mikhaelmanurung-17423

Last seen 2.1 years ago

Netherlands

Dear all,

I would like to adjust my whole-blood RNA-Seq count data matrix for cell type composition (obtained from hematological analysis & flow cytometry) before doing a coexpression network analysis with WGCNA.

So far, I did the following:

# I use DESeq2's vst to remove mean-variance relationship in the data
dds <- DESeq2::DESeqDataSetFromMatrix(counts, colData, design = ~ group)
dds <- DESeq2::vst(dds, blind = TRUE)
vst <- assay(dds)

# adjust for confounding variables
vst_adjusted <- limma::removeBatchEffect(
  x = vst,
  covariates = c(cellA, cellB, cellC) # numeric vectors containing scaled cell proportion
)

However, according to this link from other forum I can apparently insert the covariates into the design matrix when making the DESeqDataSet and then set blind = FALSE during the variance-stabilizing transformation.

There are also those who recommend using ComBat from sva by inserting my covariates to the mod parameter.

Which one is the best way for my goal?

Thank you for your kind response.

Best regards, Mikhael

wgcna coexpression sva • 2.3k views

ADD COMMENT • link updated 5.2 years ago by Peter Langfelder ★ 3.0k • written 5.2 years ago by mikhael.manurung ▴ 280

score 3 · Answer 1 · 2019-05-23

3

Entering edit mode

Peter Langfelder ★ 3.0k

@peter-langfelder-4469

Last seen 4 months ago

United States

It is my understanding that ComBat cannot handle continuous nuisance variables (i.e., variables you want to remove). It can handle continuous covariates which in this case means variables whose effect you want to keep; these are supplied in the 'mod' argument for ComBat.

For removing the effect of continuous covariates, I personally use WGCNA's empiricalBayesLM but removeBatchEffect should work as well. Just check your code, I would expect that you will need

covariates = cbind(cellA, cellB, cellC)

not

covariates = c(cellA, cellB, cellC).

ADD COMMENT • link 5.2 years ago Peter Langfelder ★ 3.0k

0

Entering edit mode

Dear Peter,

Thank you for your prompt response. For empiricalBayesLM, would you advise feeding the group variables into the retainedCovariates argument?

Best, Mikhael

ADD REPLY • link 5.2 years ago mikhael.manurung ▴ 280

1

Entering edit mode

Retained covariates are those whose effect you want to preserve. My understanding is that you want to remove the cell type abundance/composition information, not retain it; removed variables should go into the removedCovariates argument.

ADD REPLY • link 5.2 years ago Peter Langfelder ★ 3.0k