Question: Controlling for batch after Salmon-Tximport
gravatar for mad6kj
3 months ago by
mad6kj0 wrote:

I have recently tried to implement the Samon-Tximport-DESeq2 pipeline with my latest RNA-Seq dataset however I have a problem with some of the downstream analysis and potential compatibility with other programs. I have found that the dataset has quite large batch effects and I am hoping to regress these out prior to downstream analysis (mainly WGCNA). My general workflow at present has been to use tximport to get gene-level count estimates and then to run the standard DESeq2 pipeline from there.

I was then planning to normalize these counts (vst) and run a simple regression on the transformed object to remove the effects of batch. The residuals I would then use for WGCNA. My concern however is that it is mentioned throughout the documentation that one should not normally use these transformed objects for any purpose involving DE. I was hoping to get some advice on whether this would be appropriate and if not, what other methods may be more suitable. I have a general population cohort and I am looking at more covariates of interest than disease status (age, gender, family history etc etc).

Many thanks,


ADD COMMENTlink modified 3 months ago by James W. MacDonald47k • written 3 months ago by mad6kj0
gravatar for James W. MacDonald
3 months ago by
United States
James W. MacDonald47k wrote:

You normally don't use residuals from a model fit as input to another model fit, because you are then cheating by ignoring the fact that you have lost degrees of freedom (in the original model fit) that you are not accounting for in the second model fit. This isn't an issue for things like WGCNA, which aren't inferential techniques. In other words, you won't be using the residuals for differential expression, but instead will be using for WGCNA, which doesn't do that sort of thing.

ADD COMMENTlink written 3 months ago by James W. MacDonald47k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.2.0
Traffic: 232 users visited in the last hour