Hello,
I want to use removeBatchEffect() on the expression data (Elist) prior to drawing a heatmap based on the expression of sig diff genes. Those sig diff genes were generated from limma linear modelling, with the batch factor already included in the linear model.
I saw people use removeBatchEffect(y, batch=batch) and removeBatchEffect(y, batch=batch, design=design). I would very much like to know in what condition I should include the design matrix, and when to also include covariates ??? Any comments would be very appreciated. Thank you in advance!
removeBatchEffect(x, batch=NULL, covariates=NULL, design=matrix(1,ncol(x),1), ...)
Thanks,
Xiayu
Hi, Ryan
That's very clear. Thank you very much for your instructions. Then when should I include covariates, or what kind of covariates I should adjusted for ?? Covariates in the model should already be in the design matrix, right?
Thanks,
Xiayu
<p>Hi Xiayu,</p>
<p>The covariates argument is just a more general way to specify batch
effects to be corrected for. If you had more than 2 batch factors, or
if you had some continuous numerical covariates to correct for, you
would manually construct your own "batch design matrix" and pass it as
the covariates argument. If all you have is one or two batch factors,
then all you need to do is pass them for batch and batch2, and you
don't need to worry about the covariates argument.</p>
<p>-Ryan</p>
Hi, Ryan
Thank you for your input! One more quick follow-up question, considering your example of specifying design=model.matrix(~Condition), and batch=Batch, what if I also have a random effect in my limma design? do I need to put that variable(subject as below) anywhere in the removeBatchEffect command or just ignore it?
Thanks,
Xiayu
Well, I'm not as familiar with random effects analysis, but the normal way to use duplicateCorrelation is to pass corfit$consensus as the correlation argument to lmFit. The help page for removeBatchEffect states that any additional arguments are passed to lmFit, so I think I would simply do likewise and pass the same correlation argument to removeBatchEffect.
Thanks a lot! I think you are right. I will do that. But what if the Batch itself was treated as a random effect in the model ?? Can I put it in batch=Batch in the removeBatchEffect command? Sorry but I promise this is the last question.
Thank you so much!
Xiayu
I don't think you can do that. removeBatchEffect works by treating the batches as fixed effects, so I don't think it's valid to specify a random effect there.
Hello,
Would that also be the way to proceed in case the batch effect was estimated via sva?
The reason for my doubt is that sva does receive the design matrix as its input (mod1 parameter). So since the batch effect was estimated while taking into account the design, I am not sure it is necessary to "take it into account" again.