Question

DESeq2 batch correction and possible ComBat/SVA over-correction of RNA-seq data

0

Entering edit mode

grimmerucsf • 0

@grimmerucsf-9682

Last seen 6.5 years ago

We have three RNA-seq batches for ~60 samples where batches are known and the "condition" is not well-distributed across batches. For example:

batch A = all tumor subtype 1

batch B = all tumor subtype 2

batch C = a mix of subtype 1 and subtype 2

For this reason, I suspect batch correction may not be possible due to perfect confounding on batches A/B. Assuming DESeq2 is not able to use the mixed batch C as an intermediary to do all 3 at once, it seems like I would need to split batch C into subtype 1 and subtype2 and do two separate batch corrections:

batch A + batch C subtype 1

batch B + batch C subtype 2

This would make the subtypes impossible to compare, but if that's the best I can do, so be it.

Secondly, assuming one of these above approaches works, I am not finding much detail on what DESeq2 actually does for default batch correction when having design = ~condition+batch for DESeqDataSetFromMatrix and having VST and RLOG transformations as blind = FALSE. Does this "DESeq batch correction" fully correct for batches, to where ComBat or SVA would be unneccessary afterwards? PCA results from this DESeq batch correction plus ComBat seem way too clean, with groups splitting too neatly, which makes me suspect f over-biasing the corrections for condition. If I want to normalize the data using DESeq2 and then run ComBat or SVA, should I steer clear of all batch information in DESeq2? Meaning I should designate design = ~condition for DESeqDataSetFromMatrix and having VST and RLOG transformations as blind = TRUE?

I apologize if this is a repeat question, but I can't seem to find much advice on these possible overlaps between two really useful and widespread tools.

deseq2 combat sva rna-seq • 1.7k views

ADD COMMENT • link updated 6.5 years ago by Michael Love 43k • written 6.5 years ago by grimmerucsf • 0

score 0 · Answer 1 · 2018-09-17

0

Entering edit mode

Michael Love 43k

@mikelove

Last seen 8 days ago

United States

The DESeq2 transformations do no correction based on the design. The only apply a log like transformation to the values.

Does that answer your question?

ADD COMMENT • link 6.5 years ago Michael Love 43k

0

Entering edit mode

If the batch designations come into play later in the DE calculations, but not at all in the transformations, it seems that ComBat may be the culprit. Thanks!

ADD REPLY • link 6.5 years ago grimmerucsf • 0