Question

Combining two bulk RNA seq datasets

0

Entering edit mode

Tanvi • 0

@b7729e52

Last seen 17 months ago

Switzerland

Hello, I am trying to combine multiple bulk RNA Seq datasets which contain multiple conditions in all of them. I obtained the counts matrix using featurecounts and I am not sure if I should normalize each dataset and then combine them using ComBat or Limma or combine them first and then normalize and log transform. Additionally, I would like to perform differential expression analysis and gene set enrichment analysis on the dataset, therefore would be happy to know the best course of action here.

RNASeqData DESeq2 Normalization • 2.0k views

ADD COMMENT • link 19 months ago Tanvi • 0

score 1 · Answer 1 · 2024-06-17

1

Entering edit mode

W. Evan Johnson ▴ 870

@w-evan-johnson-5447

Last seen 19 months ago

United States

Hello, I would recommend you use BatchQC (bioconductor package) to evaluate the extent to which batch effects impact your data. Not all batches of data require correction, and the less you need to do the better. Can you ignore the batch effect? Can you merely include batch as a covariate in your model? Or do you need to apply ComBat to your log counts per million or ComBat-Seq to your counts data? It depends on the dataset and your batch effects which is the best strategy--.

ADD COMMENT • link 19 months ago W. Evan Johnson ▴ 870

0

Entering edit mode

Thank you for your suggestion. However, my R version on server is 4.1.1 and BatchQC requires R version more than 4.3, but I created PCA plot to check for batch effect and I do not think there is a significant batch effect. Thanks for your help again!

ADD REPLY • link 19 months ago Tanvi • 0

score 0 · Answer 2 · 2024-06-17

0

Entering edit mode

James W. MacDonald 68k

@james-w-macdonald-5106

Last seen 1 day ago

United States

This question is off-topic for this site. You might try over on biostars.org instead.

ADD COMMENT • link 19 months ago James W. MacDonald 68k