Combining two bulk RNA seq datasets
2
0
Entering edit mode
Tanvi • 0
@b7729e52
Last seen 4 months ago
Switzerland

Hello, I am trying to combine multiple bulk RNA Seq datasets which contain multiple conditions in all of them. I obtained the counts matrix using featurecounts and I am not sure if I should normalize each dataset and then combine them using ComBat or Limma or combine them first and then normalize and log transform. Additionally, I would like to perform differential expression analysis and gene set enrichment analysis on the dataset, therefore would be happy to know the best course of action here.

RNASeqData DESeq2 Normalization • 953 views
ADD COMMENT
1
Entering edit mode
@w-evan-johnson-5447
Last seen 6 months ago
United States

Hello, I would recommend you use BatchQC (bioconductor package) to evaluate the extent to which batch effects impact your data. Not all batches of data require correction, and the less you need to do the better. Can you ignore the batch effect? Can you merely include batch as a covariate in your model? Or do you need to apply ComBat to your log counts per million or ComBat-Seq to your counts data? It depends on the dataset and your batch effects which is the best strategy--.

ADD COMMENT
0
Entering edit mode

Thank you for your suggestion. However, my R version on server is 4.1.1 and BatchQC requires R version more than 4.3, but I created PCA plot to check for batch effect and I do not think there is a significant batch effect. Thanks for your help again!

ADD REPLY
0
Entering edit mode
@james-w-macdonald-5106
Last seen 11 hours ago
United States

This question is off-topic for this site. You might try over on biostars.org instead.

Login before adding your answer.

Traffic: 1052 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6