Question about normalizing and removing batch effects from bulk RNA-seq data from NCBI-SRA
1
0
Entering edit mode
@aaed3153
Last seen 9 months ago
United States

I have about 5210 runs from ~500 studies and want to use this matrix for downstream analysis like expression analysis. I want to look at the expression of certain genes across certain conditions I'm interested in. I have the expression matrix and need to normalize the matrix and remove batch effects in the data. I read up online and it seems that I run svaseq on the count data to find batches and then run CombatSeq to which I provide the batches which can then remove the batches. Is this the right way to do this? There are a lot of things online and I'm immensely confused. I would appreciate any feedback! Thank you!

Arabidopsis_thaliana_Data RNASeqR Normalization BatchEffect • 831 views
ADD COMMENT
0
Entering edit mode
@james-w-macdonald-5106
Last seen 7 hours ago
United States

You don't use svaseq prior to ComBat_seq, they are different things.

If you already know the batches and simply want to remove the technical differences between batches, then ComBat_seq will do that for you. If you suspect that there are technical differences between samples, one possibility being a batch effect, then you can use svaseq to estimate surrogate variables that you then use in your linear model to account for the technical variability. Have you read the vignette?

ADD COMMENT

Login before adding your answer.

Traffic: 665 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6