Hi,
I am currently following the RUVseq vignette to perform RUVg normalization and to identify potential unwanted variation in my dataset. My RLE plot before and after RUVg normalization suggests that the unwanted variations have been addressed. However, I have now stumbled upon the question about what really are the RUVg normalized counts stored in object@assayData$normalizedCounts
, and what types of exploratory analyses can I perform with it as input? Based on my understanding (I might be wrong), the RUVg normalized counts are obtained by regressing the original counts on the unwanted factors. So how different are these normalized counts compared with upperquartile normalization, and if these are essentially raw counts regressed on unwanted variation, why aren't they suitable as input for DE analysis?
Furthermore, I would like to manually draw my own batch-resolved PCA and perform hierarchical clustering after RUVg normalization. With this, should I use object@assayData$normalizedCounts
as input, or is there anyway to do this with the unwanted variation identified by RUVg. I think VST normalization of raw counts
from DESeq2
does a really nice job in performing these types of visualizations. In this case, does using the RUVg normalized counts as input serve the same purpose on generating batch-resolved visualizations. Thanks for the help!
You can use
removeBatchEffects
fromlimma
on thevst
-transformed counts, and provide the RUV factors toremoveBatchEffects
via thecovariates
argument. This visualizes the effect of regressing these factors.Oh wow. Never thought about the combination of using both
Limma
andDESeq2
, though it sounds technically feasible. That being said, do you have any recommended tutorials or vignettes on how to do this correctly, since I guess this procedure is quite inordinary, so probably having a guide is the safest approach.It's perfectly ordinary, even the vignette mentions it:
https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#why-after-vst-are-there-still-batches-in-the-pca-plot