I want to do some analysis with the GSVA function in R, but I want to ask you if the matrix should be just normalized with DESeq2 or also log trasformed. I did this passage:
dds_campione <- DESeqDataSetFromMatrix(countData=raw,
colData=condition_breakfast,
design=~ Campione)
smallestGroupSize <- 3
keep <- rowSums(counts(dds_campione) >= 10) >= smallestGroupSize
dds_campione <- dds_campione[keep,]
matriceFiltrata <- counts(dds_campione, normalized = TRUE)
risultati_ssGSVA_temba <- gsva(expr = as.matrix(matriceFiltrata),
gset.idx.list = tebua)
Is it ok or I also have to log trasformed the matrix before the last passage? Thank you
When I log trasform my matrix after the DESeq2 normalization, I have some negative value, should I add a 1 to all my matrix?
Thank you
Michela
No, negative values may occur after log transformation when the
log(x)
is taken from a positive numberx with 0 < x < 1
, and therefore expression level quantities such as log CPM or log TPM units of expression may be negative when CPM or TPM units are between 0 and 1, which arise from low levels of expression, but GSVA has no problem with that. What it is also recommended in general, for analysis such as differentialy expression, but also for GSVA, is to filter out lowly-expressed genes.