Recommendations for normalisation post-deseq2
1
0
Entering edit mode
@marcosullivan-19786
Last seen 5.2 years ago

Hello,

Can anyone here recommend any posthoc normalisation steps following deseq2 before generating Partial Least Squared Discriminant Analysis (PLSDA) and VIP (Variance of Importance Plots)?

I presume it's not appropriate to just perform this code and use it to generate a PLSDA with these counts?

Generates DeSeqDataSet Normalisedcounts and prints as CSV

dds <- estimateSizeFactors(dds) Normalisedcounts<-counts(dds, normalized=TRUE) write.csv(Normalisedcounts, file="27_Norm.csv")

Apologies if this seems like an uneducated question but I'm new to both 'big data' statistics and deseq2.

deseq2 normalization posthoc • 606 views
ADD COMMENT
1
Entering edit mode
@mikelove
Last seen 1 day ago
United States

I don't know what these plots are.

Take a look at the vignette and workflow for DESeq2. We have counts(dds, normalized=TRUE) which just scales the counts for library size.

There are also transformations which are good for computing distances between samples and plotting. There is a lot of information on these in the vignette and workflow (and the publication), so check there first and then come back here with any remaining questions.

ADD COMMENT
0
Entering edit mode

Thanks for your feedback on this I've now read up on rlog and vsd in the vignette, can I confirm that the VST is a transformation after normalisation according to size factors. As this is my interpretation from your vignette below:

"Above, we used a parametric fit for the dispersion. In this case, the closed-form expression for the variance stabilizing transformation is used by the vst function. If a local fit is used (option fitType="locfit" to estimateDispersions) a numerical integration is used instead. The transformed data should be approximated variance stabilized and also includes correction for size factors or normalization factors. The transformed data is on the log2 scale for large counts."

If so would this code be correct for generating an VST and size factor normalised output from your package?

Code for generating a VST and outputting to CSV

vsd <- assay(varianceStabilizingTransformation(dds, blind=FALSE))
write.csv(vsd, file="27_Norm2.csv")
ADD REPLY
1
Entering edit mode

Yes but to be clear I would do:

vsd <- vst(dds, blind=FALSE)
mat <- assay(vsd)
write.csv(mat, file="file.csv")

vsd in our documentation denotes a variance stabilized dataset, which means it has attached phenotypic data. I use mat when it's a simple matrix.

ADD REPLY

Login before adding your answer.

Traffic: 431 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6