DESeq2 for survival analysis
array chip ▴ 420
@array-chip-4136
Last seen 9 months ago
United States

Dear all,

I am new to RNA-seq analysis with Bioconductor. Can I use DESeq2 directly to perform survival analysis (e.g. Cox regression)? If not, am I right that I should apply the rlog or VST transformation to the count data and then run Cox regression separately?

My second question: I have an RNA-seq dataset with 1000 samples (20000 genes), and running rlog on it is prohibitive because it runs out of memory. Any suggestions on how to get around that?

Are there any other DE analysis packages (edgeR, limma, etc.) that can perform survival analysis in addition to linear models?

Thanks!

John

deseq2 survival rlog transformation • 4.7k views

Thank you Michael! This is very helpful.


Michael, just to make sure: if I want to use SAMseq, I should use raw counts without any normalization or transformation (not even sequencing depth adjustment), just like with DESeq(), correct?


Yes, raw counts.

@mikelove
Last seen 16 hours ago
United States

We do not have a Cox PH regression model built into DESeq2. You could use variance stabilized counts for downstream methods.
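As a rough illustration of such a downstream method, here is a minimal sketch of per-gene univariate Cox regression with the survival package. Everything in it is simulated: `expr` stands in for a genes x samples matrix of variance stabilized values (in a real analysis, something like `assay(vst(dds))`), and `time`, `status` and the helper `fit_one` are hypothetical names chosen for the example.

```r
library(survival)

set.seed(1)
n_samples <- 60
n_genes <- 5

# Simulated stand-in for transformed expression values (genes x samples)
expr <- matrix(rnorm(n_genes * n_samples, mean = 8, sd = 1),
               nrow = n_genes,
               dimnames = list(paste0("gene", seq_len(n_genes)), NULL))

# Simulated follow-up time and event indicator (1 = event, 0 = censored)
time <- rexp(n_samples, rate = 0.1)
status <- rbinom(n_samples, size = 1, prob = 0.7)

# Fit one univariate Cox model for a gene and return the key statistics
fit_one <- function(x) {
  fit <- coxph(Surv(time, status) ~ x)
  s <- summary(fit)$coefficients
  setNames(s[1, c("coef", "exp(coef)", "Pr(>|z|)")],
           c("log_hr", "hr", "p"))
}

# One row per gene, then adjust p-values across genes (Benjamini-Hochberg)
res <- as.data.frame(t(apply(expr, 1, fit_one)))
res$padj <- p.adjust(res$p, method = "BH")
res
```

With 20000 genes the same loop applies unchanged; clinical covariates would be added to the model formula if needed.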

Another option, if you have sufficient sample size, is to use the survival approach implemented in the SAMseq function (for this approach you should provide raw counts, not transformed by DESeq2 functions):

https://www.rdocumentation.org/packages/samr/versions/2.0/topics/SAMseq

In the vignette and in the workflow, we recommend using the VST for datasets with many samples, rather than attempting to use the rlog.

For 1000+ samples, even the VST might take a long time. You might instead use:

dds <- estimateSizeFactors(dds)
ntd <- normTransform(dds)

which simply applies a log2(x + 1) transformation to the normalized counts. You can raise the pseudocount (e.g. to 5 or 10) via the pc argument of normTransform to produce more shrinkage of the log counts.
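To make the transformation concrete, here is a simplified base-R sketch of what the two calls above compute: median-of-ratios size factors followed by log2(normalized count + pseudocount). The count matrix is simulated, and `log_norm` is a hypothetical helper written for this example, not a DESeq2 function; DESeq2's own implementation handles more edge cases.

```r
# Simulated raw counts (genes x samples); a stand-in for counts(dds)
set.seed(1)
counts <- matrix(rnbinom(200 * 6, mu = 100, size = 2), nrow = 200)

# Median-of-ratios size factors, using only genes with no zero counts
log_counts <- log(counts)
log_geo_means <- rowMeans(log_counts)
keep <- is.finite(log_geo_means)
size_factors <- apply(log_counts[keep, , drop = FALSE], 2,
                      function(x) exp(median(x - log_geo_means[keep])))

# log2(normalized count + pseudocount); pc = 1 gives log2(x + 1)
log_norm <- function(counts, size_factors, pc = 1) {
  log2(sweep(counts, 2, size_factors, "/") + pc)
}
ntd1 <- log_norm(counts, size_factors, pc = 1)
ntd5 <- log_norm(counts, size_factors, pc = 5)

# Per-gene spread under each pseudocount: a larger pc compresses it
summary(apply(ntd1, 1, sd))
summary(apply(ntd5, 1, sd))
```

The larger pseudocount flattens the log curve most at low counts, which is where the extra shrinkage comes from.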


I have a tutorial on Biostars about this: https://www.biostars.org/p/344233/

As Michael implies, use the variance-stabilised expression values, if feasible.


Kevin, following your tutorial, would you still do the z-score normalisation after a vst?


Hey, yes, you could try that.
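For reference, per-gene z-scoring of an already transformed matrix could be sketched as below. The matrix `expr` is simulated and only stands in for variance-stabilised values (genes x samples); it is not taken from the tutorial. One practical effect of z-scoring is that a Cox hazard ratio is then expressed per standard deviation of expression, which makes it easier to compare across genes.

```r
# Simulated stand-in for variance-stabilised expression (genes x samples)
set.seed(1)
expr <- matrix(rnorm(4 * 10, mean = 8, sd = 2), nrow = 4,
               dimnames = list(paste0("gene", 1:4), NULL))

# scale() centres and scales columns, so transpose to work per gene
z <- t(scale(t(expr)))

# Each gene now has mean 0 and standard deviation 1 across samples
round(rowMeans(z), 10)
apply(z, 1, sd)
```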
