Question

do cox and km plot

0

Entering edit mode

linouhao ▴ 20

@linouhao-15901

Last seen 13 months ago

United States

if I use deseq or edger or limma to do count differential analysis, what kind of data should be sent to do cox km lasso and so on downstream analysis, cpm(x), vst(x), rlog(x), voom(x) or anything else.

thanks a lot

deseq2 • 1.3k views

ADD COMMENT • link updated 3.7 years ago by Michael Love 42k • written 3.7 years ago by linouhao ▴ 20

score 1 · Answer 1 · 2020-09-01

1

Entering edit mode

Michael Love 42k

@mikelove

Last seen 1 day ago

United States

In DESeq2, we recommend VST for downstream tasks aside from DE.

ADD COMMENT • link 3.7 years ago Michael Love 42k

0

Entering edit mode

thanks a lot. do you know edger, does it use cpm is ok? do you know limma, does it use voom is ok?

ADD REPLY • link 3.7 years ago linouhao ▴ 20

1

Entering edit mode

The suggested way to use the support site if you are sending notifications to package developers, is to ask help on their package specifically. I only have time to provide support for my package. If you have a question about another package, make a new post and tag only that package.

ADD REPLY • link 3.7 years ago Michael Love 42k

0

Entering edit mode

thanks a lot. so if I want to just one gene difference in two groups(for example show a differential gene analysed by deseq2), I see people always use tpm or fpkm, but not vst(dds), how to understand it,

ADD REPLY • link 3.7 years ago linouhao ▴ 20

0

Entering edit mode

Hi linouhao, please use either VST if using DESeq2, or CPM (with a prior count) if using EdgeR or limma-voom. For further information on distinguishing between EdgeR and limma-voom, please see here: https://support.bioconductor.org/p/59846/#59917

Please do not use FPKM units for a Cox model. FPKM units are not suitable for anything where cross-sample comparisons are to be performed.

ADD REPLY • link 3.7 years ago Kevin Blighe ★ 3.9k

0

Entering edit mode

CPM (with a prior count) ， how to set the prior count, just 1 or 2 or other int number ,and should cpm use the log=T or not， this cpm value seems just using for filter or downstream analysis other than DE, so I just use cpm(x, log=T, prior.count=1) apply for all, is it ok? thanks a lot

ADD REPLY • link 3.7 years ago linouhao ▴ 20

1

Entering edit mode

Hey, for Cox models, yes, I would use log = TRUE. Regarding the prior count, I would use the default and then check the data distribution via a histogram. Basically, for a Cox model, we want a data distribution that will result in normally-distributed residuals after the model has been fit to the data (so, a good starting point is that the data distribution itself is normal - think 'bell curve').

We could not use FPKM units or RNA-seq count data for a Cox due to the fact that these are not measured on a normal distribution.

ADD REPLY • link 3.7 years ago Kevin Blighe ★ 3.9k

0

Entering edit mode

thanks a lot. if I wan to use TCGA data as train data, and geo as validation, one is counts data, one is microarray, how should I do?

ADD REPLY • link 3.7 years ago linouhao ▴ 20

0

Entering edit mode

If you are using one dataset as a training and the other as a validation, it technically does not matter that they are from different sources (microarray versus RNA-seq), as they will be analysed independently. Same is true if you are just aiming to replicate results across the 2 studies. However, if you want integrate these datasets, then that requires further thought.

Anyway, that is a more general question, so, please ask on a forum for general Bioinformatics, e.g., Biostars (I am also Moderator on Biostars). For the record: I am sure that I have already answered questions on this topic on Biostars, if you can search via your search engine of choice.

ADD REPLY • link 3.7 years ago Kevin Blighe ★ 3.9k

0

Entering edit mode

but when you need to make a model, for example, a model contains several gene expression, you will find the editor will ask you the data comes from different platform, can it use the same fomula? thanks a lot