Question: Can FPKM or TPM values be used as RUVs input ?
gravatar for weiwsmiling
9 months ago by
weiwsmiling0 wrote:


I am trying to normalize my RNA-seq expression using spike-ins by RUVSeq package. However, I noticed that the input for RUVseq is count-based expression. My expression matrix is based on RPKM or TPM from STAR-RSEM pipeline. So I want to ask could RUVSeq also work for RPKM or TPM values? Do I must calculate count values instead?



ADD COMMENTlink modified 8 months ago by davide risso420 • written 9 months ago by weiwsmiling0
gravatar for davide risso
8 months ago by
davide risso420
Weill Cornell Medicine
davide risso420 wrote:

Hi Wei,

RUVSeq is designed to work with gene-level counts. If you are working with TPM or FPKM quantities, you may want to use the ruv package on CRAN or the RUVnormalize package in Bioconductor, depending if you have a supervised or unsupervised problem.

Although these packages were designed for microarray data, you can try to use as input the log of FPKM / TPM, essentially assuming a linear model for these data.

An alternative would be to compute the expected counts from RSEM (perhaps rounding them to nearest integer) and assume that you're dealing with actual counts, using the regular RUVSeq pipeline.

Note that both approaches are non-standard uses of the packages, hence it is extra important for you to check the assumptions of the model (i.e., linear models for ruv / RUVnormalize and log-linear model for RUVSeq).

ADD COMMENTlink written 8 months ago by davide risso420
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.2.0
Traffic: 342 users visited in the last hour