Error in Differential Expression Analysis with Data Set from Matrix
Entering edit mode
didemdkn • 0
Last seen 10 days ago

I have an expression matrix containing RSEM values for genes. I want to run DESeq2 on these file using DESeq2 package. Since the RSEM values are values like "54.87632", I multiplied all the values with 10000 to make all of them integers. Then, I uploaded both the expression data and the metadata. I checked the colnames/rownames. Then, I run the code and having the error:

dds <- DESeqDataSetFromMatrix(countData = expression,
                              colData = metadata,
                              design = ~ Group)

#then the error:

converting counts to integer mode
NAs introduced by coercion to integer rangesome variables in design formula are characters, converting to factorsError in validObject(.Object) : 
  invalid class “DESeqDataSet” object: NA values are not allowed in the count matrix

Since I dont have any missing value:


I don`t know how to correct my data/what is wrong about my data. Is there anyone who have the same issue/solution for this problem?

Thank you for your reccomendations!

DESeq2 DifferentialExpression Bioconductor • 117 views
Entering edit mode
Last seen 2 days ago
United States

We have a pipeline for using RSEM sample.genes.results files with DESeq2:

First use these instructions:

Then you can use DESeqDataSetFromTximport.

Entering edit mode

Thank you for yor help.

However, I don't have the RSEM output files. I just downloaded level 3 data of TCGA-BRCA study and I only have the table of expressions and patient codes. The column names are the patient names and the row names are the gene names. That's all I have. Thus, I cannot use RSEM importing. That's why I tried to use "DESeqDataSetFromMatrix".

I don't have the raw data and cannot run RSEM by using them.

Again thank you for your help. If you have any suggestion about this error, it would be very good for me. Since I don't have any former bioinformatics education, I hope I did not think wrongly about this issue.

Entering edit mode

You can round the estimated counts from RSEM and provide to DESeqDataSetFromMatrix. I don't have code for importing the summary data from TCGA, so you will have to do that part on your own.

It will not take into account the gene length changes that we worked on in tximport (see that publication for details).


Login before adding your answer.

Similar Posts
Loading Similar Posts
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.3