DESeq2 Input format
uzer • 0

Hi all,

I have extracted raw count data from a raw reads file, and my data set is as follows: the first column is the gene name, and the next four columns are counts for each sample, the first two being control data and the next two being experimental data of interest. It is very simple, and looks like the example below. Please note I have already cleaned the data and accounted for feature overlap and intersection.

gene_symbol_1 1 2 3 4
gene_symbol_2 2 3 4 5
gene_symbol_3 0 11 2 7


The parameters for DESeqDataSetFromMatrix() are as follows:

countData := can just be the raw counts

but I am confused about how to enter the variables colData and design.

How can I encode the colData matrix? After I have properly encoded colData, how do I input design? I have looked at the following sources, but they are not so clear in my context, because I do not have a "summarized experiment" object:

Thank you in advance for your help.

deseq2 counts • 2.2k views

colData is a data.frame or DataFrame that contains information about the columns of the count matrix, i.e. the samples. You can read more about these arguments by typing ?DESeqDataSetFromMatrix in the R console.


With a simple two vs two you can do:

colData <- data.frame(condition=factor(c("C","C","T","T")))
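If it helps, here is a slightly more explicit sketch of the same colData. The sample names (ctrl_1, trt_1, ...) are made up for illustration, and setting the factor levels by hand is just one way to make sure the control group "C" ends up as the reference level. The key assumption is that the rows of colData are in the same order as the sample columns of the count matrix.

```r
# Hypothetical sample names; replace with the column names of your count matrix.
samples <- c("ctrl_1", "ctrl_2", "trt_1", "trt_2")

# One row per sample, in the same order as the count matrix columns.
# Listing "C" first in `levels` makes the control group the reference level.
colData <- data.frame(
  condition = factor(c("C", "C", "T", "T"), levels = c("C", "T")),
  row.names = samples
)

# Quick look at what was built.
print(colData)
print(levels(colData$condition))
```

Giving colData row names that match the count matrix column names makes it easy to check the ordering before handing both objects to DESeq2.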

The design is an R formula which tells DESeq2 how you want to analyze the data. If you look at the DESeq2 help pages or vignette, you will see that the design for such a comparison is simply: ~ condition.
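To put the pieces together, here is a rough sketch using the three example genes from the question. The sample names and the idea of reading the table with read.table() are my assumptions, not something from the original post; with a real file you would pass a file name instead of `text =`. The model.matrix() call is only there to show what ~ condition encodes (an intercept plus a treated-vs-control indicator). The DESeq2 call is wrapped in a requireNamespace() check so the snippet still runs where the package is not installed.

```r
# Stand-in for the count file: gene name, then four count columns.
# Column names are hypothetical.
raw <- read.table(
  text = "gene_symbol_1 1 2 3 4
gene_symbol_2 2 3 4 5
gene_symbol_3 0 11 2 7",
  row.names = 1,
  col.names = c("gene", "ctrl_1", "ctrl_2", "trt_1", "trt_2")
)
counts <- as.matrix(raw)  # genes in rows, samples in columns

# Sample table: one row per column of `counts`, in the same order.
colData <- data.frame(
  condition = factor(c("C", "C", "T", "T"), levels = c("C", "T")),
  row.names = colnames(counts)
)

# What the design formula means for these four samples.
mm <- model.matrix(~ condition, data = colData)
print(mm)

# Build the DESeq2 object only if the package is available.
if (requireNamespace("DESeq2", quietly = TRUE)) {
  dds <- DESeq2::DESeqDataSetFromMatrix(
    countData = counts,
    colData   = colData,
    design    = ~ condition
  )
}
```

Note that no SummarizedExperiment object is needed as input here: a plain integer matrix and a data.frame are enough.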

