Question: DESeq2: Continuous and discrete variables in the design
17 days ago, raquelgarza95 wrote:


I'm trying to get my head around some DESeq2 results and I would really appreciate some help.

I have two conditions (control and disease) and one continuous variable (age). Each condition comprises 100 individuals of different ages. I want to see which changes occur in the control group as age increases, and likewise in the disease group. (I know this would ideally be done with the same individuals at different time points, but I'm working with post-mortem tissue, so this is the best I can do.)

I have set my DESeq2 design as ~condition+age+condition:age

From resultsNames(dds) I get:

[1]  "Intercept" "condition_disease_vs_ctrl" "age" "conditiondisease.age"

I suppose that "conditiondisease.age" is the name I am interested in for the disease group. But is "age" the one for the control group (since it is the reference level), or is it age regardless of condition? If it is the latter, how can I get "conditionctrl.age"?

I also have a question about how to interpret the log2FC here. According to the vignette, it is the change per unit of the continuous variable (age). If the ages are integers, is the lowest value (the youngest in my setup) used as the reference point, or the highest (the oldest)?

And one more: do I have to sort age before giving it to DESeq2? I am guessing not, but it doesn't hurt to ask.

Thank you!!

deseq2 R design rna-seq
Answer: DESeq2: Continuous and discrete variables in the design
17 days ago, Michael Love wrote:

I would recommend ~condition + condition:age, which is a bit easier to interpret. You will get a separate age term for control and for disease, each of which you can pull out with results(dds, name="..."). You can then contrast the two with results(dds, contrast=list("...", "...")).
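
As a rough illustration of how the two designs relate, here is a minimal sketch in plain Python. It uses ordinary least squares on made-up log2 expression values, not DESeq2's negative binomial GLM, and the numbers, the `fit_line` helper and the dictionary layout are all assumptions for illustration. The coefficient names mirror what resultsNames(dds) would be expected to show with "ctrl" as the reference level, so check them against your own output. With two groups that each have their own intercept and slope, the full interaction model is just a reparametrization of one straight line per group:

```python
def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


# Made-up log2 expression for one gene: a different age trend in each group.
ages = [30, 45, 60, 75, 90]
ctrl_log2 = [5.0 + 0.02 * a for a in ages]
disease_log2 = [6.0 + 0.05 * a for a in ages]

ctrl_intercept, ctrl_slope = fit_line(ages, ctrl_log2)
dis_intercept, dis_slope = fit_line(ages, disease_log2)

# ~condition + condition:age  -> one age slope per group.
nested = {
    "Intercept": ctrl_intercept,
    "condition_disease_vs_ctrl": dis_intercept - ctrl_intercept,
    "conditionctrl.age": ctrl_slope,
    "conditiondisease.age": dis_slope,
}

# ~condition + age + condition:age  -> "age" is the slope in the reference
# group (ctrl) and the interaction is the DIFFERENCE between the two slopes.
interaction = {
    "Intercept": ctrl_intercept,
    "condition_disease_vs_ctrl": dis_intercept - ctrl_intercept,
    "age": ctrl_slope,
    "conditiondisease.age": dis_slope - ctrl_slope,
}

for design, coefs in (("nested", nested), ("interaction", interaction)):
    print(design, {name: round(value, 4) for name, value in coefs.items()})
```

Note that the same name, "conditiondisease.age", means different things in the two designs: the disease slope itself in the recommended design, but the disease-minus-control difference in slopes in the design from the question.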

The interpretation of the LFC is the log of the fold change in expression for one unit of the variable. There is no specific reference point: whatever value you treat as 0 (the youngest, the oldest, or the sample average), that choice is absorbed into the intercept and leaves the LFC unchanged.
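
To make the "folded into the intercept" point concrete, here is a small sketch in plain Python. Again it is ordinary least squares on invented log2 values rather than DESeq2 itself, and the data and the `fit_line` helper are assumptions for illustration. It fits the same data three times, with age entered as-is, shifted so the youngest is 0, and centered on the mean, and shows that only the intercept moves:

```python
def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


# Invented data for one gene: log2 expression drifting upward with age.
ages = [30, 42, 55, 61, 70, 78, 90]
noise = [0.05, -0.03, 0.02, -0.04, 0.01, 0.03, -0.04]
log2_expr = [4.0 + 0.03 * a + e for a, e in zip(ages, noise)]

youngest = min(ages)
mean_age = sum(ages) / len(ages)

# Same data, three different choices of where "age = 0" sits.
fits = {
    "raw ages (30-90)": fit_line(ages, log2_expr),
    "age minus youngest": fit_line([a - youngest for a in ages], log2_expr),
    "age minus mean": fit_line([a - mean_age for a in ages], log2_expr),
}
for label, (intercept, slope) in fits.items():
    print(f"{label:20s} intercept={intercept:.4f}  slope={slope:.6f}")

# The slope is a log2 fold change per ONE unit (here, one year). Over a
# longer span it scales linearly on the log scale.
slope = fits["raw ages (30-90)"][1]
fold_change_per_decade = 2 ** (10 * slope)
print(f"fold change over 10 years: {fold_change_per_decade:.3f}")
```

The intercept is the fitted value at whatever age is coded as 0, so it changes with the coding, while the per-year slope does not. That is why entering the ages as 30-90 needs no remapping in order to interpret the LFC.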


Thank you Michael! I got exactly what I needed from the first part. However, I'm struggling to understand the LFC explanation. What did you mean by the intercept? I didn't map the ages to values starting from 0 (maybe I should?); I input them as they were (30-90). Or maybe I didn't understand what you meant by the intercept. Sorry, it's hard for me to grasp a fold change with no reference.

written 17 days ago by raquelgarza95

I would suggest discussing this with a statistician to get a fuller answer to the question about a reference point for the continuous variable. The practical answer is that there is no reference point, but it would be good to talk it through with someone to understand why that is the case and how continuous variables work in linear models.

written 17 days ago by Michael Love

Hi again! Thank you, I talked about it with someone and I think I now understand what you meant: no matter how you set up the continuous variable (whichever value is set to be 0), the LFC in a linear model is the change per unit step of the continuous variable. This is crystal clear now :-).

But my question was more about how the 0 of the continuous variable is set, or how DESeq2 decides how to order this variable (that's what I was trying to say with "reference point", but it was the wrong term, I'm sorry). Is the 0 set to the minimum value or to something else? Or does it depend on how I sort it (for example, ordering colData by decreasing age)?

I guess it is the minimum value, but I don't want to risk a wrong interpretation, since this matters a lot with factors.

Sorry for the confusion, and thank you for all the help!

written 16 days ago by raquelgarza95

There is no reference point for the continuous variable.

written 16 days ago by Michael Love
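
On the sorting question raised earlier in the thread, the following sketch (plain Python, ordinary least squares on invented values, with hypothetical sample IDs and a hypothetical `fit_line` helper, so an analogy rather than DESeq2 itself) suggests why the order of a numeric covariate does not matter: the fit depends only on which age is paired with which sample, not on the order in which the samples are listed. Unlike a factor, a numeric variable has no levels, so there is nothing for the ordering to pick as a reference.

```python
import random


def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def fit_samples(rows):
    """Fit log2 expression against age for rows of (sample_id, age, log2)."""
    return fit_line([r[1] for r in rows], [r[2] for r in rows])


# Invented samples, deliberately not sorted by age.
ages = [61, 30, 90, 42, 78, 55, 70]
samples = [(f"s{i + 1}", a, 4.0 + 0.03 * a) for i, a in enumerate(ages)]

as_given = fit_samples(samples)
oldest_first = fit_samples(sorted(samples, key=lambda r: r[1], reverse=True))

shuffled_rows = samples[:]
random.Random(0).shuffle(shuffled_rows)
shuffled = fit_samples(shuffled_rows)

print("as given     ", as_given)
print("oldest first ", oldest_first)
print("shuffled     ", shuffled)

# What does go wrong: sorting the ages on their own, so that they no longer
# line up with the samples they belong to.
mismatched = fit_line(sorted(ages), [r[2] for r in samples])
print("ages sorted separately from samples", mismatched)
```

The first three fits agree, while the last one differs because the ages were detached from their samples. In DESeq2 terms, the corresponding requirement is that the rows of colData are in the same order as the columns of the count matrix; beyond that, sorting by age is unnecessary.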