Hi. I am new to RNA-seq data expression analysis and to DESeq2. I have a RNA-seq dataset, and in its annotation data is the column of 'gender', 'age', etc. apart from the column of interest, which is 'condition'.
I wanted to ask that, how can I correct for the columns apart from the 'condition' column. i.e. for 'gender', 'age', etc.
Please excuse me for being a newbie. Thanks.
~gender+whatever+you+want+to+correct+for
, see the section ondesign
formula in the vignette.I see. Yeah, I took a look at the vignette, and it says the same thing you mentioned in addition to 'condition' being the last entry in design.
So, is it something like that?
The order does not matter if you specify the contrast. It's smart to always specify the contrast, if only so you know what you are comparing to what.
Also consider carefully the possible ramifications of making the age data into factors.
Oh okay. I will specify the contrast in that case as well.
So, if there are ramifications, does that mean, that I cannot include 'age' into the design? Or, is there a way to include it in the design without any problems? So far, I was thinking of doing this:
where gender has values 'M' and 'F', and age varies continuously from 18 to 80.