Question

RNA-seq DESeq2 factor design

0

Entering edit mode

Grace • 0

@bc49e602

Last seen 3.1 years ago

Australia

Hi,

I am quite new to RNAseq and this is my first analysis and I am not sure whether I did my analysis correctly in my pipeline. I have been searching online to see whether people have similar experiment layout like mine but couldn't really find one, and I have read through the Beginner's guide for DESeq2 and the vignette and also the great rmarkdown file to explain the factor for DESeq2 (https://rstudio-pubs-static.s3.amazonaws.com/329027_593046fb6d7a427da6b2c538caf601e1.html#example-1-two-group-comparison)

Briefly explaining my experimental layout. I am working on a cotton moth, which the experiment involves a lab susceptible line (Gr), and a CRISPR/Cas9 knockout line that was created from Gr, called Cad, the CRISPR/Cas9 line resulted resistance to a toxin. The experiment also involves treatment with a toxin, but only on Cad line, not on Gr.

Therefore, I have 3 groups that were sequenced, 1: Gr (untreated), 2: Cad (untreated) and 3: Cad (treated). I am interested to find out 2 things from RNAseq, GOAL 1: run DGE analysis to see whether the CRISPR/Cas9 knockout would result in any DGE differences, for group Gr (untreated) and Cad (untreated). AND GOAL 2: run DGE analysis to see whether the treatment with the toxin would result in any DGE differences, for group Cad(untreated) and Cad (treated).

My sample info is created as

              phenotype   treatment
GRuntreat1     sus        untreated
GRuntreat2     sus        untreated
GRuntreat3     sus        untreated
CADuntreat1    res        untreated
CADuntreat2    res        untreated
CADuntreat3    res        untreated
CADtreat1      res          treated
CADtreat2      res          treated
CADtreat3      res          treated

My DESeq2 design at the moment is

DESeqDataSetFromMatrix(countData = counts_reverse,
                                      colData = metaData,
                                      design = ~ phenotype + treatment)

However I feel like I have made a mistake here after reading the document https://rstudio-pubs-static.s3.amazonaws.com/329027_593046fb6d7a427da6b2c538caf601e1.html#example-1-two-group-comparison and reading through Michael and other bioinformatician's reply to other people's design questions.

As I am not interested in discovering the interactions between the treatment and the phenotype as Gr was not treated at all. At the moment my design is based on the treatment as the main factor with 2 levels, treated and untreated, and taking the variance in phenotype into account.

However after reading the information online, I am in doubt about my design... I am seeking an suggestion whether I should treat my sequenced groups as two completely independent analysis.

i.e. change my sample info (metadata) and redo the analysis as

DESeqDataSetFromMatrix(countData = counts_reverse,
                                      colData = metaData,
                                      design = ~ phenotype )

For only the Gr untreated and Cad untreated and set the Gr as reference and

DESeqDataSetFromMatrix(countData = counts_reverse,
                                      colData = metaData,
                                      design = ~ treatment )

For only the CAD untreated and CAD treared and set CAD untreated as the reference Thanks for your valuable time and advice!

Cheers, Grace

DESeq2 • 949 views

ADD COMMENT • link 3.3 years ago Grace • 0

score 0 · Answer 1 · 2022-10-17

0

Entering edit mode

swbarnes2 ★ 1.4k

@swbarnes2-14086

Last seen 4 weeks ago

San Diego

Different designs are better suited for different questions. What exact question are you trying to answer?

ADD COMMENT • link 3.3 years ago swbarnes2 ★ 1.4k

0

Entering edit mode

Hi swbarnes2,

Thanks for your response. I aimed to seek two different answers from the three sets of sequenced groups, 1: GR (untreated) 2: Cad (untreated) 3: Cad(treated). I wish to investigate 1: whether genome editing would result in anything in differential gene expression for GR untreated and Cad untreated, neglecting the treatment, and 2: and the changes of differential gene expression after exposing the resistant strain (CAD) to treatment, for Cad untreated and Cad treated groups, this part is neglecting the GR.

That's why I am thinking maybe my current design wasn't all ideal, as for GOAL1, I am not interested in taking the treatment into consideration and for GOAL2, I am not interested in taking phenotype into consideration.

Cheers, Grace

ADD REPLY • link 3.3 years ago Grace • 0