DESeq2 analysis on time-series RNA experiment
1
0
Entering edit mode
island • 0
@91622621
Last seen 7 months ago
United Kingdom

Hello,

Thank you for the creation of the DESeq2 tool! It has been very useful in understanding the requirements and downstream analysis of an rnaseq experiment.

At the moment I have a phd student who has a time-course experiment where they have low mapping rates for significant portion of their genes. I would like to check a few things, largely the design formula and potential issues with batch effect and time-series. Sadly our stats team is unfamiliar with these concepts and the tool as of yet and has helped as much as they can.

The project owner has two groups, in-vitro & in-vivo, the latter undergo three conditions, condition_1, condition_2, condition_3. I believe there are only two replicates per in-vitro sample.

To identify and compare the top DEGs within the different areas I have made the following design models using the LRT method to test:

  • different conditions (regardless of group):
    ~ condition
    # (reduced model: ~ 1)
    
  • the differences between the two groups:

    ~ group
    # (reduced model: ~ 1)
    
  • the effect each condition has on the groups:

~ group + condition + group:condition
# (reduced model: ~ group + condition)
  • the effect the condition has over time, in each group:
    ~ time_point + group + condition + group:condition
    # (reduced model: time_point + group + condition)
    

I have been informed that one of the groups is missing 2 intermediate time-points and I wanted to ask how this will effect the analysis.

I have also considered using RUVseq to remove any potential batch effect from the data but also looked into using this design:

~ time_point + batchRun + group + condition + group:condition
# (reduced model: ~ time_point + batchRun + group + condition)

From the tutorials and workshop of matrix designs, I unfortunately haven't entirely understood how the contrast or name arguments are used to define the model, including during the use of the results() function.

Many thanks,

RNASeqData DESeq2 RNASeq • 657 views
ADD COMMENT
1
Entering edit mode
@mikelove
Last seen 3 hours ago
United States

Unfortunately, due to restrictions on my time, I have to reserve time on the support site for answering software-related questions. For questions about statistical design and analysis plan, I recommend to work with a local statistician or someone familiar with linear models. DESeq2 uses the same linear model terms as basic linear models in R, so anyone with experience with linear models can help with analysis interpretation.

Regarding RUVseq, I do recommend it's use broadly, especially if PCA reveals what could be substantial technical variation across samples that isn't part of the experimental design. By including the experimental design factors in the construction of RUV factors, you ensure they are orthogonal and therefore can use them in the design (this is in the workflow, your link 1).

ADD COMMENT
0
Entering edit mode

Michael Love thank you for your quick response!

The clustering looked fine for the pilot study and didn't show huge variations so far, so we'll continue on with this..

Apologies I am unsure if this is a statistical design question or just on experimental setup. Can I check whether the experimental design of having at least one of each group and condition in each sequencing run is best to prevent linear combinations? Thanks,

ADD REPLY

Login before adding your answer.

Traffic: 728 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6