Question

DESeq2: How to find out how well the model fits the data?

1

Entering edit mode

lina.faller ▴ 10

@linafaller-9082

Last seen 7.0 years ago

Hi all,

I am using DESeq2 to calculate differentially expressed genes. Is there a good way to determine how well the model fits the data? I'd appreciate it if you could share any insight or resources.

Thanks!
~Lina

rnaseq linear model fit deseq2 • 2.6k views

ADD COMMENT • link updated 6.9 years ago by Michael Love 41k • written 7.0 years ago by lina.faller ▴ 10

0

Entering edit mode

Hi!!

I have the same question. I've got several factors and I would like to know if using all of them creates a better model. Did you find anything about it? I've been looking for an answer, but still not luck.... Thanks!!

ADD REPLY • link 6.0 years ago ceboral • 0

0

Entering edit mode

You seem to be thinking that you are fitting a single model. You aren't. Instead you are simultaneously fitting thousands of models, so asking 'how well a model fits' is in some sense nonsensical. Which model might you be asking about?

In general people just fit a model containing any nuisance variables that may affect the gene expression, and call it good. If you really care, you can test all of the nuisance variables you are including in your model and drop those that aren't 'significant enough', where by that I mean those variables that are either not significant for any gene, or for only a small subset of genes, when testing using a likelihood ratio test.

The downside of including too many nuisance variables is that you may be wasting degrees of freedom and reducing power, or that you may be including variables that are not orthogonal, which can be problematic. Otherwise an overspecified model isn't that big of a deal.

ADD REPLY • link 6.0 years ago James W. MacDonald 65k

0

Entering edit mode

Yes, I know that a different model is tested for each gene considered, but I am testing two different factors and I would like to know which one fits the expression better and I cannot find any Akaike implementation or bayesian information criterion (I cannot rely only in the wald test of the factors because the models aren't nested).

ADD REPLY • link 6.0 years ago ceboral • 0

score 0 · Answer 1 · 2017-05-22

0

Entering edit mode

Bio_Ram • 0

@bio_ram-12851

Last seen 5.6 years ago

The workflows below will give you better insights

https://www.bioconductor.org/help/workflows/rnaseqGene/

https://scilifelab.github.io/courses/rnaseq/labs/DEseq2

ADD COMMENT • link 6.9 years ago Bio_Ram • 0

0

Entering edit mode

This answer is not helpful at all. There is no where in these manuals that tells us how well the model fits the data

ADD REPLY • link 6.4 years ago rrcutler ▴ 70