Search
Question: Log2 fold change value in DESeq2
0
11 months ago by
ambika.pokhrel530 wrote:

Hello everyone,

I am quite confused regarding the Log 2 fold change calculations. I noticed that it gives NA when the gene expression is zero across samples. But how does it calculate LFC value if the expression is 0 in control and higher expression in other treated samples? Does it gives NA to those expression too?  Because those might be the transcripts I am actually interested in.

Thank you,

Ambika

modified 11 months ago by Michael Love19k • written 11 months ago by ambika.pokhrel530
0
11 months ago by
Michael Love19k
United States
Michael Love19k wrote:

Using the latest version of DESeq2 (v1.16), the maximum likelihood estimate of the LFC will be something like log2 of the mean of normalized counts in the group with positive counts. We include a threshold on how low the expected value of the counts can go, which stabilizes the methods and prevents the LFC from going to +/- infinity. So the MLE LFCs can grow very large, e.g. if the group has a mean of 1024, you would get an LFC close to 10.

We recommend using shrinkage after running DESeq() to produce LFCs that are more suitable for ranking or visualization, e.g.:

res <- lfcShrink(dds, coef=2, res=res)

Then the LFC is a posterior estimate, given the counts for that gene, and the distribution of observed LFCs from the experiment. This "moderates" or pulls in the noisy LFC estimates from genes that have a lot of variability or low counts. You can read more about this in the DESeq2 paper:

https://genomebiology.biomedcentral.com/articles/10.1186/s13059-014-0550-8