Search
Question: Using DESeq normalized gene count to replace FPKM?
0
gravatar for Jike Cui
5.1 years ago by
Jike Cui10
Jike Cui10 wrote:
hi, A few papers have concluded that DESeq is more accurate for DE genes discovery than methods using FPKM, and that the bias in FPKM is that a gene’s FPKM depends on the expression of other genes due to the division by library size. Now if my purpose is for visualization or analysis other than looking for DEGs, I wonder if it’s better to replace FPKM by DESeq normalized gene count divided by gene length? Thanks for your comment! Jack [[alternative HTML version deleted]]
ADD COMMENTlink modified 5.1 years ago by Simon Anders3.5k • written 5.1 years ago by Jike Cui10
0
gravatar for Simon Anders
5.1 years ago by
Simon Anders3.5k
Zentrum für Molekularbiologie, Universität Heidelberg
Simon Anders3.5k wrote:
Hi Jack On 23/07/13 17:29, Jike Cui wrote: > A few papers have concluded that DESeq is more accurate for DE genes > discovery than methods using FPKM, and that the bias in FPKM is that a > gene?s FPKM depends on the expression of other genes due to the division by > library size. > > Now if my purpose is for visualization or analysis other than looking for > DEGs, I wonder if it?s better to replace FPKM by DESeq normalized gene > count divided by gene length? Yes, definitely. Look at it this way: To account for sequencing depth, you divide the raw counts by a number which quantifies this depth. Simply using the total number of reads (divided by 1 million) is an obvious but very simplistic choice, and the various other scaling normalization schemes (our median-of-ratios approach from DESeq, but also other similar suggestions such as TMM, etc.) are simply meant to suggest a more clever way to find a number to divide by. In case of DESeq, we try to get this numbers to be close to one. If you want to have the same scale as typical FPKM values (and so have better comparability across experiments), you could then divide everything by something like geometric mean of the total read counts of all samples / 1 million You may want to look, though, also at the variance-stabilizing transformation (VST) and the regularized log transformation (rlog) that we offer in DESeq2, and which, we feel, offers a better input for downstream visualization. Simon
ADD COMMENTlink written 5.1 years ago by Simon Anders3.5k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.2.0
Traffic: 287 users visited in the last hour