Question: RNA-Seq and edgeR: remarkable differences between cpms and fitted values
David Rengel70
European Union

Hi, I am analyzing NovaSeq-produced RNA-Seq data with edgeR. In the analysis I look at what I call the “strain” effect. Some of the DEGs for this factor are biologically hard to explain. The attached file shows two examples. It is easy to see why they are declared differentially expressed, since the fitted values vary among strains (right-hand plots in the figure). What is hard for me to follow is how those fitted values can be obtained when the TMM-normalized CPMs do not show that behavior (left-hand plots): the CPMs are close to 0 except for one replicate of the WT strain. Thank you in advance for any help or comments on the matter. Best, David

Keymaker, my friend

1. Can you share a bit of your $samples data.frame and show us the code you used to generate the design and the fit?
2. Where did you get these fitted values from? Is it off of some $fitted.values somewhere?
Gordon Smyth38k
Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia

It doesn't make any sense to plot fitted values because they are proportional to the library sizes, which in turn vary independently of the groups or treatment conditions of interest. The reason why we recommend that you use CPMs or logCPMs for plotting purposes is that the library sizes have been divided out.
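
To make the point above concrete, here is a minimal sketch in plain Python (not R, and not edgeR code). It assumes a simplified model in which a gene's fitted count in a sample equals its relative abundance multiplied by that sample's effective library size. The library sizes, the abundance value, and the helper names `fitted_count` and `cpm_from_count` are all made up for illustration; none of them come from the data in the question.

```python
# Toy illustration: every number here is hypothetical.
# Assumption: fitted count = (abundance per million) * (effective library size) / 1e6,
# a simplified stand-in for a log-link count model with a library-size offset.

effective_lib_sizes = [10_000_000, 40_000_000, 20_000_000]  # three hypothetical samples
abundance_per_million = 2.5  # same underlying expression level in every sample


def fitted_count(abundance, lib_size):
    """Expected count on the raw-count scale for one sample."""
    return abundance * lib_size / 1e6


def cpm_from_count(count, lib_size):
    """Counts per million: divide the library size back out."""
    return count / lib_size * 1e6


fitted = [fitted_count(abundance_per_million, n) for n in effective_lib_sizes]
fitted_cpm = [cpm_from_count(m, n) for m, n in zip(fitted, effective_lib_sizes)]

print("fitted counts:", fitted)
print("same values as CPM:", fitted_cpm)
```

In this toy setup the fitted counts differ fourfold between samples even though the underlying expression level is identical, purely because the library sizes differ. Once the library size is divided out, all three samples agree, which is why CPM or logCPM values are the ones to plot.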