Question: How to adjust for different cell type mixtures in differential expression analysis?
aec wrote, 4 months ago:

Dear all, 

I computed enrichment scores for 64 cell types from my bulk RNA-seq samples using xCell. I would now like to test for differential expression across 3 groups (control, case1, case2) while adjusting for differences in cell type composition, with the scores entered as continuous covariates. I was thinking of including only the most variable cell types across samples (fewer than 10). Is it really necessary to cut the continuous variables into bins, as the DESeq2 FAQ suggests?

Would this model be enough?

Ryan C. Thompson (The Scripps Research Institute, La Jolla, CA) wrote, 4 months ago:

As I understand it, one problem with putting the cell type compositions directly into the model as numeric covariates is that the scales are incompatible: cell fractions are expected to have a linear relationship with gene abundance, whereas the model coefficients are fit on a log scale. You might be better off using SVA to estimate surrogate variables that can account for cell type composition as well as other sources of systematic variation. Surrogate variables estimated by SVA are on the correct scale to add directly to the design matrix. SVA can also estimate the number of surrogate variables to include, so you don't have to choose it by hand. This isn't too different from what you've already done, since your cell type compositions were also estimated from the data.
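
To make the scale mismatch concrete, here is a minimal sketch in plain Python (no DESeq2 or SVA involved). It assumes a toy mixture of two cell types with made-up expression levels for a single gene; the names `EXPR_TYPE_A`, `EXPR_TYPE_B` and `bulk_expression` are purely illustrative. The bulk signal changes by a constant amount per step in cell fraction on the linear scale, but the per-step change on the log2 scale is far from constant, which is what a single log-scale coefficient for the fraction would implicitly assume.

```python
import math

# Hypothetical expression of one gene in each pure cell type
# (arbitrary linear units; these numbers are made up for illustration).
EXPR_TYPE_A = 1000.0
EXPR_TYPE_B = 10.0


def bulk_expression(frac_a):
    """Bulk signal of a two-cell-type mixture; linear in the fraction of type A."""
    return frac_a * EXPR_TYPE_A + (1.0 - frac_a) * EXPR_TYPE_B


fractions = [0.0, 0.25, 0.5, 0.75, 1.0]
linear = [bulk_expression(f) for f in fractions]
log2_expr = [math.log2(x) for x in linear]

# Change in expression for each 0.25 increase in the fraction of type A.
linear_steps = [b - a for a, b in zip(linear, linear[1:])]
log2_steps = [b - a for a, b in zip(log2_expr, log2_expr[1:])]

for f, x, lx in zip(fractions, linear, log2_expr):
    print(f"fraction={f:.2f}  expression={x:8.1f}  log2={lx:6.3f}")

# A covariate with one log-scale coefficient assumes the log2 steps are equal;
# here they shrink as the fraction grows, while the linear steps stay constant.
print("linear steps:", linear_steps)
print("log2 steps:  ", [round(s, 3) for s in log2_steps])
```

This is only a toy calculation, not a statement about how any particular package fits its model; it just shows why a covariate that acts linearly on abundance does not translate into a constant effect on the log scale.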


Ok Ryan, thanks!

written 3 months ago by aec