Question

Including covariates from bulkRNAseq deconvolution to look for differentially expressed genes

0

Entering edit mode

ross.smith • 0

@964c7b49

Last seen 3.0 years ago

Sweden

I would like some guidance when it comes to using DESeq2 to look for DEGs in a bulkRNA seq experiment. I have at my disposal samples that are enriched for endothelial cells from 2 tissues and at 6 developmental stages. I have 3 samples from each condition for a total of 36 samples. An analysis of the endothelial purity of the samples shows that samples from tissue A have more contamination from mural cells that tissue B samples. Uncorrected, a list of DEGs comparing A to B gives many (mostly?) hits that are mural cell genes. Therefore I would like to add a covariate for each sample with the estimated proportion of mural cells. (Here I am hesitant, as I think this has been discouraged in related threads, but I am not entirely sure).

Because I have so few samples as each timepoint, I am reluctant to only use input from a single stage in my design.

So my code now looks something like this:

dds <- DESeqDataSetFromMatrix(countData = cts,
                              colData = coldata,
                              design = ~ Mural + Tissue + Stage + Tissue:Stage)

I would like to identify DEGs when comparing Tissue A vs Tissue B at a specific stage. I think this gets down to specifying contrasts, but I am unsure about how to proceed.

It seems similar to the vignette describing a time course analysis but I don't really want to compare my developmental stages back to a reference stage. I think the code below would give me differences at a given stage but after correcting for the reference level of the stage covariate.

ddsTC <- DESeq(dds, test="LRT", reduced = ~ Mural + Tissue + Stage)

res <- results(ddsTC, name="TissueA.Stage1", test="Wald")

Is there a suggestion for how to handle this type of analysis within DESeq2 or alternatively a suggestion for a better solution?

Thank you for any guidance or advice.

-ross

DESeq2 • 832 views

ADD COMMENT • link updated 3.0 years ago by Michael Love 43k • written 3.0 years ago by ross.smith • 0

score 0 · Answer 1 · 2022-03-17

I can't dispense with statistical consulting advice here due to limits on my time, but your approach seems fine, you need to check however that Mural is not confounded with Tissue and Stage (check boxplot of Mural ~ Tissue * Stage). If it is, then there is an experimental problem that cannot be addressed by statistical methods.