Dear List,
I have xenium dataset or an experiment and control (n=3). I want to perform a differential expression pseudobulk analysis agnostic to location and cell type. Do I use a method originally designed for microarrays (limma) or one originally designed for RNASeq (edgeR, Deseq2, or voom). I would especially appreciate Gordon Smyth's input on this Issue.
Thanks and best wishes,
Rich
Richard Friedman
Hebert Irving Comprehensive Cancer Center
Columbia University Irving Medical Center
It's digital counts, not intensities, so naturally you would go with the RNA-seq methods.
Dear ATpoint,
Thank you. That is also my understanding. A colleague thinks otherwise and specifically asked me to ask Gordon Smyth. I am asking through the forum so that there is a permanent record for anyone else who asks.
Best wishes,
Rich
Would you share the rationale of your colleague for the sake of discussion?
He is of the opinion that the negative binomial model is appropriate to RNASeq because of sequencing and that since Xenium uses probes it is more like microarrays. This is as best as I understand his reasoning. He suggested that I ask Gordon to resolve the disagreement.
The fact that probes are used is not the deciding factor, it is how the actual gene expression is measured. Arrays hybridize probes, indeed, but they read the attached fluorescence molecules as intensities, not counts. Probes itself can and do produce counts, for example as in targeted DNA sequencing, the relatively recent 10x probe-based Flex assays, or Xenium. So it is counts, not intensities, regardless whether this is obtained by probe hybridization, mRNA capture or other methods.
ATpoint,
Thank you. That is also my understanding. I will forward your reply to my colleague.
Best wishes,
Rich