Question

Dose-response study with DESeq2 and no dose replicates

0

Entering edit mode

rowcyclecamp • 0

@rowcyclecamp-16750

Last seen 4.7 years ago

United Kingdom

Hi,

I would like to know what are the limits of analysis for the following dataset I have been given:

1 cell population subjected to the following conditions:

Untreated
Drug vehicle
Drug A 1uM
Drug A 5uM
Drug A 10uM
Drug B 1uM
Drug B 5uM
Drug B 10uM

(There is an additional control cell population sample, which was not subjected to any of the above conditions (apart from untreated)).

To restate, there are no biological replicates of any condition (they are all from 1 patient). I understand that because of this, I am limited in the statistical comparisons I can use. This is what I am doing currently:

I have plotted the samples on a heatmap of euclidean distance to show that Drug A samples cluster separately from everything else (we do not expect Drug B to have an effect).
I have generated a table of genes differentially expressed between grouped Drug A and Drug B samples (ignoring dose).
I am generating a table of transformed count values for particular genes of interest for each dose of each drug, so we can at least look at whether each gene has a pattern of dose dependency (not applying any statistical test).

I am keen to know if there is something more I can do with this dataset. Would I be valid in performing a likelihood ratio test to look for genes which show a drug-dependent effect with dose, such as suggested by DESeq2 timecourse - How to set the experimental design?

Many thanks

deseq2 • 2.1k views

ADD COMMENT • link updated 7.4 years ago by Michael Love 43k • written 7.4 years ago by rowcyclecamp • 0

Michael Love · Answer 1 · 2018-08-02

0

Entering edit mode

Michael Love 43k

@mikelove

Last seen 8 days ago

United States

Yes you can assume some smooth function of dose, and then gain degrees of freedom, but the cost is that you need to have a good idea what the smooth function looks like, and the results will depend heavily on your choice. If you do ~ drug + dose + drug:dose, that assumes a linear increase with dose on log gene expression (so exponential on raw expression), assuming you code dose as a numeric (0,1,5,10). Also you have to figure out how to make use of untreated and drug vehicle, how should these two samples be used to set the baseline. This isn't a mathematical issue, but more of a biological one, what is the meaning of these two samples relative to the ones with non-zero dosage.

ADD COMMENT • link 7.4 years ago Michael Love 43k

0

Entering edit mode

Thank you for your help. I have talked to the investigators.

For looking at dose-dependent changes:

We expect the log10 of the drug doses to have a linear relationship to gene expression over this range.

If I understand correctly, I could rewrite the numeric doses (1, 5, 10) as log10 values (0, 0.69897, 1) which would then assume a linear increase with log10 dose on log2 gene expression. Should I be looking at raw expression here, however? Can you point me in the right direction as to how best to write this expression?

Baseline/vehicle

The vehicle is used to deliver drugs A and B. It is present at the same concentration in the 'drug vehicle' sample and in all drug A and B samples. I would therefore not be including it in the dose-dependent linear model. Given there are no replicates of the 'drug vehicle' and 'untreated' samples, is the only way to compare these 2 samples to look for genes with large fold changes in expression?

Many thanks

ADD REPLY • link updated 7.4 years ago by Michael Love 43k • written 7.4 years ago by rowcyclecamp • 0

0

Entering edit mode

log10 is a problem because you have a dose=0. You want to have a scale such that you can include a numeric value for dose for 0,1,5,10.

I don't have any particular advice about what functional form to use, the simplest to code would be linear changes in log expression, but this is a statistical and biological design choice of the analyst, and goes beyond what I can offer in terms of software support.

If you pick either untreated or vehicle to be the baseline, you can use a design of ~drug:dose. You should code the untreated baseline as dose=0, drug="A", though it won't make a difference - it will be the baseline for both drugs.

ADD REPLY • link 7.4 years ago Michael Love 43k