I've been using DESeq2 in Phyloseq (see this tutorial) for differential abundance testing in a microbiome data set i.e. OTUs, not genes.
The default multiple testing correction applied in the 'results' function is Benjamini-Hochberg. However, OTUs are not independent. There is information available about the specific structure of their relations in the form of a phylogenetic tree.
The package structSSI outlines a hierarchical procedure for multiple testing correction that takes this specific structure of the hypotheses into account.
I'd like to use this hierarchical FDR correction for DESeq results. I understand that running DESeq involves 3 functions.
dds <- estimateSizeFactors(dds) dds <- estimateDispersions(dds) dds <- nbinomWaldTest(dds)
The first two can be run once for the entire dataset (at the OTU level). I then want to be able to fit negative binomial models for the combined abundance of all of the children of each node in the phylogenetic tree somehow using the information from the dispersion fit.
Is there a way to extract the dispersion fit calculated on all the species for use in the model fitting, given that this will be on a new DESeq dataset containing the combined abundances of lots of OTUs?
Thanks,
Liam
Hi Michael,
Thanks for your reply.
I had read that section of the vignette and accessed the individual estimates of dispersion and the priors. My question about the 'fit' was because I'm still unclear about how the dispersion prior is used in the fitting - very much not a statistician - but I'll keep reading the vignette until it makes sense.
Best,
Liam