Entering edit mode
I want to use deseq2 for differential abundance analysis for 16s amplicon data analysis.
data detail:
I do have three tissue:
summary(sample_data(physeq)$Tissue)
Leaf Root Soil
And each tissue has given 3 treatment.
summary(sample_data(physeq)$Treatment)
T1 T2 T3
Now I just want to use soil data and want to see the effect of treatment T1, T2 and T3 on microbial abundance in soil only.
for this how should I should I take only soil sample from physeq data and modify the below condition code.
diagdds = phyloseq_to_deseq2(kostic, ~ DIAGNOSIS)
I will be very thankful for your help.
Kind Regards Yogesh
Hi Michael ,
Thanks a lot.
From the above data, I have extracted soil samples.I need your help to design the condition for deseq2:
here is the summary of data, I have soil sample collection from different villages from two different regions:
and we have given three treatment to each soil sample in each village: TREATMENT T1, T2 AND T3, now my aim is to find the differential abundance of the microbial community in T1 and T2 and T3 treatment condition:
T1 T2 T3 54 54 54
should I import data to deseq2 like this: or I need to change something
I am very thankful for your help.
You haven't followed my advice above. Have you read the section I referred to? I'm confused.
Dear Michael,
I read it, but I am much more interested to find treatment (T1, T2 and T3) effect on soil, root and shoot tissue separately so I extract all soil sample, root and shoot in separate files. and now wanted to proceed these file independently so that treatment effect can be analysed on microbial abundance. these samples also collected from different villages which located in two different regions (Mymensing and Rajshahi).
if still, you think I should get all tissue in one file and combine tissue and treatment and go forward:
then should I modify above code like this:
Thanks Yogesh
The analysis choices are up to you. This forum is for questions about the software. I'm saying (and the vignette says) if you want to extract treatment effects per tissue, you can combine tissue and treatment and use
~group
. Then make comparisons between treatment within the same tissue using e.g.contrasts=c("group","tissueX.treatmentA","tissueX.treatment0")
.Dear Michael,
Thanks for your all help and time. I will run the analysis both way, it will help me to understand deseq2 more.
as I mention I have extracted soil samples and want to see the treatment effect T1, T2, T3.
I run deseq2 code like this but I am getting some messege:
could please suggest how can resolve it.
Thanks Yogesh
Usually these messages can be ignored when they involve one or two genes and usually these genes have some outlier counts.
Dear Michael,
Thanks for your all help and time. I will run the analysis both way, it will help me to understand deseq2 more.
as I mention I have extracted soil samples and want to see the treatment effect T1, T2, T3.
I run deseq2 code like this but I am getting some messege:
could please suggest how can resolve it.
Thanks Yogesh
Hi Michael,
I am able to generate plot from deseq2 analysis using below command.
But I am not sure whether it is T3vsT1 OR T2vsT1.
could you please suggest how I can improve this analysis more for publication,
the aim is to find which microbial species or genus are more abundant in Treatment T1 and Treatment T3 in comparison to T2 and how T3 microbial abundance is different from T1 Treatment.
I will be very thankful for your time and kind support.
Here is all code.
out of 62301 with nonzero total read count adjusted p-value < 0.1 LFC > 0 (up) : 32, 0.051% LFC < 0 (down) : 36, 0.058% outliers [1] : 3939, 6.3% low counts [2] : 47758, 77% (mean count < 37) [1] see 'cooksCutoff' argument of ?results [2] see 'independentFiltering' argument of ?results
[1] "mean of normalized counts for all samples" [2] "log2 fold change (MLE): Treatment T3 vs T1" [3] "standard error: Treatment T3 vs T1"
[4] "Wald statistic: Treatment T3 vs T1"
[5] "Wald test p-value: Treatment T3 vs T1"
[6] "BH adjusted p-values"
Thanks Yogesh
It looks like you are not following my advice above about using contrast. Please read over the documentation about using “contrast” in the vignette and in
?results
Dear Micheal,
you suggesting me to use group for deseq analysis,
How I should import data first in deseq2 from phyloseq and then how to run deseq2
if I am just import data without design it says design argument is missing.
Kind Regards Yogesh
You can use ~1 as long as you change the design later to the appropriate one with
design(dds) <- ...
Dear Micheal,
thanks a lot,
converting counts to integer mode
do I also need to mention Treatment T2 as reference level or I should run further this command or I can use later contrast to extract the result as you mention
Kind Regards Yogesh
Hi Michael,
I am getting data like this for all comparision, but it seems there was some problem:
I'm not sure if DESeq2 is an appropriate software here. Why don't you try a different method? Generally, I don't work in metagenomics and I'm skeptical of the idea of global normalization when there is no concept of "housekeeping genes". It works well in bulk RNA-seq, but I don't see why it should work across samples that share no taxa.