Question

DEseq2 dispersion plot

0

Entering edit mode

Emma • 0

@863878c0

Last seen 7 weeks ago

Germany

Hi all, I'm currently using DESeq2 (version 1.38.3) for my analysis and encountered a dispersion plot that appears quite different from the typical dispersion plot presented in the DESeq tutorial. The plot I obtained looks like this:

enter image description here

I would greatly appreciate any insights or explanations as to why it appears this way. Here are some additional details that might be relevant: I have two different groups, each consisting of 11 individuals. After RNAseq, I used Trinity for assembly and Salmon for obtaining expression read counts. The quant.sf files were loaded in via tximport.

Thank you in advanced!

R code:

dirs <- list.files("first_batch/first_batch_salmon_trinity_full/", "Sample")
files <- file.path("first_batch/first_batch_salmon_trinity_full/", dirs, "quant.sf")
names(files) <- dirs 

######### gene level
tx2gene <- read_delim(file.path("first_batch/first_batch_salmon_trinity_full/", 
                                "Trinity.fasta.gene_trans_map"),
                      col_names = FALSE) %>% 
    dplyr::select(X2, X1)
names(tx2gene) <- c("transcript_IDs", "gene_IDs")
txi.salmon.g <- tximport(files, type = "salmon", tx2gene=tx2gene)

######### meta information
samples <- read_delim("first_batch/meta.txt", col_names = FALSE) %>% 
    dplyr::select(c(2, 3, 4))
names(samples) <- c('sampleID', 'color', 'sex') 
samples_reorder <- samples[match(dirs, samples$sampleID), ]
samples_reorder$color <- factor(samples_reorder$color)
samples_reorder$color <- relevel(samples_reorder$color, ref = 'B')

all(samples_reorder$sampleID == colnames(txi.salmon.g$counts))

dds_g <- DESeqDataSetFromTximport(txi.salmon.g, samples_reorder, ~color)
keep <- rowSums(counts(dds_g) >= 10 ) >= 11 
dds_g <- dds_g[keep,]
dds_g <- DESeq(dds_g)
plotDispEsts(dds_g, main="Dispersion plot")

DESeq2 • 258 views

ADD COMMENT • link updated 7 weeks ago by Michael Love 41k • written 8 weeks ago by Emma • 0

score 0 · Answer 1 · 2024-03-05

0

Entering edit mode

Michael Love 41k

@mikelove

Last seen 12 hours ago

United States

"why it appears this way"

This plot looks fine to me. The blue points are near the black points because you have a lot of information (samples) for computing dispersion per gene and the prior is doing less work.

ADD COMMENT • link 7 weeks ago Michael Love 41k