deseq2 normalized counts as an input to deseq2
1
0
Entering edit mode
Elif • 0
@195598f3
Last seen 22 months ago
Turkey

I am currently using deseq2 package to normalize my data and after that I will use it to calculate p-values between my conditions. I have a question about after normalization step. I normalized my RNA-seq data and I used this normalized data to find outliers in my data, I used PCA analysis. There are some samples that will be removed. But when checked the code I see that normalization step is inside the deseq function and I can not replace my new data (the data which is normalized previously but outliers removed) to the deseq function. Obviously it seems that, I will remove the outliers and renormalized my data to calculate p-values.

when I check some manuals I saw this:

"NOTE: DESeq2 doesn’t actually use normalized counts, rather it uses the raw counts and models the normalization inside the Generalized Linear Model (GLM). These normalized counts will be useful for downstream visualization of results, but cannot be used as input to DESeq2 or any other tools that peform differential expression analysis which use the negative binomial model." in here: https://hbctraining.github.io/DGE_workshop/lessons/02_DGE_count_normalization.html

I want to be sure that there is anything I can do to replace my data into deseq object. If there is a way to do it, can you explain to me please?

DESeq2 • 2.8k views
ADD COMMENT
0
Entering edit mode
@mikelove
Last seen 5 hours ago
United States

You should definitely provide original data and not normalized to DESeq(). Maybe see our workflow ("rnaseqGene") if you want to get an idea of a typical pipeline.

ADD COMMENT
0
Entering edit mode

Hi Michael, thank you. I will chek it asap. But I think I couldn't explain my problem clearly, I wonder how can we detect and extract the outliers in deseq2 packages? Because it takes only normalized data and does not let me rearrange them.

ADD REPLY
0
Entering edit mode

How to detect outliers? We recommend looking at PCA plots in the workflow.

Does not let me rearrange them

I don't follow what you are trying to do that isn't possible from the workflow.

ADD REPLY
0
Entering edit mode

okey, lets say if I check the PCA plots within or without (in matlab for example) the workflow and I detect some samples, not genes, are outliers, and ı want to remove the "normalized value" of these outliers before calculating pvalues with deseq package. ı wonder if I can change the normalized values within the deseq? I can extract them with "normalized = counts(ddssize, normalized = TRUE)" this code, but can I insert a changed normalized values into the ddssize object?

ADD REPLY
1
Entering edit mode

If you identify samples to remove you can do, e.g.:

idx <- c(1,3,5) # failed samples
dds <- dds[,-idx]
dds <- DESeq(dds)
ADD REPLY
0
Entering edit mode

okey, thank you very much. I will try it.

ADD REPLY
0
Entering edit mode

Hi Michael,

I have a similar question regarding the normalization of my RNAseq data.

I used VSN normalization on pre-filtered data as follows;

 `ntd <- normTransform(dds_TB_filter)`

Then made some heat maps and PCAs on these normalized data, and performed DEseq2 on the filtered data to see differences between my study groups.

 dds_TB_DES_fil <-DESeq(dds_TB_filter)

However, one of my colleagues pointed out that DE analyses that have been performed are not using normalized data. Could you please clarify the fact as to why DESeq() works on the original data and not on the normalized data?

Thanks!

ADD REPLY
0
Entering edit mode

That is not VSN, just log of scaled counts.

VST is accomplished with

vsd <- vst(dds)

They are similar for high counts.

The question about why to perform analysis on original counts has been addressed on the support site before, and in the workflow:

https://bioconductor.org/packages/release/workflows/vignettes/rnaseqGene/inst/doc/rnaseqGene.html#exploratory-analysis-and-visualization

The original counts contain information on the precision of the measurements of abundance, due to the sampling process that generates the observations.

ADD REPLY

Login before adding your answer.

Traffic: 754 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6