Question: sample removal after clustering
gravatar for tarun2
23 months ago by
tarun20 wrote:

To the developers,


I'm a novice R user and new to the expression profiling analysis as well.

I am trying to do a differential expression analysis on a 2x2 factorial experiment (2 drought contrasting genotypes and 2 contrasting conditions) with 4 replicates per sample to have a 16 experimental units.

Before proceeding to run the differential expression, I initially did sample clustering after normalization using rlog. However, when I did the heatmap and especially the PCA with all the samples, one sample is really far from the rest with PC1 having 41% of the variance and PC2 with 25% variance. I was able to track that sample. When I removed it the heatmap and PCA improved such that the PC1 now has 76% variance and the PC2 with 15% variance.

Is it technically acceptable to remove one sample? Because reading from blogs and different resources on the DESEQ2 community, it seems that people recommend having all the samples together.

Please advise.

deseq2 rlog transformation • 292 views
ADD COMMENTlink modified 23 months ago by Michael Love26k • written 23 months ago by tarun20
Answer: sample removal after clustering
gravatar for Michael Love
23 months ago by
Michael Love26k
United States
Michael Love26k wrote:

Yes, it's a good idea to remove an outlier if you think the experiment failed in a way. It could have been a problem in the experiment, in the library preparation, or in the sequencing. I would use FASTQC to check the sequencing quality, as you may be able to avoid the problem in the future if it was library preparation or sequencing. You can perhaps contact a local bioinformatics core for helping to understand what may have gone wrong, or if everything looks correct, it could have been a problem in the experiment itself. 

But for DESeq2 analysis, I would recommend to remove the sample. 

The advice to keep all the samples together is about a separate topic: whether to include all conditions when computing dispersion values.

ADD COMMENTlink written 23 months ago by Michael Love26k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 418 users visited in the last hour