Question

RNAseq outlier is a critical sample

0

Entering edit mode

grastalt27 • 0

@grastalt27-10859

Last seen 8.7 years ago

Hi Everyone,

I'm trying to analyze some RNAseq results, but one of my samples is a pretty bad outlier by PCA and by clustering over the entire transcriptome.

I have 4 groups with 3 biological replicates. These samples were run in 2 batches.

When I try to summarize my reads using RSubread's featureCounts, the outlier has a very low assignment %, with a high % of multiple assignments

My question is how should I proceed with my analysis? I don't have enough replicates to kick the outlier out. Are there methods to fix outliers? Is it valid to consider this outlier as a separate batch (Removing the variation with removeBatchEffect)?

Thank you!

rnaseq outlier statistics batch effect • 1.8k views

ADD COMMENT • link updated 8.8 years ago by Gordon Smyth 52k • written 8.8 years ago by grastalt27 • 0

score 1 · Answer 1 · 2016-06-08

1

Entering edit mode

Steve Lianoglou ★ 13k

@steve-lianoglou-2771

Last seen 2.0 years ago

United States

voomWithQualityWeights to the rescue!

... in the limma package (in case you weren't aware).

ADD COMMENT • link 8.8 years ago Steve Lianoglou ★ 13k

score 1 · Answer 2 · 2016-06-09

1

Entering edit mode

Gordon Smyth 52k

@gordon-smyth

Last seen 34 minutes ago

WEHI, Melbourne, Australia

No, it isn't valid to consider an outlier as a separate batch.

There are only two possibilities: down-weight the outlier using the appropriate functions in limma (as suggested by Steve) or throw the sample out (as suggested by Dario).

ADD COMMENT • link 8.8 years ago Gordon Smyth 52k

score 0 · Answer 3 · 2016-06-09

0

Entering edit mode

Dario Strbenac ★ 1.6k

@dario-strbenac-5916

Last seen 5 days ago

Australia

Since you have three replicates in each group, you should just exclude the unusual sample from the analysis. You don't need balanced sample sizes in every experimental group.

ADD COMMENT • link 8.8 years ago Dario Strbenac ★ 1.6k