Question: Highly uneven mapping rates and counts between samples
0
gravatar for chris86
8 months ago by
chris86380
UCL, United Kingdom
chris86380 wrote:

Hi

I have Illumina mRNA-seq samples where it seems because of low RINs (2-4) in a bunch of them compared to the others, I am getting very widely varying mapping rates (15%-70%) and therefore counts per sample (e.g. 8,000,000 mapped reads vs 40,000,000). Plus I can't really use RIN/mapping rate as a covariate because it is very confounded with a group of interest.

Is there a preferred way of analyzing this type of data? If I do the usual VST through DESEQ2 I get a cluster of samples with irregular high expression of a lot of genes, also the ones with low numbers of overall sample counts, presumably this is because of what I describe above. I was wondering if quantile normalisation would help or are there any other ideas?

I also used Salmon to quantify the data using the gc bias and validate mappings flags. Reads are 150bp.

Thanks,

Chris

normalization limma deseq2 • 163 views
ADD COMMENTlink modified 8 months ago by Aaron Lun25k • written 8 months ago by chris86380
Answer: Highly uneven mapping rates and counts between samples
0
gravatar for Aaron Lun
8 months ago by
Aaron Lun25k
Cambridge, United Kingdom
Aaron Lun25k wrote:

I'll answer from the limma side. There's a variety of possibilities:

  • Using voomWithQualityWeights() will help if the reduction in sample quality manifests as increased variation without introducing bias. This will downweight the contribution of the affected samples to the variance estimate.
  • If bias is introduced in a systematic manner, normalization may help. For example, normalizeCyclicLoess() will remove trended biases with respect to abundance. Packages like EDASeq use additional covariates to correct for other effects like GC content, gene length, etc., which may be helpful if the library preparations are so variable across samples. I feel quantile normalization is too aggressive for my tastes, I would not expect the same distribution from two different conditions.
  • If bias is introduced in a gene-specific manner, and quality is confounded with your group of interest... you're stuffed.
ADD COMMENTlink modified 8 months ago • written 8 months ago by Aaron Lun25k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 250 users visited in the last hour