Counting ambiguously mapped reads per feature
2
0
Entering edit mode
rubi ▴ 110
@rubi-6462
Last seen 5.7 years ago

Hi,

I have a bam file in which ~50% of the reads map ambiguously (since the library is from small RNAs). For each feature (e.g., gene) in my GTF file I'd like to add a count of 1 if a read from that bam file maps to that feature uniquely and 1/n if a read maps to n locations (that feature being one of them).

Is there any Bioconductor code that can achieve that?

 

rnaseq bam genomicranges GenomicAlignments Rsamtools • 1.4k views
ADD COMMENT
3
Entering edit mode
@gordon-smyth
Last seen 1 hour ago
WEHI, Melbourne, Australia

Rsubread::featureCounts() with countMultiMappingReads=TRUE and fraction=TRUE does exactly that.

ADD COMMENT
3
Entering edit mode
@steve-lianoglou-2771
Last seen 14 months ago
United States

You might try using tools like salmon, kallisto, or RSEM, which provide a more principled (and community-vetted) approach to solving this exact issue.

ADD COMMENT
2
Entering edit mode

Here are my Bioc2016 slides on using these quantifiers with Bioconductor software packages:

http://mikelove.github.io/bioc2016

In particular, see slide 4 for an idea how you can do better than assigning 1/n to all of the potential alignment locations.

ADD REPLY

Login before adding your answer.

Traffic: 1050 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6