Collecting unmapped RNA-seq reads
3
0
Entering edit mode
Patrick • 0
@patrick-14498
Last seen 5.3 years ago

Dear community,
I used subread-align to map RNA-seq reads to the reference genome (after index building) resulting in SAM files. Now, I am additionally interested in the unmapped reads (not aligning to the reference), but the subread-package seems to only show the number of unmapped reads and is not collecting them.

Questions:
1. Is there a way to collect unmapped reads within the subread-package
2. Otherwise, can you suggest a program able to do this?

Thanks a lot

subread read mapping unmapped reads • 3.8k views
ADD COMMENT
2
Entering edit mode
@gordon-smyth
Last seen 4 hours ago
WEHI, Melbourne, Australia

The SAM files from Subread contain all the reads, both mapped and unmapped. You can use standard samtools utiliites to extract any subset of reads you want, e.g., mapped or unmapped reads. It should be easy to figure out how to do that from a Google search or from the samtools manual.

ADD COMMENT
1
Entering edit mode
Robert Castelo ★ 3.4k
@rcastelo
Last seen 10 hours ago
Barcelona/Universitat Pompeu Fabra

I don't know about subread-align but outside BioC, the STAR aligner offers the option --outReadsUnmapped Fastx that will dump the unmapped reads to FASTQ files.

cheers,

robert.

ADD COMMENT
0
Entering edit mode
Wei Shi ★ 3.6k
@wei-shi-2183
Last seen 1 day ago
Australia/Melbourne

As Gordon said the Subread output includes both mapped and unmapped reads. It is fairly straightforward to extract unmapped reads from a SAM file. The 'RNAME' column (the 3rd column) in the SAM file contains a value '*' for each unmapped read.

ADD COMMENT

Login before adding your answer.

Traffic: 536 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6