missing reads in alignerBAM
1
0
Entering edit mode
@danielbernerunibasch-4268
Last seen 3.2 years ago
hi list When I align the N reads contained in my Illumina fastq using NovoAlign, convert the resulting SAM output to BAM using Samtools, and then upload the BAM file using readAligned (ShortRead), the resulting AlignedRead object contains fewer than N reads. Reconverting the BAM to SAM and counting the lines using wc -l, however, indicates there are still N reads in the BAM. Hence it appears that during upload into R, some reads are lost. The AlignedRead object, however, will contain exactly N reads when I perform the alignment using bowtie instead of NovoAlign. Any idea/comment on why reads are missing? Thanks, Daniel
Alignment convert Alignment convert • 1.0k views
ADD COMMENT
0
Entering edit mode
@martin-morgan-1513
Last seen 13 hours ago
United States
On 12/17/2010 12:55 AM, Daniel.Berner at unibas.ch wrote: > hi list > When I align the N reads contained in my Illumina fastq using NovoAlign, > convert the resulting SAM output to BAM using Samtools, and then upload > the BAM file using readAligned (ShortRead), the resulting AlignedRead > object contains fewer than N reads. Reconverting the BAM to SAM and > counting the lines using wc -l, however, indicates there are still N > reads in the BAM. Hence it appears that during upload into R, some reads > are lost. The AlignedRead object, however, will contain exactly N reads > when I perform the alignment using bowtie instead of NovoAlign. > Any idea/comment on why reads are missing? Hi Daniel -- readAligned and the AlignedRead class represents reads aligned without gaps (see ?readAligned, type="BAM"). You can use GenomicRanges::readGappedAlignments (alignment coordinates without sequence information) or Rsamtools::scanBam to input all data. Martin > Thanks, Daniel > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor -- Computational Biology Fred Hutchinson Cancer Research Center 1100 Fairview Ave. N. PO Box 19024 Seattle, WA 98109 Location: M1-B861 Telephone: 206 667-2793
ADD COMMENT

Login before adding your answer.

Traffic: 618 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6