Whole exome: BAM statistics and other relevant information
1
0
Entering edit mode
bioinf ▴ 10
@bioinf-12080
Last seen 15 days ago
United States

Hi, 

I have paired-end bam files (exomes) for two groups each with multiple individuals (all from one species, non-viral). I am interested to get information as mentioned here http://www.ark-genomics.org/bioinformatics/virome  Would it be possible to do with `Rsamtools`? I am in the process of learning `Rsamtools` and getting stuck with how to process all files together to get final output. My aim is to retrieve genes and SNPs from this species as compared to reference (with same genus). Kindly guide.

Thanks!

rsamtools whole exome sequence bam exome R • 816 views
ADD COMMENT
1
Entering edit mode

Welcome! Can you ask a more specific question? Also, depending on what you are interested in, the GenomicAlignments (for a little more straight-forward data output data structures) package and GenomicFiles (for working with collections of BAM files) packages might be worth a look. Be sure to check out the vignettes.

ADD REPLY
0
Entering edit mode

Thanks! I want to retrieve different parameters from bam files such as which chromosome from each file has most coverage, to look for specific genes etc. and then their plots as shown here http://www.ark-genomics.org/bioinformatics/virome

ADD REPLY
1
Entering edit mode
@martin-morgan-1513
Last seen 3 days ago
United States

The function idxstatsBam() summarized reads per sequence name quickly. I would not call this statistic 'coverage', but it would be sufficient for general QA or visualization purposes.

The rest of your question is too vague -- there is a ton of information at the URL you provide, and there is no magic bullet to produce all of it in a single command.

ADD COMMENT
0
Entering edit mode

Thanks! I can understand it is not the work of a single command but would it be possible with Rsamtools and in addition any other tools would you suggest (I am a beginner in exome analysis and there are so many tools available with no proper pipeline and  I want to use R-based).

ADD REPLY
1
Entering edit mode

You have to figure out more precise questions to ask. I"ve mentioned the tools for very general analysis -- GenomicAlignments, GenomicFiles

ADD REPLY
0
Entering edit mode

Ok, thanks.

ADD REPLY

Login before adding your answer.

Traffic: 283 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6