Hello,
I have 50 bam files that I'd like to go through. I have a list of mutations in 21 genes in those bamfiles, and I'd like to write some code to go through each gene, locate the base that is mutated, and get a readout of what's there in the tumor sample and the normal sample.
In the past I've done this manually in IGV, but I've gotten better at coding and would like to give this a try. I've used several bioconductor packages before, but never these. I've used samtools sparingly on the command line, so am somewhat familiar with it, but am no expert.
What I need help with is using rsamtools or rbamtools to extract a base from the genome. For example, I would give it chr10: 387,878,500-387,878,500, and it would give me some information. Information I could use is: 1) all the bases read at that position, 2) information on how many of each base are read at that position (IGV shows the total reads and # of reads with each base pair when you hover. This would work).
Is this possible? Thanks for your help!