filter and trim large fasta/bam files

0

Entering edit mode

achaillon • 0

@achaillon-14117

Last seen 4.2 years ago

Hi

I am analyzing deep sequencing data and I would like to manipulate these large data (>100,000 reads - can be either fasta or bam format) to do the followings:

#1 - Exclude primer sequences (short strings of 25-30 nt)

e.g. if I want to exclude all the match 'CAAACTCAAATCTAATCTAACCAAAAAAAC' and 'CAACCTTTTAATCTAACCAAAAAAAC'

#2 - Filter out the short reads (< a 100 bp)?

#3 - And finally exclude reverse oriented sequences?

I am using outside R tools (samtools) but it would be great to have all running in R...

thanks in advance!

a

deep sequencing fasta bam trim • 1.3k views

ADD COMMENT • link 7.5 years ago achaillon • 0

Login before adding your answer.

Similar Posts

Loading Similar Posts

Traffic: 571 users visited in the last hour

Content Search
Users
Tags
Badges

Help About
FAQ

Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the

version 2.3.6