DNA sequence analysis question
2
0
Entering edit mode
andigoni ▴ 10
@andigoni-5163
Last seen 9.6 years ago
Is there any package for motif finding in DNA sequences that uses consensus matrices as input? In addition, I need to extract genomic features e.g. exons, intron, utr, splice sites etc. given the genomic coordinates for multiple sequences. Is there any package for such analyses? Thanks in advance!!
• 907 views
ADD COMMENT
0
Entering edit mode
Paul Shannon ▴ 750
@paul-shannon-5161
Last seen 9.6 years ago
Your questions may need a longer and more detailed reply, but let me start with these comments. On Mar 14, 2012, at 8:25 AM, andigoni wrote: > > Is there any package for motif finding in DNA sequences that uses consensus matrices as input? BiocViews 'SequenceMatching' (http://www.bioconductor.org/packages/r elease/BiocViews.html#___SequenceMatching) mentions 'cosmo', about which the author (Oliver Benbom) says: Cosmo searches a set of unaligned DNA sequences for a shared motif that may, for example, represent a common transcription factor binding site. The algorithm is similar to MEME, but also allows the user to specify a set of constraints that the position weight matrix of the unknown motif must satisfy. Such constraints may include bounds on the information content across certain regions of the unknown motif, for example, and can often be formulated on the basis of prior knowledge about the structure of the transcription factor in question. The unknown motif width, the distribution of motif occurrences (OOPS,ZOOPS, or TCM), as well as the appropriate constraint set can be selected data-adaptively. > > In addition, I need to extract genomic features e.g. exons, intron, utr, splice sites etc. given the genomic coordinates for multiple sequences. Is there any package for such analyses? Just this week I have used GenomicFeatures and TxDb.Hsapiens.UCSC.hg19.knownGene to extract promoter sequence. Is this the sort of thing you want to do? - Paul > > > Thanks in advance!! > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
ADD COMMENT
0
Entering edit mode
@steve-lianoglou-2771
Last seen 13 months ago
United States
Hi, On Wed, Mar 14, 2012 at 11:25 AM, andigoni <andigoni at="" gmail.com=""> wrote: > > Is there any package for motif finding in DNA sequences that uses consensus > matrices as input? Does the `matchPWM` function in the Biostrings package do what you're looking for? > In addition, I need to extract genomic features e.g. exons, intron, utr, > splice sites etc. given the genomic coordinates for multiple sequences. Is > there any package for such analyses? Can you explain this a bit more? Do you want to, say, use a GRanges object to query what "type" of genomic annotation is there (intron, cds, utr, etc)? If that's the case, at the previous bioc workshop in Feb, it seemed like several people hacked up their own way to do things like that. I believe Valerie has put something "more official" together somewhere in her "VariantAnnotation" package that you might like to use. See section 3 (Variant location) in the VriantAnnotation pdf: http://www.bioconductor.org/packages/devel/bioc/vignettes/VariantAnnot ation/inst/doc/VariantAnnotation.pdf HTH, -steve -- Steve Lianoglou Graduate Student: Computational Systems Biology ?| Memorial Sloan-Kettering Cancer Center ?| Weill Medical College of Cornell University Contact Info: http://cbio.mskcc.org/~lianos/contact
ADD COMMENT

Login before adding your answer.

Traffic: 960 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6