occurrence of rGADEM motifs (mattia pelizzola)

0

Entering edit mode

Mercier Eloi ▴ 60

@mercier-eloi-4685

Last seen 9.6 years ago

You also can us the exportAsRangedData function from the package MotIV on the rGADEM object. It create a RangedData object with the exact position of the motif (start and end corrected by the "pos" as explained by Chris). See ?exportAsRangedData for example. Eloi Sent from my HTC ----- Reply message ----- From: "Chris Whelan" <whelanch@ohsu.edu> Date: Wed, Jul 27, 2011 10:27 am Subject: [BioC] occurrence of rGADEM motifs (mattia pelizzola) To: <bioconductor@r-project.org> Hi Mattia, I just had to figure out how to access the alignment locations returned by rGADEM also. The object returned by GADEM() contains a list of "motif" objects which then have a slot "alignList" that has a list of "align" objects. To pull out the locations for the second motif, for example, this worked for me: >chrs <- sapply(gadem[[2]]@alignList, slot, 'chr') >starts <- sapply(gadem[[2]]@alignList, slot, 'start') >ends <- sapply(gadem[[2]]@alignList, slot, 'end') >positions <- sapply(gadem[[2]]@alignList, slot, 'pos') >locations <- cbind(chrs, starts,ends,positions) > head(locations) chrs starts ends pos [1,] "chr3" "11250871" "11251624" "167" [2,] "chr7" "2975746" "2976319" "412" [3,] "chr7" "129587981" "129588370" "140" [4,] "chrX" "18735991" "18736550" "457" [5,] "chr1" "40002871" "40003399" "232" [6,] "chr1" "175910829" "175911459" "502" I believe that starts and ends are the coordinates of the original search regions you gave to GADEM and then "pos" is the offset location within that region of the motif. Hope that helps - if someone knows better, please correct me. Chris > Message: 4 > Date: Tue, 26 Jul 2011 12:51:33 +0200 > From: mattia pelizzola <mattia.pelizzola@gmail.com> > To: bioconductor <bioconductor@stat.math.ethz.ch> > Subject: [BioC] occurrence of rGADEM motifs > Message-ID: > Â Â Â Â <cag10-br939ye_+gss13l6zqhozuzcat7h_8ursk0fd9wi- 7kxq@mail.gmail.com=""> > Content-Type: text/plain > > Hi, > I am using rGADEM and MotIV to find out enriched motifs in my ChIPseq peaks > and determine the similarity with Jaspar TFBS. These tools look very useful! > > rGADEM provides a list of enriched motifs. The total number of motifs is > provided by the nOccurrences function, but I can't find a way to get to know > which peak regions do contain these motifs. In particular, what are the > startPos and endPos functions supposed to do? I would expect a set of > genomic positions (or positions relative to the peak regions) with the same > length as nOccurrences, but I only get one number for each motif, with no > chromosome associated. > Even in the rGADEM vignette you have nOccurrences equal to 60 but then you > get only one number out of the startPos and endPos functions. Am I missing > or misunderstanding anything? > > Additionally, I was also wondering if it is possible to control the max > number of processors used in the analysis. I am working on a cluster shared > between many people and apparently the software uses as many processors as > possible, while I do not want to be that greedy with other users .. > > Thanks for any hint, > > mattia _______________________________________________ Bioconductor mailing list Bioconductor@r-project.org https://stat.ethz.ch/mailman/listinfo/bioconductor Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor [[alternative HTML version deleted]]

Alignment rGADEM MotIV Alignment rGADEM MotIV • 1.2k views

ADD COMMENT • link 12.8 years ago Mercier Eloi ▴ 60

Login before adding your answer.