Interpretation of output files
1
0
Entering edit mode
Guest User ★ 13k
@guest-user-4897
Last seen 9.6 years ago
Hello, I was able to successfully run TSSi program. However, I have several questions. My questions are - how "0.CUFF.16.1_2" is related to "0.CUFF.20.1_4", - how region is determined to each gene, - how this file could be interpreted, - and how nTss is determined? Thank you again for sparing your precious time. Best regards. -- output of sessionInfo(): When I exported data by using the command bleow, x <- segmentizeCounts(counts=counts, start=start, chr=chromosome, region=region, strand=strand) yFit <- normalizeCounts( x, fit=TRUE ) z <- identifyStartSites( yFit ) segmentsRd <- segmentsAsRangedData( z ) export.gff3( segmentsRd, paste(tmpFile, "gff", sep=".") ) I could see some features corresponding to each gene. s0_+_0.CUFF.16.1_2 rtracklayer sequence_feature 85285 87682 . - . chr=s0;region=0.CUFF.20.1_4;nPos=12;nCounts=860;nTss=11 Actually, according to annotation file, 0.CUFF.16.1_2 has starting site at 48426, whereas 0.CUFF.20.1_4 has starting site at 85285. -- Sent via the guest posting facility at bioconductor.org.
Annotation rtracklayer TSSi Annotation rtracklayer TSSi • 588 views
ADD COMMENT
0
Entering edit mode
Julian Gehring ★ 1.3k
@julian-gehring-5818
Last seen 5.0 years ago
Dear Soyeon Cha, When you split the counts into segments (as you did by calling 'segmentizeCounts'), each counts gets a name assigned. By default, these names have the format '<chromosome>_<strand>_<region>, but you can change it with the 'pattern' argument in 'segmentizeCounts'. As an example, 's0_+_0.CUFF.16.1_2' would be chromosome 's0', '+' strand, and region '0.CUFF.16.1_2'. How this relates to the genes in your organism depends on the regions that you defined initially. But since you have the coordinates of each start site, this should be easy to map back, even if you can't infer it from the region/segment name. 'nTSS' counts the number of identified start sites in each segment, as you also can see by inspecting 'z' with 'show'. Best wishes Julian On 08/03/2013 08:18 AM, from a TSSi user [guest] wrote: > > Hello, > I was able to successfully run TSSi program. However, I have several questions. > > My questions are > - how "0.CUFF.16.1_2" is related to "0.CUFF.20.1_4", > - how region is determined to each gene, > - how this file could be interpreted, > - and how nTss is determined? > > Thank you again for sparing your precious time. > Best regards. > > -- output of sessionInfo(): > > When I exported data by using the command bleow, > > x <- segmentizeCounts(counts=counts, start=start, chr=chromosome, region=region, strand=strand) > yFit <- normalizeCounts( x, fit=TRUE ) > z <- identifyStartSites( yFit ) > segmentsRd <- segmentsAsRangedData( z ) > export.gff3( segmentsRd, paste(tmpFile, "gff", sep=".") ) > > I could see some features corresponding to each gene. > s0_+_0.CUFF.16.1_2 rtracklayer sequence_feature 85285 87682 . - . chr=s0;region=0.CUFF.20.1_4;nPos=12;nCounts=860;nTss=11 > > Actually, according to annotation file, 0.CUFF.16.1_2 has starting site at 48426, whereas 0.CUFF.20.1_4 has starting site at 85285. > > > -- > Sent via the guest posting facility at bioconductor.org. > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor >
ADD COMMENT

Login before adding your answer.

Traffic: 767 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6