Entering edit mode
regcastillo1990
•
0
@regcastillo1990-20939
Last seen 5.5 years ago
I want to keep the original counts as raw as possible (without tpm), even after annotation. In case of clustered tags I need to sum up the counts of the tags merged in to a cluster.
Does clusterCTSS
and aggregateTagClusters
sum raw-tag counts (and tmp) of the clustered tags into the resultant cluster?
Are there a way to force sum of counts, ignoring tmp or other treatment of counts?
Which column reflect the sum of counts?
I set method="none", this is my code:
normalizeTagCount(ce, method = "none", fitInRange = c(10, 1000), alpha = 1.25, T = 10^6)
clusterCTSS( ce, threshold = 1, thresholdIsTpm = TRUE, nrPassThreshold = 1, method = "distclu", maxDist = 20, removeSingletons = TRUE, keepSingletonsAbove = 5)
aggregateTagClusters(ce, tpmThreshold = 5, excludeSignalBelowThreshold = TRUE, qLow = NULL, qUp = NULL, maxDist = 100, useMulticore = FALSE, nrCores = NULL)
ce is my CAGEexp object.
clusterCTSS
andaggregateTagClusters
calculate both the raw tag count and the normalised expression scores of the consensus clusters. It is not possible to skip the calculation of the normalised expression scores (at this point, I am not sure if this would bring a significant performance advantage).The raw tag counts of each consensus cluster can be found in the
counts
assay of the theSummarizedExperiment
object returned by theconsensusClustersSE
function. It is aDataFrame
ofRle
-encoded integers. See for instance the example data:Thank you Plessy, that solved my present problem.
When I tried
CTSStagCountGR(ce)
I got this messageError in samples %in%
sampleLabels(object)
: argument "samples" is missing, with no defaultWhere this
samples
come from? What did I miss?This is what
sampleLabels(ce)
return of my ce objectCTSStagCountGR
works at the level of individual samples, so it needs a sample argument. If you want all CTSS positions, you can useCTSScoordinatesGR
If you want an expression table, you can useCTSStagCountDF
.Thank you C. Plessy. I successfully completed a pipeline with CAGEr to get the counts per sample of each consensus clustered tags, with at most 12 libraries. I added other three libraries but got this error.
Error in validObject(.Object) : invalid class “ConsensusClusters” object: Consensus clusters must not overlap with each other
I run a set with only these three libraries, and get a reasonable output, without error, also combined with others. It depends on the combination of libraries I choose, and the error re-reappear. I need to process all libraries together, this time 21, but I couldn't pass from 12, and not all combination of 12 in 21 escape from this error message.
Is it not the job of
aggregateTagClusters
function, to aggregates tag clusters into non-overlapping consensus clusters? Should I care for parameters or something else, to made it possible?I use the same parameters of my initial post, above in this thread. any advise?
Hi, if you still have interest in CAGEr, maybe you can try our version 2.0; we fixed a lot of bugs there!