In the new version of Bioconductor I am using the GenomicRanges package, but intervaltree is now defunct. I believed the intervaltree algorithm was used for finding overlapping regions between different sets of genomic intervals. What is the main difference between the intervaltree and nclist algorithms for finding overlapping regions in R? I am a bit confused and would be grateful for a comprehensive explanation. Also, for finding overlapping regions in parallel, is intervaltree the only way to solve the problem? Thanks a lot to this community.
Short answer: They return basically the same thing, but the new algorithm (nclist) is more efficient.
Long answer: Here is what the documentation says about the
algorithm argument (documented in
algorithm: Can be "nclist" (the default) or "intervaltree". This argument was added in BioC 3.1 to facilitate the transition between the new countOverlaps implementation based on Nested Containment Lists and the old implementation based on Interval Trees. See ?IntervalTree for more information about these implementations. Note that the old implementation is defunct starting with BioC 3.2. The algorithm argument will be removed in BioC 3.3.
The man page for NCList objects (?NCList) in particular contains information about the differences between the 2 algorithms.
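In practice this means you can simply call the overlap functions without the algorithm argument and get the default (nclist) behaviour. Here is a minimal sketch, assuming the GenomicRanges package is installed; the chromosome name and the coordinates are made-up toy data, not taken from the question.

```r
# Minimal sketch: overlap search with the default algorithm.
# Assumes Bioconductor's GenomicRanges package is installed.
library(GenomicRanges)

# Toy intervals (hypothetical data, for illustration only)
query <- GRanges("chr1", IRanges(start = c(1, 20, 50), end = c(10, 30, 60)))
subject <- GRanges("chr1", IRanges(start = c(5, 25, 100), end = c(15, 27, 110)))

# No 'algorithm' argument is passed, so the default is used
hits <- findOverlaps(query, subject)

# Indices of the overlapping pairs
queryHits(hits)
subjectHits(hits)

# Number of subject ranges overlapping each query range
countOverlaps(query, subject)
# 1 1 0
```

Code written this way does not depend on the algorithm argument at all, so it should keep working once that argument is gone.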