collessI {treebalance}R Documentation

Calculation of the Colless index for rooted binary trees

Description

This function calculates variants of the Colless index for a given rooted binary tree TT. All of them are imbalance indices.

The original Colless index C(T)C(T) is defined as

C(T)=uVin(T)nuanubC(T)=\sum_{u \in V_{in}(T)} |n_{u_a}-n_{u_b}|

in which Vin(T)V_{in}(T) denotes the set of all inner vertices of TT, and in which nuan_{u_a} and nubn_{u_b} denote the number of leaves in the two pending subtrees that are rooted at the direct descendants of uu.

The corrected Colless index IC(T)I_C(T) of TT is defined as IC(T)=0I_C(T)=0 for n=1n=1 and n=2n=2 and for n>2n>2 as

IC(T)=2C(T)(n1)(n2)I_C(T)=\frac{2\cdot C(T)}{(n-1)\cdot(n-2)}

in which nn denotes the total number of leaves in TT.

The quadratic Colless index QC(T)QC(T) of TT is defined as

QC(T)=uVin(T)nuanub2QC(T)=\sum_{u\in V_{in}(T)} |n_{u_a}-n_{u_b}|^2



Special cases: For n=1n=1 the function returns C(T)=IC(T)=QC(T)=0C(T)=I_C(T)=QC(T)=0 and a warning.

For details on the original, corrected and quadratic Colless indices, see also Chapters 12, 13 and 15 in "Tree balance indices: a comprehensive survey" (https://doi.org/10.1007/978-3-031-39800-1_12, https://doi.org/10.1007/978-3-031-39800-1_13, https://doi.org/10.1007/978-3-031-39800-1_15).

Usage

collessI(tree, method = "original")

Arguments

tree

A rooted binary tree in phylo format.

method

A character string specifying the version that shall be computed. It can be one of the following: "original", "corrected", "quadratic".

Value

collessI returns the Colless index of the given tree according to the chosen method.

Author(s)

Luise Kuehn and Sophie Kersting

References

D. Colless. Review of Phylogenetics: the theory and practice of phylogenetic systematics. Systematic Zoology, 1982. ISSN 00397989.

T. M. Coronado, M. Fischer, L. Herbst, F. Rossello, and K. Wicke. On the minimum value of the Colless index and the bifurcating trees that achieve it. Journal of Mathematical Biology, 2020.doi: 10.1007/s00285-020-01488-9.

S. B. Heard. Patterns in tree balance among cladistic, phenetic, and randomly generated phylogenetic trees. Evolution, 1992. doi: 10.1111/j.1558-5646.1992.tb01171.x.

K. Bartoszek, T. M. Coronado, A. Mir, and F. Rossello. Squaring within the Colless index yields a better balance index. Mathematical Biosciences, 331:108503, 2021. doi: 10.1016/j.mbs.2020.108503.

Examples

tree <- ape::read.tree(text="((((,),),(,)),(((,),),(,)));")
collessI(tree, method="original")
collessI(tree, method="corrected")
collessI(tree, method="quadratic")


[Package treebalance version 1.2.0 Index]