R: MLE of distributions for compositional data

MLE of distributions for compositional data {MLE}

R Documentation

MLE of distributions for compositional data

Description

MLE of distributions for compositional data.

Usage

comp.mle(x, distr = "diri", type = 1, a = NULL, tol = 1e-07)

Arguments

`x`	A matrix containing the compositional data. Zero values are not allowed except for the case of the ZAD which is designed for the case of zero values present.
`distr`	The distribution to fit. "diri" stands for the Dirichlet distribution, "zad" is the Zero Adjusted Dirichlet distribution and "afolded" for the `\alpha`-folded model (Tsagris and Stewart, 2020).
`type`	This is for the Dirichlet distribution ("diri"). Type 1 uses a vectorised version of the Newton-Raphson (Minka, 2012). In high dimensions this is to be preferred. If the data are too concentrated, regardless of the dimensions, this is also to be preferrred. Type 2 uses the regular Newton-Raphson, with matrix multiplications. In small dimensions this can be considerably faster.
`a`	The value of `\alpha`. If this is NULL, the function will estimate it internally.
`tol`	The tolerance level idicating no further increase in the log-likelihood.

Details

Maximum likelihood estimation of the parameters of a Dirichlet distribution is performed via Newton-Raphson. Initial values suggested by Minka (2012) are used.

Value

A list including:

`loglik`	The value of the log-likelihood.
`param`	The estimated parameters.
`phi`	The precision parameter. If covariates are linked with it (function "diri.reg2"), this will be a vector.
`mu`	The mean vector of the distribution.
`runtime`	The time required by the MLE.
`best`	The estimated optimal `\alpha` of the folded model.
`p`	The estimated probability inside the simplex of the folded model.
`mu`	The estimated mean vector of the folded model.
`su`	The estimated covariance matrix of the folded model.

Author(s)

Michail Tsagris and Sofia Piperaki.

R implementation and documentation: Michail Tsagris mtsagris@uoc.gr and Sofia Piperaki sofiapip23@gmail.com.

References

Minka Thomas (2012). Estimating a Dirichlet distribution. Technical report.

Ng Kai Wang, Guo-Liang Tian, and Man-Lai Tang (2011). Dirichlet and related distributions: Theory, methods and applications. John Wiley & Sons.

Tsagris M. and Stewart C. (2018). A Dirichlet regression model for compositional data with zeros. Lobachevskii Journal of Mathematics, 39(3): 398–412. Preprint available from https://arxiv.org/pdf/1410.5011.pdf

Tsagris M. and Stewart C. (2022). A Review of Flexible Transformations for Modeling Compositional Data. In Advances and Innovations in Statistics and Data Science, pp. 225–234. https://link.springer.com/chapter/10.1007/978-3-031-08329-7_10

Tsagris M. and Stewart C. (2020). A folded model for compositional data analysis. Australian and New Zealand Journal of Statistics, 62(2): 249–277. https://arxiv.org/pdf/1802.07330.pdf

Examples

x <- matrix( rgamma(100 * 4, c(5, 6, 7, 8), 1), ncol = 4)
x <- x / rowSums(x)
res <- comp.mle(x)

[Package MLE version 1.0 Index]