Multivariate or univariate regression with compositional data in the covariates side using the alpha-transformation {Compositional}R Documentation

Multivariate or univariate regression with compositional data in the covariates side using the \alpha-transformation

Description

Multivariate or univariate regression with compositional data in the covariates side using the \alpha-transformation.

Usage

alfa.pcr(y, x, a, k, model = "gaussian", xnew = NULL)

Arguments

y

A numerical vector containing the response variable values. They can be continuous, binary, discrete (counts). This can also be a vector with discrete values or a factor for the multinomial regression (model = "multinomial").

x

A matrix with the predictor variables, the compositional data.

a

The value of the power transformation, it has to be between -1 and 1. If zero values are present it has to be greater than 0. If \alpha=0 the isometric log-ratio transformation is applied.

k

How many principal components to use. You may also specify a vector and in this case the results produced will refer to each number of principal components.

model

The type of regression model to fit. The possible values are "gaussian", "multinomial", "binomial" and "poisson".

xnew

A matrix containing the new compositional data whose response is to be predicted. If you have no new data, leave this NULL as is by default.

Details

The \alpha-transformation is applied to the compositional data first ,the first k principal component scores are calcualted and used as predictor variables for a regression model. The family of distributions can be either, "normal" for continuous response and hence normal distribution, "binomial" corresponding to binary response and hence logistic regression or "poisson" for count response and poisson regression.

Value

A list tincluding:

be

If linear regression was fitted, the regression coefficients of the k principal component scores on the response variable y.

mod

If another regression model was fitted its outcome as produced in the package Rfast.

per

The percentage of variance explained by the first k principal components.

vec

The first k principal components, loadings or eigenvectors. These are useful for future prediction in the sense that one needs not fit the whole model again.

est

If the argument "xnew" was given these are the predicted or estimated values (if xnew is not NULL). If the argument k is a vector then this is a matrix with the estimated values for each number of components.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris mtsagris@uoc.gr.

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Tsagris M.T., Preston S. and Wood A.T.A. (2011). A data-based power transformation for compositional data. In Proceedings of the 4th Compositional Data Analysis Workshop, Girona, Spain. https://arxiv.org/pdf/1106.1451.pdf

See Also

glm.pcr, alfapcr.tune

Examples

library(MASS)
y <- as.vector(fgl[, 1])
x <- as.matrix(fgl[, 2:9])
x <- x / rowSums(x)
mod <- alfa.pcr(y = y, x = x, 0.7, 1)
mod

[Package Compositional version 6.9 Index]