quantilecl {quantileDA}    R Documentation

A function that applies the quantile classifier for a given set of quantile probabilities and selects the best quantile classifier in the training set.

Description

The function applies the quantile classifier for a set of quantile probabilities and selects the optimal probability, that is, the one that minimizes the misclassification rate in the training set.

Usage

quantilecl(train, test, cl, theta = NULL, 
cl.test = NULL, skew.correct="Galton")

Arguments

train

The training set, with observations in rows and variables in columns. It can be a matrix or a data frame.

test

The test set, with observations in rows and variables in columns. It can be a matrix or a data frame.

cl

A vector of class labels, one for each sample of the training set. It can be a factor or a numeric vector.

theta

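An optional vector of quantile probabilities. If it is not supplied, the probabilities are selected automatically as 49 equispaced values between 0 and 1.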

cl.test

If available, a vector of class labels for each sample of the test set (optional)

skew.correct

The skewness measure used to correct the skewness direction of the variables. The possible choices are Galton's skewness (default), Kelley's skewness and the conventional skewness index based on the third standardized moment.
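As an illustration of what a skewness-direction correction can look like, the following base R sketch computes Galton's quartile skewness and flips the sign of any variable whose skewness is negative. The helper names galton_skewness and correct_skewness are hypothetical, and estimating the direction on the training set and reusing it for the test set is an assumption; the package's internal code may differ.

```r
# Galton's (quartile) skewness: (Q3 + Q1 - 2 * Q2) / (Q3 - Q1).
galton_skewness <- function(x) {
  q <- quantile(x, probs = c(0.25, 0.5, 0.75), names = FALSE)
  (q[3] + q[1] - 2 * q[2]) / (q[3] - q[1])
}

# Hypothetical helper: flip the sign of every column of `train` whose
# Galton skewness is negative, and apply the same flips to `test`.
# (Assumption: the direction is estimated on the training set only.)
correct_skewness <- function(train, test) {
  train <- as.matrix(train)
  test <- as.matrix(test)
  signs <- ifelse(apply(train, 2, galton_skewness) < 0, -1, 1)
  list(train = sweep(train, 2, signs, `*`),
       test  = sweep(test, 2, signs, `*`),
       signs = signs)
}

# Toy data: column a is drawn right-skewed, column b left-skewed.
set.seed(1)
toy <- cbind(a = rexp(200), b = -rexp(200))
fixed <- correct_skewness(toy, toy)

# After the correction no column has negative Galton skewness.
apply(fixed$train, 2, galton_skewness)
```

Kelley's skewness follows the same pattern with the first and ninth deciles in place of the quartiles.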

Details

quantilecl carries out the quantile classifier for a set of quantile probabilities and selects the optimal probability, that is, the one that minimizes the misclassification rate in the training set. The quantile probabilities can be supplied as input or selected automatically as an equispaced grid of 49 values between 0 and 1. The data in the training and test samples are preprocessed so that the variables used for the quantile estimator all have the same (positive) direction of skewness according to one of three measures: Galton's skewness, Kelley's skewness or the conventional skewness index.
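The selection step described above can be sketched in base R as follows. This is a minimal illustration, not the package's implementation: it assumes the quantile classifier assigns an observation to the class whose componentwise theta-quantiles are closest under the check-loss (quantile) distance, summed over variables without any rescaling. The helpers quantile_distance and train_error are hypothetical, the grid endpoints 0.02 and 0.98 are an assumption (the text only states 49 equispaced values between 0 and 1), and the variable names merely mirror the components listed under Value.

```r
# Check-loss distance between an observation x and a vector of class
# quantiles q: theta * u for u >= 0 and (1 - theta) * |u| for u < 0.
quantile_distance <- function(x, q, theta) {
  u <- x - q
  sum(u * (theta - (u < 0)))
}

# Training misclassification rate of the sketched classifier for one theta.
train_error <- function(train, cl, theta) {
  train <- as.matrix(train)
  classes <- sort(unique(cl))
  # One row of theta-quantiles per class, one column per variable.
  q <- do.call(rbind, lapply(classes, function(k) {
    apply(train[cl == k, , drop = FALSE], 2, quantile,
          probs = theta, names = FALSE)
  }))
  # Index of the closest class for every training observation.
  idx <- apply(train, 1, function(x) {
    which.min(apply(q, 1, function(qk) quantile_distance(x, qk, theta)))
  })
  mean(classes[idx] != cl)
}

# Toy two-class data with skewed variables.
set.seed(22)
cl <- rep(1:2, each = 100)
train <- cbind(x1 = c(rexp(100, 1), rexp(100, 0.5)),
               x2 = c(rexp(100, 1), rexp(100, 2)))

# Assumed grid: 49 equispaced probabilities strictly inside (0, 1).
thetas <- seq(0.02, 0.98, length.out = 49)
train.rates <- sapply(thetas, function(th) train_error(train, cl, th))

# Pick the probability with the lowest training error.
theta.choice <- thetas[which.min(train.rates)]
me.train <- min(train.rates)
```

Because which.min returns the first minimum, ties between probabilities are resolved in favour of the smallest one in this sketch.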

Value

A list with components

train.rates

Misclassification errors for each quantile probability in the training set

test.rates

Misclassification errors for each quantile probability in the test set

thetas

The list of optimal quantile probabilities for each variable

theta.choice

The quantile probability that gives the lowest misclassification error in the training set

me.train

Misclassification error in the training set

me.test

Misclassification error in the test set (only if cl.test is available)

train

The matrix of data (training set) with observations in rows and variables in columns

test

The matrix of data (test set) with observations in rows and variables in columns

cl.train

Predicted classification in the training set

cl.test

Predicted classification in the test set

cl.train.0

The true classification labels in the training set

cl.test.0

The true classification labels in the test set (if available)
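To show how the predicted and true label components relate to a misclassification error, here is a small sketch with made-up label vectors standing in for cl.test and cl.test.0. It assumes the error is the proportion of mismatched labels; the vectors are purely illustrative and are not output of the package.

```r
# Hypothetical stand-ins for out$cl.test (predicted) and out$cl.test.0 (true).
predicted <- c(1, 2, 2, 1, 1, 2)
truth     <- c(1, 2, 1, 1, 2, 2)

# Proportion of observations whose predicted label differs from the truth.
me <- mean(predicted != truth)

# Cross-tabulation of true against predicted classes.
confusion <- table(truth = truth, predicted = predicted)
confusion
```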

Author(s)

Christian Hennig, Cinzia Viroli

See Also

quantilecl.vw

Examples

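library(quantileDA)

# Load the ais data; columns 3 to 13 are used as variables and
# column 1 holds the class labels, converted here to numeric codes.
data(ais)
x <- ais[, 3:13]
cl <- as.double(ais[, 1])

# Split the 202 observations into 152 training and 50 test cases.
set.seed(22)
index <- sample(1:202, 152, replace = FALSE)
train <- x[index, ]
test <- x[-index, ]
cl.train <- cl[index]
cl.test <- cl[-index]

# Run the classifier over the default set of quantile probabilities.
out.q <- quantilecl(train, test, cl.train, cl.test = cl.test)
out.q$me.test   # misclassification error in the test set
print(out.q)
plot(out.q)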

[Package quantileDA version 1.2 Index]