R: Cross-validated grid search for GenSVM

gensvm.grid {gensvm}

R Documentation

Cross-validated grid search for GenSVM

Description

This function performs a cross-validated grid search of the model parameters to find the best hyperparameter configuration for a given dataset. This function takes advantage of GenSVM's ability to use warm starts to speed up computation. The function uses the GenSVM C library for speed.

Usage

gensvm.grid(
  x,
  y,
  param.grid = "tiny",
  refit = TRUE,
  scoring = NULL,
  cv = 3,
  verbose = 0,
  return.train.score = TRUE
)

Arguments

`x`	training data matrix. We denote the size of this matrix by n_samples x n_features.
`y`	training vector of class labels of length n_samples. The number of unique labels in this vector is denoted by n_classes.
`param.grid`	String (`'tiny'`, `'small'`, or `'full'`) or data frame with parameter configurations to evaluate. Typically this is the output of `expand.grid`. For more details, see "Using a Parameter Grid" below.
`refit`	boolean variable. If true, the best model from cross validation is fitted again on the entire dataset.
`scoring`	metric to use to evaluate the classifier performance during cross validation. The metric should be an R function that takes two arguments: y_true and y_pred and that returns a float such that higher values are better. If it is NULL, the accuracy score will be used.
`cv`	the number of cross-validation folds to use or a vector with the same length as `y` where each unique value denotes a test split.
`verbose`	integer to indicate the level of verbosity (higher is more verbose)
`return.train.score`	whether or not to return the scores on the training splits

Value

A "gensvm.grid" S3 object with the following items:

`call`	Call that produced this object
`param.grid`	Sorted version of the parameter grid used in training
`cv.results`	A data frame with the cross validation results
`best.estimator`	If refit=TRUE, this is the GenSVM model fitted with the best hyperparameter configuration, otherwise it is NULL
`best.score`	Mean cross-validated test score for the model with the best hyperparameter configuration
`best.params`	Parameter configuration that provided the highest mean cross-validated test score
`best.index`	Row index of the cv.results data frame that corresponds to the best hyperparameter configuration
`n.splits`	The number of cross-validation splits
`n.objects`	The number of instances in the data
`n.features`	The number of features of the data
`n.classes`	The number of classes in the data
`classes`	Array with the unique classes in the data
`total.time`	Training time for the grid search
`cv.idx`	Array with cross validation indices used to split the data

Using a Parameter Grid

To evaluate certain parameter configurations, a data frame can be supplied to the param.grid argument of the function. Such a data frame can easily be generated using the R function expand.grid, or could be created through other ways to test specific parameter configurations.

Three parameter grids are predefined:

'tiny': This parameter grid is generated by the function gensvm.load.tiny.grid and is the default parameter grid. It consists of parameter configurations that are likely to perform well on various datasets.
'small': This grid is generated by gensvm.load.small.grid and generates a data frame with 90 configurations. It is typically fast to train but contains some configurations that are unlikely to perform well. It is included for educational purposes.
'full': This grid loads the parameter grid as used in the GenSVM paper. It consists of 342 configurations and is generated by the gensvm.load.full.grid function. Note that in the GenSVM paper cross validation was done with this parameter grid, but the final training step used epsilon=1e-8. The gensvm.refit function is useful in this scenario.

When you provide your own parameter grid, beware that only certain column names are allowed in the data frame corresponding to parameters for the GenSVM model. These names are:

p: Parameter for the lp norm. Must be in [1.0, 2.0].
kappa: Parameter for the Huber hinge function. Must be larger than -1.
lambda: Parameter for the regularization term. Must be larger than 0.
weights: Instance weights specification. Allowed values are "unit" for unit weights and "group" for group-size correction weights
epsilon: Stopping parameter for the algorithm. Must be larger than 0.
max.iter: Maximum number of iterations of the algorithm. Must be larger than 0.
kernel: The kernel to used, allowed values are "linear", "poly", "rbf", and "sigmoid". The default is "linear"
coef: Parameter for the "poly" and "sigmoid" kernels. See the section "Kernels in GenSVM" in the codeinkgensvm-package page for more info.
degree: Parameter for the "poly" kernel. See the section "Kernels in GenSVM" in the codeinkgensvm-package page for more info.
gamma: Parameter for the "poly", "rbf", and "sigmoid" kernels. See the section "Kernels in GenSVM" in the codeinkgensvm-package page for more info.

For variables that are not present in the param.grid data frame the default parameter values in the gensvm function will be used.

Note that this function reorders the parameter grid to make the warm starts as efficient as possible, which is why the param.grid in the result will not be the same as the param.grid in the input.

Note

1. This function returns partial results when the computation is interrupted by the user. 2. The score.time reported in the results only covers the time needed to compute the score from the predictions and true class labels. It does not include the time to compute the predictions themselves.

Author(s)

Gerrit J.J. van den Burg, Patrick J.F. Groenen
Maintainer: Gerrit J.J. van den Burg <gertjanvandenburg@gmail.com>

References

Van den Burg, G.J.J. and Groenen, P.J.F. (2016). GenSVM: A Generalized Multiclass Support Vector Machine, Journal of Machine Learning Research, 17(225):1–42. URL https://jmlr.org/papers/v17/14-526.html.

Examples

x <- iris[, -5]
y <- iris[, 5]


# use the default parameter grid
grid <- gensvm.grid(x, y, verbose=TRUE)


# use a smaller parameter grid
pg <- expand.grid(p=c(1.0, 1.5, 2.0), kappa=c(-0.9, 1.0), epsilon=c(1e-3))
grid <- gensvm.grid(x, y, param.grid=pg)

# print the result
print(grid)


# Using a custom scoring function (accuracy as percentage)
acc.pct <- function(yt, yp) { return (100 * sum(yt == yp) / length(yt)) }
grid <- gensvm.grid(x, y, scoring=acc.pct)

# With RBF kernel and very verbose progress printing
pg <- expand.grid(kernel=c('rbf'), gamma=c(1e-2, 1e-1, 1, 1e1, 1e2),
                  lambda=c(1e-8, 1e-6), max.iter=c(5000))
grid <- gensvm.grid(x, y, param.grid=pg, verbose=2)

[Package gensvm version 0.1.7 Index]