pred_gp {MagmaClustR} | R Documentation |
Gaussian Process prediction
Description
Compute the posterior distribution of a standard GP, using the formalism of Magma. By providing observed data, the prior mean and covariance matrix (by defining a kernel and its associated hyper-parameters), the mean and covariance parameters of the posterior distribution are computed on the grid of inputs that has been specified. This predictive distribution can be evaluated on any arbitrary inputs since a GP is an infinite-dimensional object.
Usage
pred_gp(
data = NULL,
grid_inputs = NULL,
mean = NULL,
hp = NULL,
kern = "SE",
get_full_cov = FALSE,
plot = TRUE,
pen_diag = 1e-10
)
Arguments
data |
A tibble or data frame. Required columns: 'Input', 'Output'. Additional columns for covariates can be specified. The 'Input' column should define the variable that is used as reference for the observations (e.g. time for longitudinal data). The 'Output' column specifies the observed values (the response variable). The data frame can also provide as many covariates as desired, with no constraints on the column names. These covariates are additional inputs (explanatory variables) of the models that are also observed at each reference 'Input'. If NULL, the prior GP is returned. |
grid_inputs |
The grid of inputs (reference Input and covariates) values
on which the GP should be evaluated. Ideally, this argument should be a
tibble or a data frame, providing the same columns as |
mean |
Mean parameter of the GP. This argument can be specified under various formats, such as:
|
hp |
A named vector, tibble or data frame of hyper-parameters
associated with |
kern |
A kernel function, defining the covariance structure of the GP. Several popular kernels (see The Kernel Cookbook) are already implemented and can be selected within the following list:
|
get_full_cov |
A logical value, indicating whether the full posterior covariance matrix should be returned. |
plot |
A logical value, indicating whether a plot of the results is automatically displayed. |
pen_diag |
A number. A jitter term, added on the diagonal to prevent numerical issues when inverting nearly singular matrices. |
Value
A tibble, representing the GP predictions as two column 'Mean' and
'Var', evaluated on the grid_inputs
. The column 'Input' and
additional covariates columns are associated to each predicted values.
If the get_full_cov
argument is TRUE, the function returns a list,
in which the tibble described above is defined as 'pred' and the full
posterior covariance matrix is defined as 'cov'.
Examples
TRUE