| pei {dgpsi} | R Documentation |
Locate the next design point for a (D)GP emulator or a bundle of (D)GP emulators using PEI
Description
This function searches from a candidate set to locate the next design point(s) to be added to a (D)GP emulator or a bundle of (D)GP emulators using the Pseudo Expected Improvement (PEI), see the reference below.
Usage
pei(object, x_cand, ...)
## S3 method for class 'gp'
pei(object, x_cand, pseudo_points = NULL, batch_size = 1, ...)
## S3 method for class 'dgp'
pei(
object,
x_cand,
pseudo_points = NULL,
batch_size = 1,
workers = 1,
threading = FALSE,
aggregate = NULL,
...
)
## S3 method for class 'bundle'
pei(
object,
x_cand,
pseudo_points = NULL,
batch_size = 1,
workers = 1,
threading = FALSE,
aggregate = NULL,
...
)
Arguments
object |
can be one of the following:
|
x_cand |
a matrix (with each row being a design point and column being an input dimension) that gives a candidate set
from which the next design point(s) are determined. If |
... |
any arguments (with names different from those of arguments used in |
pseudo_points |
an optional matrix (with columns being input dimensions) that gives the pseudo input points for PEI calculations. See the reference below
for further details about the pseudo points. When |
batch_size |
an integer that gives the number of design points to be chosen.
Defaults to |
workers |
the number of workers/cores to be used for the criterion calculation. If set to |
threading |
a bool indicating whether to use the multi-threading to accelerate the criterion calculation for a DGP emulator. Turning this option on could improve the speed of criterion calculations when the DGP emulator is built with a moderately large number of training data points and the Matérn-2.5 kernel. |
aggregate |
an R function that aggregates scores of the PEI across different output dimensions (if
Set to |
Details
See further examples and tutorials at https://mingdeyu.github.io/dgpsi-R/.
Value
If
objectis an instance of thegpclass, a vector is returned with the length equal tobatch_size, giving the positions (i.e., row numbers) of next design points fromx_cand.If
objectis an instance of thedgpclass, a matrix is returned with row number equal tobatch_sizeand column number equal to one (ifaggregateis notNULL) or the output dimension (ifaggregateisNULL), giving positions (i.e., row numbers) of next design points fromx_candto be added to the DGP emulator across different outputs.If
objectis an instance of thebundleclass, a matrix is returned with row number equal tobatch_sizeand column number equal to the number of emulators in the bundle, giving positions (i.e., row numbers) of next design points fromx_candto be added to individual emulators.
Note
The column order of the first argument of
aggregatemust be consistent with the order of emulator output dimensions (ifobjectis an instance of thedgpclass), or the order of emulators placed inobjectifobjectis an instance of thebundleclass;If
x_candis supplied as a list whenobjectis an instance ofbundleclass and aaggregatefunction is provided, the matrices inx_candmust have common rows (i.e., the candidate sets of emulators in the bundle have common input locations) so theaggregatefunction can be applied.The function is only applicable to DGP emulators without likelihood layers.
Any R vector detected in
x_candandpseudo_pointswill be treated as a column vector and automatically converted into a single-column R matrix.
References
Mohammadi, H., Challenor, P., Williamson, D., & Goodfellow, M. (2022). Cross-validation-based adaptive sampling for Gaussian process models. SIAM/ASA Journal on Uncertainty Quantification, 10(1), 294-316.
Examples
## Not run:
# load packages and the Python env
library(lhs)
library(dgpsi)
# construct a 1D non-stationary function
f <- function(x) {
sin(30*((2*x-1)/2-0.4)^5)*cos(20*((2*x-1)/2-0.4))
}
# generate the initial design
X <- maximinLHS(10,1)
Y <- f(X)
# training a 2-layered DGP emulator with the global connection off
m <- dgp(X, Y, connect = F)
# generate a candidate set
x_cand <- maximinLHS(200,1)
# locate the next design point using PEI
next_point <- pei(m, x_cand = x_cand)
X_new <- x_cand[next_point,,drop = F]
# obtain the corresponding output at the located design point
Y_new <- f(X_new)
# combine the new input-output pair to the existing data
X <- rbind(X, X_new)
Y <- rbind(Y, Y_new)
# update the DGP emulator with the new input and output data and refit with 500 training iterations
m <- update(m, X, Y, refit = TRUE, N = 500)
# plot the LOO validation
plot(m)
## End(Not run)