select_neighbours {ingredients} | R Documentation |
Select Subset of Rows Closest to a Specified Observation
Description
The function select_neighbours() selects the subset of rows from a data set that are closest to a specified observation.
This is useful when the data is large and only a sample is needed to calculate profiles.
Usage
select_neighbours(
observation,
data,
variables = NULL,
distance = gower::gower_dist,
n = 20,
frac = NULL
)
Arguments
observation |
single observation |
data |
set of observations |
variables |
names of the variables used to calculate the distance.
By default these are all variables present in |
distance |
the distance function, by default the gower::gower_dist function |
n |
number of neighbors to select |
frac |
if |
Details
Note that the select_neighbours() function is an S3 generic.
If you want to work with non-standard data sources (such as an H2O ddf or external databases),
you should overload it.
Value
A data frame with the selected rows.
Examples
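library("ingredients")
# Use the first apartment as the observation of interest
new_apartment <- DALEX::apartments[1, ]
# Select the 10 rows of apartments_test closest to it (default distance function)
small_apartments <- select_neighbours(new_apartment, DALEX::apartments_test, n = 10)
new_apartment
small_apartments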
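
The idea behind neighbour selection can also be illustrated without any packages. The following is a minimal base R sketch, not the package's implementation: `euclid_dist` and `nearest_rows` are hypothetical helpers, Euclidean distance stands in for the Gower distance, and the sketch assumes that the distance function receives the observation and the data (restricted to the chosen variables) and returns one distance per row of the data. Defaulting `variables` to the columns shared by both inputs is likewise an assumption made for this illustration.

```r
# Hypothetical helper: Euclidean distance from a single observation
# (one-row data frame) to every row of a numeric data frame.
euclid_dist <- function(observation, data) {
  obs <- unlist(observation[1, , drop = TRUE])
  diffs <- sweep(as.matrix(data), 2, obs)  # subtract the observation from each row
  sqrt(rowSums(diffs^2))
}

# Hypothetical sketch of neighbour selection: compute distances,
# order the rows by distance, and keep the n closest ones.
nearest_rows <- function(observation, data, variables = NULL,
                         distance = euclid_dist, n = 20) {
  if (is.null(variables)) {
    # Assumption: fall back to the columns present in both inputs
    variables <- intersect(colnames(observation), colnames(data))
  }
  d <- distance(observation[, variables, drop = FALSE],
                data[, variables, drop = FALSE])
  data[head(order(d), n), , drop = FALSE]
}

points <- data.frame(x = c(0, 1, 5, 2, 10), y = c(0, 1, 5, 2, 10))
target <- data.frame(x = 1.2, y = 1.2)

# Rows 2 and 4 of `points` lie closest to `target`
closest <- nearest_rows(target, points, n = 2)
closest
```

Any function with the same two-argument shape could be supplied as `distance` in this sketch, which is what makes the distance measure swappable.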
[Package ingredients version 2.3.0 Index]