| compare_pairs.cluster_pairs {reclin2} | R Documentation |
Compare pairs on a set of variables common in both data sets
Description
Compare pairs on a set of variables common in both data sets
Usage
## S3 method for class 'cluster_pairs'
compare_pairs(
pairs,
on,
comparators = list(default_comparator),
default_comparator = cmp_identical(),
new_name = NULL,
...
)
compare_pairs(
pairs,
on,
comparators = list(default_comparator),
default_comparator = cmp_identical(),
...
)
## S3 method for class 'pairs'
compare_pairs(
pairs,
on,
comparators = list(default_comparator),
default_comparator = cmp_identical(),
x = attr(pairs, "x"),
y = attr(pairs, "y"),
inplace = FALSE,
...
)
Arguments
pairs |
|
on |
character vector of variables that should be compared. |
comparators |
named list of functions with which the variables are compared.
This function should accept two vectors. Function should either return a vector
or a |
default_comparator |
variables for which no comparison function is defined using
|
new_name |
name of new object to assign the pairs to on the cluster nodes. |
... |
Ignored for now |
x |
|
y |
|
inplace |
logical indicating whether |
Details
It is assumed the variables in on are present in both x and y. Variables
with the same names are added to pairs.
When the comparator returns a data.table multiple columns are added to pairs.
The names of these columns are variable pasted together with the names of
the data.table returned by comparator (separated by "_").
Value
Returns the data.table pairs with one or more columns added in
case of compare_pairs.pairs.
In case of compare_pairs.cluster_pairs, compare_pair.pairs is called on
each cluster node and the resulting pairs are assigned to new_name in
the environment reclin_env. When new_name is not given (or
equal to NULL) the original pairs on the nodes are overwritten.