add_from_x | Add a variable from one of the data sets to pairs |
add_from_y | Add a variable from one of the data sets to pairs |
cluster_call | Call a function on each of the worker nodes and pass it the pairs |
cluster_collect | Collect pairs from cluster nodes |
cluster_modify_pairs | Call a function on each of the worker nodes to modify the pairs on the node |
cluster_pair | Generate all possible pairs using multiple processes |
cluster_pair_blocking | Generate pairs using simple blocking using multiple processes |
cluster_pair_minsim | Generate pairs with a minimal similarity using multiple processes |
cmp_identical | Comparison functions |
cmp_jaccard | Comparison functions |
cmp_jarowinkler | Comparison functions |
cmp_lcs | Comparison functions |
compare_pairs | Compare pairs on a set of variables common in both data sets |
compare_pairs.cluster_pairs | Compare pairs on a set of variables common in both data sets |
compare_pairs.pairs | Compare pairs on a set of variables common in both data sets |
compare_vars | Compare pairs on given variables |
compare_vars.cluster_pairs | Compare pairs on given variables |
compare_vars.pairs | Compare pairs on given variables |
deduplicate_equivalence | Deduplication using equivalence groups |
get_inspect_pairs | Get a subset of pairs to inspect |
greedy | Greedy one-to-one matching of pairs |
jaccard | Comparison functions |
jaro_winkler | Comparison functions |
lcs | Comparison functions |
link | Use the selected pairs to generate a linked data set |
linkexample1 | Tiny example dataset for probabilistic linkage |
linkexample2 | Tiny example dataset for probabilistic linkage |
match_n_to_m | Force n to m matching on a set of pairs |
merge_pairs | Merge two sets of pairs into one |
merge_pairs.cluster_pairs | Merge two sets of pairs into one |
merge_pairs.pairs | Merge two sets of pairs into one |
pair | Generate all possible pairs |
pair_blocking | Generate pairs using simple blocking |
pair_minsim | Generate pairs with a minimal similarity |
predict.problink_em | Calculate weights and probabilities for pairs |
problink_em | Calculate EM-estimates of m- and u-probabilities |
rbind.cluster_pairs | Merge two sets of pairs into one |
rbind.pairs | Merge two sets of pairs into one |
score_simple | Score pairs based on a number of comparison vectors |
score_simple.cluster_pairs | Score pairs based on a number of comparison vectors |
score_simple.pairs | Score pairs based on a number of comparison vectors |
select_greedy | Select matching pairs enforcing one-to-one linkage |
select_greedy.cluster_pairs | Select matching pairs enforcing one-to-one linkage |
select_greedy.pairs | Select matching pairs enforcing one-to-one linkage |
select_n_to_m | Select matching pairs enforcing one-to-one linkage |
select_n_to_m.cluster_pairs | Select matching pairs enforcing one-to-one linkage |
select_n_to_m.pairs | Select matching pairs enforcing one-to-one linkage |
select_threshold | Select matching pairs with a score above or equal to a threshold |
select_threshold.cluster_pairs | Select matching pairs with a score above or equal to a threshold |
select_threshold.pairs | Select matching pairs with a score above or equal to a threshold |
select_unique | Deselect pairs that are linked to multiple records |
select_unique.cluster_pairs | Deselect pairs that are linked to multiple records |
select_unique.pairs | Deselect pairs that are linked to multiple records |
summary.problink_em | Summarise the results from 'problink_em' |
tabulate_patterns | Create a table of comparison patterns |
tabulate_patterns.cluster_pairs | Create a table of comparison patterns |
tabulate_patterns.pairs | Create a table of comparison patterns |
town_names | Spelling variations of a set of town names |