detect_similar_labels {daqapo} | R Documentation |
Search for similar labels in a column
Description
Function that tries to detect spelling mistakes in a given activity log column
Usage
detect_similar_labels(
activitylog,
column_labels,
max_edit_distance,
show_NA,
ignore_capitals,
filter_condition
)
Arguments
activitylog |
The activity log |
column_labels |
The name of the column(s) in which to search for spelling mistakes |
max_edit_distance |
The maximum number of insertions, deletions and substitutions that are allowed to be executed in order for two strings to be considered similar. |
show_NA |
A boolean indicating if labels that do not show similarities with others should be shown in the output |
ignore_capitals |
A boolean indicating if capitalization should be included or excluded when calculating the edit distance between two strings |
filter_condition |
Condition that is used to extract a subset of the activity log prior to the application of the function |
Value
tbl_df providing an overview of similar labels for the indicated column
Examples
data("hospital_actlog")
detect_similar_labels(activitylog = hospital_actlog,
column_labels = "activity",
max_edit_distance = 3)
[Package daqapo version 0.3.2 Index]