preproc {PreProcessRecordLinkage} | R Documentation |
Preprocessing and Unification of Variable Names of Two Input Data Sets
Description
preproc preprocesses two input data sets and standardizes their variable names using synonyms, so that corresponding variables carry the same name in both data sets. It is intended to work across a wide range of data sets.
Usage
preproc(d1, d2)
## S3 method for class 'explain'
print(x,...)
Arguments
d1 |
A data frame. |
d2 |
A data frame. |
x |
An object of class 'explain'. |
... |
further arguments passed to preproc function. |
Details
So that users can see how their variable names were changed, the output lists the names and classes that changed, in both the new and the previous version, together with the number of changes in both data sets. It also returns the corresponding number for the chz argument of the chzInput function.
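The idea of unifying variable names through synonyms can be illustrated with a minimal base R sketch. It is not the package's implementation: the synonym table, the helper canonical_name, and the function unify_names_sketch are hypothetical names introduced here, and the returned components are simplified stand-ins for the ones documented under Value. As in preproc, the first data set is treated as the reference.

```r
# Hypothetical synonym table: each canonical name maps to accepted variants.
# These entries are illustrative assumptions, not the package's dictionary.
synonyms <- list(
  fname = c("fname", "first_name", "firstname", "given_name"),
  lname = c("lname", "last_name", "surname", "family_name"),
  by    = c("by", "birth_year", "yob")
)

# Map one column name to its canonical form, or return it unchanged.
canonical_name <- function(name, synonyms) {
  key <- tolower(name)
  for (canon in names(synonyms)) {
    if (key %in% synonyms[[canon]]) return(canon)
  }
  name
}

# Rename the columns of d2 so that they agree with d1 wherever both
# resolve to the same canonical name; d1 is the reference data set.
unify_names_sketch <- function(d1, d2, synonyms) {
  canon1 <- vapply(names(d1), canonical_name, character(1), synonyms = synonyms)
  canon2 <- vapply(names(d2), canonical_name, character(1), synonyms = synonyms)
  old <- names(d2)
  idx <- match(canon2, canon1)               # matching d1 column, or NA
  new <- ifelse(is.na(idx), old, names(d1)[idx])
  names(d2) <- new
  list(d2 = d2,
       initial_names = old[old != new],
       changed_names = new[old != new],
       n_changed = sum(old != new))
}

d1 <- data.frame(fname = "ANNA", lname = "MEYER", by = 1970)
d2 <- data.frame(first_name = "ANNA", surname = "MEYER", yob = 1970)
res <- unify_names_sketch(d1, d2, synonyms)
res$initial_names
res$changed_names
res$n_changed
```

When the two data sets already share their names, the sketch leaves d2 untouched and n_changed is zero, which mirrors the behaviour described in the Note.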
Value
preproc returns an object of class 'explain'.
An object of class 'explain' is a list containing the following components:
Changed variable's names |
Character. |
Changed variable's classes |
Character. |
Initial variable's names |
Character. |
Initial variable's classes |
Character. |
A number of changed variable values for the first dataset are |
Data frame. |
A number of changed variable values for the second dataset |
Data frame. |
Number of changed variable's names |
Vector. |
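How a classed list and its print method fit together can be sketched in a few lines of base R. This is only an illustration of the S3 mechanism: the class name explain_sketch, the constructor new_explain_sketch, and the component names below are hypothetical and do not reproduce the structure of the real 'explain' object.

```r
# Hypothetical constructor: bundles summary vectors into an S3 object.
# Class and component names are illustrative, not those of the package.
new_explain_sketch <- function(initial_names, changed_names) {
  structure(
    list(initial_names = initial_names,
         changed_names = changed_names,
         n_changed = length(changed_names)),
    class = "explain_sketch"
  )
}

# print method found by S3 dispatch on the class attribute,
# analogous in spirit to a print method for class 'explain'.
print.explain_sketch <- function(x, ...) {
  cat("Number of changed variable names:", x$n_changed, "\n")
  if (x$n_changed > 0) {
    print(data.frame(initial = x$initial_names, changed = x$changed_names), ...)
  }
  invisible(x)
}

obj <- new_explain_sketch(c("first_name", "surname"), c("fname", "lname"))
print(obj)
```

Because the object is still an ordinary list, individual components remain accessible with `$` or `[[`, while printing goes through the class-specific method.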
Note
The output is informative only if variable names were changed in order to unify them; otherwise there is nothing specific to report and all counts are zero.
Note also that the first data set serves as the reference: the variable names of the second data set are matched against it and changed where necessary.
Author(s)
Hossein Hassani and Leila Marvian Mashhad.
See Also
Examples
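# RLdata500 and RLdata10000 are example data sets from the RecordLinkage package
data("RLdata500", package = "RecordLinkage")
data("RLdata10000", package = "RecordLinkage")
d1 = RLdata500
d2 = RLdata10000
preproc(d1, d2)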