preproc {PreProcessRecordLinkage}R Documentation

Preprocessing and Unification of Variable Names of Two Input Data Sets

Description

In this function data preprocessing has been meticulously executed to cover a wide range of datasets, ensuring that variable names are standardized using synonyms.

Usage

preproc(d1, d2)
## S3 method for class 'explain'
print(x,...)

Arguments

d1

A data frame.

d2

A data frame.

x

an object of class 'eplain'.

...

further arguments passed to preproc function.

Details

Because we want users to be able to change their names. The output of this function gives the names and classes that have changed in the new version and the previous version, as well as the number of changes in both datasets. Returns the corresponding number for the chz argument in the chzInput function.

Value

preproc an object of class 'explain'.

An object of class 'explain' is a list containing the following components:

Changed variable's names

Character.

Changed variable's classes

Character.

Initial variable's names

Character.

Initial variable's classes

Character.

A number of changed variable values for the first dataset are

Data frame.

A number of changed variable values for the second dataset

Data frame.

Number of changed variable's names

Vector.

Note

This function has a comprehensible output if changes have been made on the names of the variables for equalization, otherwise it has no specific output and everything is zero.

In addition, it should be noted that the names of the variables of the second data set are matched and the necessary changes are made based on the first data set.

Author(s)

Hossein Hassani and and Leila Marvian Mashhad.

See Also

chzInput

Examples

  d1 = RLdata500
  d2 = RLdata10000
  preproc(d1, d2)

[Package PreProcessRecordLinkage version 1.0.1 Index]