R: Evaluates the format and content of the input dataset

banff_dataset_evaluate {banffIT}

R Documentation

Evaluates the format and content of the input dataset

Description

This function evaluates the format and content of a dataset against the accepted format specified in the data dictionary. It applies a series of checks to ensure the dataset is ready to be processed by the add_diagnoses() function, which assigns diagnoses to each observation in the dataset. The function evaluates whether:

The input file is a dataset
All mandatory variables are present in the dataset
Missing values (NA) are present in variables where they are not allowed
Data types are correct
The combination of ID, center, and biopsy date is unique
There are duplicated variables in the dataset
Dates are valid
Content values follow the category values as specified in the data dictionary
Constraints specified in the data dictionary are respected
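
Several of the checks listed above can be approximated in base R. The following is a minimal sketch on a hypothetical toy data frame, not the implementation used by banff_dataset_evaluate(); the column names (patient_id, center, biopsy_date, score, adequacy) and the allowed score values are assumptions made for illustration and are not taken from the banffIT data dictionary.

```r
# Hypothetical toy dataset; the column names are illustrative assumptions,
# not the variable names required by the banffIT data dictionary.
toy <- data.frame(
  patient_id  = c("P1", "P1", "P2", "P3"),
  center      = c("A", "A", "A", "B"),
  biopsy_date = c("2020-01-15", "2020-01-15", "2020-02-30", "2021-06-01"),
  score       = c(0, 1, NA, 3),
  stringsAsFactors = FALSE
)

# Mandatory variables: list any expected column that is absent.
mandatory <- c("patient_id", "center", "biopsy_date", "score", "adequacy")
missing_vars <- setdiff(mandatory, names(toy))

# Missing values: count NA per column.
na_counts <- colSums(is.na(toy))

# Uniqueness: flag rows whose ID/center/date combination occurs more than once.
key <- toy[c("patient_id", "center", "biopsy_date")]
dup_rows <- which(duplicated(key) | duplicated(key, fromLast = TRUE))

# Valid dates: with an explicit format, as.Date() yields NA for impossible dates.
parsed <- as.Date(toy$biopsy_date, format = "%Y-%m-%d")
invalid_dates <- toy$biopsy_date[is.na(parsed)]

# Categories: non-missing values outside the (assumed) allowed set.
allowed_scores <- c(0, 1, 2, 3)
bad_scores <- toy$score[!is.na(toy$score) & !(toy$score %in% allowed_scores)]
```

Each result (missing_vars, na_counts, dup_rows, invalid_dates, bad_scores) isolates one kind of problem, which loosely mirrors the way the function reports its assessment as separate tables.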

Usage

banff_dataset_evaluate(banff_dataset)

Arguments

banff_dataset

A tibble object.

Value

A list of tibble objects giving information on the assessment of the dataset.
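
Because the return value is a list of tables, one way to get a quick overview is to count the rows of each element. The sketch below uses a hypothetical helper, count_report_rows(), which is not part of banffIT, and a stand-in list whose element names are invented; the actual names and contents of the returned list are not documented here.

```r
# Hypothetical helper (not part of banffIT): count the rows of each
# data frame in a list, returning NA for any element that is not one.
count_report_rows <- function(report) {
  vapply(
    report,
    function(x) if (is.data.frame(x)) nrow(x) else NA_integer_,
    integer(1)
  )
}

# Stand-in for a returned report; the element names here are invented.
example_report <- list(
  first_check  = data.frame(variable = character(0)),
  second_check = data.frame(variable = c("a", "b"))
)
row_counts <- count_report_rows(example_report)

# With banffIT installed, the same helper could be applied to the real output:
# report <- banff_dataset_evaluate(get_banff_template())
# count_report_rows(report)
```

Since tibbles are data frames, the helper should also work on a list of tibbles; names(report) can be used first to see which elements are present.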

Examples

{

library(banffIT)

# Load the template dataset provided by the package, then evaluate it
banff_dataset <- get_banff_template()
banff_dataset_evaluate(banff_dataset)

}

[Package banffIT version 1.0.0 Index]