bdparData {bdpar}R Documentation

Example of the content of the files to be preprocessed.

Description

A manually collected data set containing e-mails and SMS messages from the nutritional and health domain classified as spam and non-spam (with a ratio of 50%). In addition the dataset contains two variables: (i) path which indicates the location of the target file and, (ii) source which contains the raw text comprising each file.

Usage

data(bdparData)

Format

A data frame with 20 rows and 2 variables:

path

File path.

source

File content.


[Package bdpar version 3.1.0 Index]