bdparData {bdpar}    R Documentation
Example of the content of the files to be preprocessed.
Description
A manually collected data set of e-mails and SMS messages from the nutrition and health domain, classified as spam or non-spam in equal proportion (50% each). The data set has two variables: (i) path, the location of the target file, and (ii) source, the raw text of that file.
Usage
data(bdparData)
Format
A data frame with 20 rows and 2 variables:
- path: File path.
- source: File content.
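Examples
The following is a minimal sketch of how the data set might be loaded and inspected. It assumes the bdpar package is installed; the guard with requireNamespace() and the 80-character preview length are illustrative choices, not part of the package documentation.

```r
# Sketch: load the example data set and look at its two variables.
# Assumption: the bdpar package is installed; otherwise nothing runs.
if (requireNamespace("bdpar", quietly = TRUE)) {
  # Load the data frame into the current environment
  data(bdparData, package = "bdpar")

  # Overall structure: variable names and types
  str(bdparData)

  # Number of rows and columns
  print(dim(bdparData))

  # Location of the first target file
  print(bdparData$path[1])

  # First 80 characters of the raw text of that file
  # (substr() coerces its input to character if needed)
  print(substr(bdparData$source[1], 1, 80))
}
```

Passing package = "bdpar" to data() avoids attaching the whole package when only the example data is needed.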
[Package bdpar version 3.1.0 Index]