bbc_articles_full {ascentTraining}R Documentation

Full BBC Articles data

Description

Full BBC Articles data

Usage

bbc_articles_full

Format

A tibble, with 927 observations of separate documents and their contents. This results in two columns.

words

The words from a given article

document

The 'document' (article) ID

Details

A collection of business and politics BBC news articles. Each row represents each article (document), with a document ID and a string of the text content with stop words removed. This is a 'dirty' version of the bbc_articles dataset, where we now have a string of text for each observation, as opposed to a single word.

Source


[Package ascentTraining version 1.0.0 Index]