get_tokens {discoverableresearch}R Documentation

Remove stopwords from text

Description

Removes stopwords from text in whichever language is specified.

Removes stop words from a text string (adapted from 'litsearchr' https://github.com/elizagrames/litsearchr/) and returns the remaining words as a vector of strings

Usage

get_tokens(text, language = "English")

get_tokens(text, language = "English")

Arguments

text

An input string

language

The language used to look up stop words (default is "English")

Value

Returns the input text with stopwords removed.

A vector of strings consisting of the non-stop words from the 'text' input

Examples

get_tokens("On the Origin of Species", language="English")
text <- "A methodology for systematic mapping in environmental sciences"
tokens <- get_tokens(text)
tokens;

[Package discoverableresearch version 0.0.1 Index]