stopwordsISO {morestopwords} | R Documentation |
Combined stop words for all languages
Description
A list of stop words in each of the supported languages
Usage
stopwordsISO
Format
An object of class list
of length 65.
Details
Note: All Unicode characters are escaped. To un-escape them, consider using:
library(AllStopwords) if(!requireNamespace('stringi')){ install.packages('stringi') } data('stopwordsISO') stopwords_unescaped <- lapply(stopwordsISO, stringi::stri_unescape_unicode)
Author(s)
Each stop-word list's Authors
Source
All unique stopwords in the following databases:
the StopwordISO repository;
python's Natural Language Toolkit (nltk);
the Snowball stop-word list;
the R package quanteda;
the marimo repository;
the Perseus project; and
Aurélien Berra's list of stop words for Ancient Greek and Latin (doi:10.5281/zenodo.3860343).
[Package morestopwords version 0.2.0 Index]