| Maxent_Sent_Token_Annotator {openNLP} | R Documentation | 
Apache OpenNLP based sentence token annotators
Description
Generate an annotator which computes sentence annotations using the Apache OpenNLP Maxent sentence detector.
Usage
Maxent_Sent_Token_Annotator(language = "en", probs = FALSE, model = NULL)
Arguments
language | 
 a character string giving the ISO-639 code of the language being processed by the annotator.  | 
probs | 
 a logical indicating whether the computed annotations should provide the token probabilities obtained from the Maxent model as their ‘prob’ feature.  | 
model | 
 a character string giving the path to the Maxent model file to be
used, or   | 
Details
See http://opennlp.sourceforge.net/models-1.5/ for available model files. For languages other than English, these can conveniently be made available to R by installing the respective openNLPmodels.language package from the repository at https://datacube.wu.ac.at. For English, no additional installation is required.
Value
An Annotator object giving the generated sentence
token annotator.
See Also
https://opennlp.apache.org for more information about Apache OpenNLP.
Examples
require("NLP")
## Some text.
s <- paste(c("Pierre Vinken, 61 years old, will join the board as a ",
             "nonexecutive director Nov. 29.\n",
             "Mr. Vinken is chairman of Elsevier N.V., ",
             "the Dutch publishing group."),
           collapse = "")
s <- as.String(s)
sent_token_annotator <- Maxent_Sent_Token_Annotator()
sent_token_annotator
a1 <- annotate(s, sent_token_annotator)
a1
## Extract sentences.
s[a1]
## Variant with sentence probabilities as features.
annotate(s, Maxent_Sent_Token_Annotator(probs = TRUE))