unnest_sentences_ {lexRankr} | R Documentation |
Split a column of text into sentences
Description
Split a column of text into sentences
Usage
unnest_sentences_(tbl, output, input, doc_id = NULL,
output_id = "sent_id", drop = TRUE)
unnest_sentences(tbl, output, input, doc_id = NULL,
output_id = "sent_id", drop = TRUE)
Arguments
tbl |
dataframe containing column of text to be split into sentences |
output |
name of column to be created to store parsed sentences |
input |
name of input column of text to be parsed into sentences |
doc_id |
column of document ids; if not provided it will be assumed that each row is a different document |
output_id |
name of column to be created to store sentence ids |
drop |
whether original input column should get dropped |
Value
A data.frame of parsed sentences and sentence ids
Examples
df <- data.frame(doc_id = 1:3,
text = c("Testing the system. Second sentence for you.",
"System testing the tidy documents df.",
"Documents will be parsed and lexranked."),
stringsAsFactors=FALSE)
unnest_sentences(df, sents, text)
unnest_sentences_(df, "sents", "text")
## Not run:
library(magrittr)
df %>%
unnest_sentences(sents, text)
## End(Not run)
[Package lexRankr version 0.5.2 Index]