Chinese Text Segmentation

Documentation for package ‘jiebaR’ version 0.11

Help Pages

jiebaR-package	A package for Chinese text segmentation
<=.keywords	Keywords symbol
<=.qseg	Quick mode symbol
<=.segment	Text segmentation symbol
<=.simhash	Simhash symbol
<=.tagger	Tagger symbol
apply_list	Apply list input to a worker
DICTPATH	The path of the dictionary
distance	Hamming distance of words
edit_dict	Edit default user dictionary
filecoding	File encoding detection
file_coding	File encoding detection
filter_segment	Filter segmentation result
freq	The frequency of words
get_idf	Generate IDF dictionary
get_qsegmodel	Set quick mode model
get_tuple	Get tuples from the segmentation result
HMMPATH	The path of the dictionary
IDFPATH	The path of the dictionary
jiebaR	A package for Chinese text segmentation
keywords	Keyword extraction
new_user_word	Add user word
print.inv	Print worker settings
print.jieba	Print worker settings
print.keywords	Print worker settings
print.qseg	Print worker settings
print.simhash	Print worker settings
qseg	Quick mode symbol
reset_qsegmodel	Set quick mode model
segment	Chinese text segmentation function
set_qsegmodel	Set quick mode model
show_dictpath	Show default path of dictionaries
simhash	Simhash computation
simhash_dist	Compute Hamming distance of Simhash value
simhash_dist_mat	Compute Hamming distance of Simhash value
STOPPATH	The path of the dictionary
tagging	Part-of-speech tagging
tobin	Convert Simhash value to binary
USERPATH	The path of the dictionary
vector_distance	Hamming distance of words
vector_keywords	Keyword extraction
vector_simhash	Simhash computation
vector_tag	Tag a character vector
worker	Initialize jiebaR worker
[.keywords	Keywords symbol
[.qseg	Quick mode symbol
[.segment	Text segmentation symbol
[.simhash	Simhash symbol
[.tagger	Tagger symbol