Chinese Text Segmentation


Documentation for package ‘jiebaR’ version 0.11

Help Pages

jiebaR-package    A package for Chinese text segmentation
<=.keywords       Keywords symbol
<=.qseg           Quick mode symbol
<=.segment        Text segmentation symbol
<=.simhash        Simhash symbol
<=.tagger         Tagger symbol
apply_list        Apply list input to a worker
DICTPATH          The path of the dictionary
distance          Hamming distance of words
edit_dict         Edit default user dictionary
filecoding        File encoding detection
file_coding       File encoding detection
filter_segment    Filter segmentation result
freq              The frequency of words
get_idf           Generate IDF dict
get_qsegmodel     Set quick mode model
get_tuple         Get tuple from the segmentation result
HMMPATH           The path of the dictionary
IDFPATH           The path of the dictionary
jiebaR            A package for Chinese text segmentation
keywords          Keyword extraction
new_user_word     Add user word
print.inv         Print worker settings
print.jieba       Print worker settings
print.keywords    Print worker settings
print.qseg        Print worker settings
print.simhash     Print worker settings
qseg              Quick mode symbol
reset_qsegmodel   Set quick mode model
segment           Chinese text segmentation function
set_qsegmodel     Set quick mode model
show_dictpath     Show default path of dictionaries
simhash           Simhash computation
simhash_dist      Compute Hamming distance of simhash values
simhash_dist_mat  Compute Hamming distance of simhash values
STOPPATH          The path of the dictionary
tagging           Part-of-speech tagging
tobin             Convert simhash value to binary
USERPATH          The path of the dictionary
vector_distance   Hamming distance of words
vector_keywords   Keyword extraction
vector_simhash    Simhash computation
vector_tag        Tag a character vector
worker            Initialize jiebaR worker
[.keywords        Keywords symbol
[.qseg            Quick mode symbol
[.segment         Text segmentation symbol
[.simhash         Simhash symbol
[.tagger          Tagger symbol