jiebaR-package |
A package for Chinese text segmentation |
<=.keywords |
Keywords symbol |
<=.qseg |
Quick mode symbol |
<=.segment |
Text segmentation symbol |
<=.simhash |
Simhash symbol |
<=.tagger |
Tagger symbol |
apply_list |
Apply list input to a worker |
DICTPATH |
The path of dictionary |
distance |
Hamming distance of words |
edit_dict |
Edit default user dictionary |
filecoding |
Files encoding detection |
file_coding |
Files encoding detection |
filter_segment |
Filter segmentation result |
freq |
The frequency of words |
get_idf |
generate IDF dict |
get_qsegmodel |
Set quick mode model |
get_tuple |
get tuple from the segmentation result |
HMMPATH |
The path of dictionary |
IDFPATH |
The path of dictionary |
jiebaR |
A package for Chinese text segmentation |
keywords |
Keyword extraction |
new_user_word |
Add user word |
print.inv |
Print worker settings |
print.jieba |
Print worker settings |
print.keywords |
Print worker settings |
print.qseg |
Print worker settings |
print.simhash |
Print worker settings |
qseg |
Quick mode symbol |
reset_qsegmodel |
Set quick mode model |
segment |
Chinese text segmentation function |
set_qsegmodel |
Set quick mode model |
show_dictpath |
Show default path of dictionaries |
simhash |
Simhash computation |
simhash_dist |
Compute Hamming distance of Simhash value |
simhash_dist_mat |
Compute Hamming distance of Simhash value |
STOPPATH |
The path of dictionary |
tagging |
Speech Tagging |
tobin |
simhash value to binary |
USERPATH |
The path of dictionary |
vector_distance |
Hamming distance of words |
vector_keywords |
Keyword extraction |
vector_simhash |
Simhash computation |
vector_tag |
Tag the a character vector |
worker |
Initialize jiebaR worker |
[.keywords |
Keywords symbol |
[.qseg |
Quick mode symbol |
[.segment |
Text segmentation symbol |
[.simhash |
Simhash symbol |
[.tagger |
Tagger symbol |