Gse v0.60.0 发布,增加 TF-IDF、停用词等功能
Go 语言高效分词, 支持英文、中文、日文等
该版本主要增加: 增加 "TF-IDF, Text rank, HMM Pos, 停用词, 清除和过滤词支持, Levenshtein" and more,
周边支持拼音和多音词, 以及 Levenshtein 等功能
增加更多示例、测试和 benchmark 代码以及修复 bug等
项目地址: https://github.com/go-ego/gse
Gse v0.60.0 is released, Go text segmentation and NLP; Add "TF-IDF, Text rank, HMM, Stop word, Trim/filter word, Levenshtein" and more
# v0.60.0, Green Lake
Add TF-IDF and text rank support
Add SegPos and HMM pos support
Add more HMM support, add SuggestFreq() support
Add Levenshtein and Pinyin support
Add stop word support
Add trim word support ( trim symbol and trim with pos... )
Add filter emoji, symbol and language support
Add PosStr and CutStr support
Add skip log and pos set support
Add multiple load dictionary support
Add more example and test code
Refactor test code
Optimize code and fixed bug
...