骚年 was tokenized as the single character 骚; we need it to be tokenized as the whole word 骚年.
{
  "tokens": [
    {
      "token": "骚",
      "start_offset": 0,
      "end_offset": 1,
      "type": "CN_CHAR",
      "position": 0
    },
    {
      "token": "年在",
      "start_offset": 1,
      "end_offset": 3,
      "type": "CN_WORD",
      "position": 1
    },
    {
      "token": "开源",
      "start_offset": 3,
      "end_offset": 5,
      "type": "CN_WORD",
      "position": 2
    },
    {
      "token": "中国学",
      "start_offset": 5,
      "end_offset": 8,
      "type": "CN_WORD",
      "position": 3
    },
    {
      "token": "中国",
      "start_offset": 5,
      "end_offset": 7,
      "type": "CN_WORD",
      "position": 4
    },
    {
      "token": "国学",
      "start_offset": 6,
      "end_offset": 8,
      "type": "CN_WORD",
      "position": 5
    },
    {
      "token": "学习",
      "start_offset": 7,
      "end_offset": 9,
      "type": "CN_WORD",
      "position": 6
    }
  ]
}
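The output above has the shape of an Elasticsearch `_analyze` response. As a rough sketch, it could be reproduced with a request like the one built below. Several things here are assumptions, not taken from the original note: the node address `http://localhost:9200`, the analyzer name `ik_max_word` (guessed from the overlapping tokens), and the helper names `build_analyze_request` and `analyze`. The input sentence 骚年在开源中国学习 is reconstructed from the token offsets.

```python
import json
import urllib.request

ES_URL = "http://localhost:9200"  # assumption: a local Elasticsearch node


def build_analyze_request(text, analyzer="ik_max_word", base_url=ES_URL):
    """Build (but do not send) a POST request to the _analyze endpoint."""
    body = json.dumps({"analyzer": analyzer, "text": text}, ensure_ascii=False)
    return urllib.request.Request(
        url=f"{base_url}/_analyze",
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def analyze(text, analyzer="ik_max_word", base_url=ES_URL):
    """Send the request and return the parsed JSON response as a dict."""
    request = build_analyze_request(text, analyzer, base_url)
    with urllib.request.urlopen(request) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With a node running and the IK plugin installed, `analyze("骚年在开源中国学习")` should return a dict with a `tokens` list like the one shown above.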
Create a file in the same directory. This file is the custom dictionary:
custom.dic
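An IK dictionary file is plain UTF-8 text with one word per line. A minimal sketch for generating custom.dic follows; the helper name `write_custom_dict` and the directory passed to it are hypothetical, and the two words are the ones this note wants recognized. The original note does not show the configuration step, but the file usually also has to be listed under the `ext_dict` entry of the plugin's `IKAnalyzer.cfg.xml` before IK will load it.

```python
from pathlib import Path


def write_custom_dict(directory, words, filename="custom.dic"):
    """Write one word per line, UTF-8 encoded, and return the file path."""
    path = Path(directory) / filename
    # Drop blank entries and duplicates while keeping the original order.
    unique = list(dict.fromkeys(w.strip() for w in words if w.strip()))
    path.write_text("\n".join(unique) + "\n", encoding="utf-8")
    return path
```

For example, `write_custom_dict(ik_config_dir, ["骚年", "开源中国"])` writes the two words, where `ik_config_dir` stands for whichever directory the dictionary is meant to live in.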
Restart Elasticsearch.
骚年 and 开源中国 are now recognized as tokens:
{
  "tokens": [
    {
      "token": "骚年",
      "start_offset": 0,
      "end_offset": 2,
      "type": "CN_WORD",
      "position": 0
    },
    {
      "token": "年在",
      "start_offset": 1,
      "end_offset": 3,
      "type": "CN_WORD",
      "position": 1
    },
    {
      "token": "开源中国",
      "start_offset": 3,
      "end_offset": 7,
      "type": "CN_WORD",
      "position": 2
    },
    {
      "token": "开源",
      "start_offset": 3,
      "end_offset": 5,
      "type": "CN_WORD",
      "position": 3
    },
    {
      "token": "中国学",
      "start_offset": 5,
      "end_offset": 8,
      "type": "CN_WORD",
      "position": 4
    },
    {
      "token": "中国",
      "start_offset": 5,
      "end_offset": 7,
      "type": "CN_WORD",
      "position": 5
    },
    {
      "token": "国学",
      "start_offset": 6,
      "end_offset": 8,
      "type": "CN_WORD",
      "position": 6
    },
    {
      "token": "学习",
      "start_offset": 7,
      "end_offset": 9,
      "type": "CN_WORD",
      "position": 7
    }
  ]
}
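A quick way to confirm that the dictionary took effect is to check that the expected words appear among the returned tokens. The sketch below works on a parsed `_analyze` response; the helper name `missing_words` is hypothetical, and the two sample responses are abbreviated from the outputs in this note.

```python
def missing_words(analyze_response, expected_words):
    """Return the expected words that do not appear as tokens."""
    tokens = {t["token"] for t in analyze_response.get("tokens", [])}
    return [w for w in expected_words if w not in tokens]


# Abbreviated copies of the two responses in this note (token text only).
before = {"tokens": [{"token": "骚"}, {"token": "年在"}, {"token": "开源"}]}
after = {"tokens": [{"token": "骚年"}, {"token": "年在"}, {"token": "开源中国"}]}

print(missing_words(before, ["骚年", "开源中国"]))  # both words still missing
print(missing_words(after, ["骚年", "开源中国"]))   # nothing missing
```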
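Open-source notes
These notes were taken while studying, so there was not much time to write in detail or tidy up the layout. They are written as key-frame screenshots with short key-frame captions.
Pull requests are welcome for collaborating on these open-source notes.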
Open-source videos
OSrcD's personal space on Bilibili
Open-source blog
All posts - OpenDevel's personal space - OSCHINA
Open-source projects
Open-source donations
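Do not trust any contact information shown in the images.
The images are screenshots from a video that a third party had already watermarked before the author obtained it, and there was no time to edit them.
Thanks.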
Source: oschina
Link: https://my.oschina.net/u/4675154/blog/4879648