Comments (3)
index:
analysis:
filter:
pinyin_filter:
padding_char: ''
type: pinyin
first_letter: none
tokenizer:
my_pinyin:
type: pinyin
first_letter: prefix
padding_char: ' '
analyzer:
pinyin_standard_analyzer:
type: custom
filter: word_delimiter
tokenizer: my_pinyin
pinyin_ngram_analyzer:
type: custom
filter: lowercase
tokenizer: my_pinyin
my_pinyin_ik:
type: custom
filter: pinyin_filter
tokenizer: ik_smart
my_ik_smart:
type: custom
tokenizer: ik_smart
from elasticsearch-analysis-pinyin.
mapping 在需要ik和拼音搜索的字段,使用multi_field。例子如下:
{
"properties": {
"title": {
"type": "multi_field",
"fields": {
"pinyinszm": {
"type": "string",
"term_vector": "with_positions_offsets",
"analyzer": "pinyin_szm_analyzer",
"store": "no"
},
"title": {
"type": "string",
"analyzer": "ik_smart",
"store": "no"
}
,
"pinyin": {
"type": "string",
"term_vector": "with_positions_offsets",
"analyzer": "pinyin_analyzer",
"store":"no"
}
}
}
}
}
from elasticsearch-analysis-pinyin.
5.x
PUT http://127.0.0.1:9200/test/
{
"settings": {
"analysis" : {
"analyzer" : {
"pinyin_analyzer" : {
"tokenizer" : "my_pinyin"
},
"ik_smart_pinyin":{
"type":"custom",
"tokenizer":"ik_max_word",
"filter":["single_pinyin"]
}
},
"tokenizer" : {
"my_pinyin" : {
"type" : "pinyin",
"keep_first_letter": true,
"keep_separate_first_letter" : true,
"keep_full_pinyin" : true,
"keep_joined_full_pinyin": true,
"keep_original" : true,
"limit_first_letter_length" : 16,
"lowercase" : true
}
},
"filter": {
"single_pinyin" :{
"type": "pinyin",
"keep_first_letter": true,
"keep_separate_first_letter" : true,
"keep_full_pinyin": true,
"keep_joined_full_pinyin": true,
"limit_first_letter_length" : 16,
"keep_original" : true,
"lowercase" : true
}
}
}
}
}
from elasticsearch-analysis-pinyin.
Related Issues (20)
- 关于zh,ch,sh无法查询到相关的词语
- 关于示例中name.pinyin搜索能直接搜中文英文 HOT 1
- 没有高亮
- No installable zip in release assets for v8.4.2 and v8.4.3 HOT 1
- es 8.5X版本无法建立mapping HOT 1
- v6.8.20 源码和jar包对不上
- es7.17.0 使用7.17.0版本依然报错startOffset HOT 3
- 求助,使用match_phrase搜索不到结果 HOT 1
- 中文首字符携带数字排序不理解大小
- 严重BUG:当分词内容中包含单独的A字母时,这个A字母会被分词器扔掉 HOT 1
- 如何解决同音字的问题 HOT 2
- 没有7.17.10版本吗? HOT 1
- elasticsearch8.7.0可以使用7.x版本的拼音吗 HOT 3
- 怎么在分词后保留"c++软件工程师"中“+”号在结果中,为什么拼音分词器会过滤掉符号呢
- 中英文混合时能否也支持下提取英文单词首字母
- 拼音首字母查询问题,当第二个字的拼音首字母为第一个字的韵母时查询不到结果 HOT 1
- 构建了 8.10.2 ,8.10.3,8.10.4,7.17.14供使用 HOT 6
- 求ES 8.12版本的插件 HOT 2
- ES8.9.2,release版本没有编译好的jar包 HOT 1
- 希望提供8.13.2、7.17.19版本插件 HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from elasticsearch-analysis-pinyin.