Hyphen analysis for Elasticsearch
This plugin for Elasticsearch can handle hyphens and other symbols in words or at word boundaries for improved search precision.
An example in english is the word editor-in-chief
, which will be indexed as editor-in-chief
plus editor
, in
, chief
.
In other languages such as german, hyphenated words are much more common ("Bindestrichwörter").
This plugin also helps when analyzing acronyms or symbols in names because the hyphen characters is configurable by the symbols that connect parts of a word. Also, keywords may be optionally respected. So it is possible to index Programming in C++
differently from Programming in C
Documentation
For examples, see the documentation
Plugin |
Elasticsearch |
Release date |
5.1.1.0 |
5.1.1 |
Dec 23, 2016 |
Installation
5.x
./bin/elasticsearch-plugin install http://xbib.org/repository/org/xbib/elasticsearch/plugin/elasticsearch-analysis-hyphen/5.1.1.0/elasticsearch-analysis-hyphen-5.1.1.0-plugin.zip
Do not forget to restart the node after installing.
Issues
All feedback is welcome! If you find issues, please post them at Github
License
Hyphen Analysis Plugin for Elasticsearch
Copyright © 2016 Jörg Prante
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and lim