elasticsearch-analysis-worddelimiter2

The WordDelimiterFilter2 analysis plugin for ElasticSearch provides an extension of the mainstream WordDelimiterFilter.

License	The Apache Software License, Version 2.0
Categories	Search Business Logic Libraries Elasticsearch
GroupId	com.yakaz.elasticsearch.plugins
ArtifactId	elasticsearch-analysis-worddelimiter2
Last Version	1.2.0
Release Date	Feb 14, 2014
Type	jar
Description	The WordDelimiterFilter2 analysis plugin for ElasticSearch provides an extension of the mainstream WordDelimiterFilter.
Project URL	http://github.com/yakaz/elasticsearch-analysis-worddelimiter2
Source Code Management	https://github.com/yakaz/elasticsearch-analysis-worddelimiter2.git

Download elasticsearch-analysis-worddelimiter2

Filename	Size
elasticsearch-analysis-worddelimiter2-1.2.0.pom
elasticsearch-analysis-worddelimiter2-1.2.0.zip	12 KB
elasticsearch-analysis-worddelimiter2-1.2.0-sources.jar	12 KB
elasticsearch-analysis-worddelimiter2-1.2.0-javadoc.jar	64 KB

How to add to project

Apache Maven

<!-- https://jarcasting.com/artifacts/com.yakaz.elasticsearch.plugins/elasticsearch-analysis-worddelimiter2/ -->
<dependency>
    <groupId>com.yakaz.elasticsearch.plugins</groupId>
    <artifactId>elasticsearch-analysis-worddelimiter2</artifactId>
    <version>1.2.0</version>
</dependency>

Gradle Groovy

// https://jarcasting.com/artifacts/com.yakaz.elasticsearch.plugins/elasticsearch-analysis-worddelimiter2/
implementation 'com.yakaz.elasticsearch.plugins:elasticsearch-analysis-worddelimiter2:1.2.0'

Gradle Kotlin

// https://jarcasting.com/artifacts/com.yakaz.elasticsearch.plugins/elasticsearch-analysis-worddelimiter2/
implementation ("com.yakaz.elasticsearch.plugins:elasticsearch-analysis-worddelimiter2:1.2.0")

Apache Buildr

'com.yakaz.elasticsearch.plugins:elasticsearch-analysis-worddelimiter2:jar:1.2.0'

Apache Ivy

<dependency org="com.yakaz.elasticsearch.plugins" name="elasticsearch-analysis-worddelimiter2" rev="1.2.0">
  <artifact name="elasticsearch-analysis-worddelimiter2" type="jar" />
</dependency>

Groovy Grape

@Grapes(
@Grab(group='com.yakaz.elasticsearch.plugins', module='elasticsearch-analysis-worddelimiter2', version='1.2.0')
)

Scala SBT

libraryDependencies += "com.yakaz.elasticsearch.plugins" % "elasticsearch-analysis-worddelimiter2" % "1.2.0"

Leiningen

[com.yakaz.elasticsearch.plugins/elasticsearch-analysis-worddelimiter2 "1.2.0"]

Dependencies

compile (1)

Group / Artifact	Type	Version
org.elasticsearch : elasticsearch	jar	1.0.0.RC1

test (4)

Group / Artifact	Type	Version
org.hamcrest : hamcrest-core	jar	1.3
org.apache.lucene : lucene-test-framework	jar	4.6.0
org.testng : testng	jar	6.8
log4j : log4j	jar	1.2.17

Project Modules

There are no modules declared in this project.

Elasticsearch Word Delimiter 2 Filter

The WordDelimiterFilter2 analysis plugin provides an extension of the mainstream WordDelimiterFilter.

Installation

From the root of your ElasticSearch v0.20.2+ installation, run:

bin/plugin -install com.yakaz.elasticsearch.plugins/elasticsearch-analysis-worddelimiter2/1.2.0

This downloads the plugin from the Maven Central Repository.

For older versions of ElasticSearch, you can still use the longer form:

bin/plugin -url http://oss.sonatype.org/content/repositories/releases/com/yakaz/elasticsearch/plugins/elasticsearch-analysis-worddelimiter2/1.0.1/elasticsearch-analysis-worddelimiter2-1.0.1.zip install elasticsearch-analysis-worddelimiter2

To declare this plugin as a dependency, add the following to your pom.xml:

<dependency>
    <groupId>com.yakaz.elasticsearch.plugins</groupId>
    <artifactId>elasticsearch-analysis-worddelimiter2</artifactId>
    <version>1.2.0</version>
</dependency>

Version matrix:

┌─────────────────────────────────┬──────────────────────┐
│ WordDelimiter 2 Analysis Plugin │ ElasticSearch        │
├─────────────────────────────────┼──────────────────────┤
│ 1.2.x                           │ 1.0.0.RC1 ─► (1.4.3) │
├─────────────────────────────────┼──────────────────────┤
│ 1.1.x                           │ 0.90 ─► (0.90.11)    │
├─────────────────────────────────┼──────────────────────┤
│ 1.0.x                           │ 0.19 ─► 0.20         │
└─────────────────────────────────┴──────────────────────┘

Description

Lucene 4's WordDelimiterFilter has been exposed in ElasticSearch since v0.17.0. This plugin provides an extension of that filter, packaged as a plugin for ElasticSearch 0.19.0+.

See Lucene WordDelimiterFilter JavaDoc for more information about the base functionality.

Added features

Currently there is a single added feature:

All parts at same position

When the filter splits an input token, it generates additional tokens, and each of them usually takes a new position of its own. If you ask it to preserve the original token and to catenate some parts, you may get identical tokens several times, at different positions. The catenated tokens are added at the final position, so several tokens end up sharing that position. This behavior is somewhat confusing.

This feature makes it possible to output all tokens at the same position. For example, "turn wi-fi on" with catenate words and preserve original will no longer yield 0:turn 1:wi-fi 1:wi 2:fi 2:wifi 3:on, but will yield 0:turn 1:wi-fi 1:wi 1:fi 1:wifi 2:on.

This is particularly useful when merging with other analyses using the Combo Analyzer, as it prevents position jitter.

Always be aware of the impact of term positions on your queries.
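
The position numbers quoted above can be reproduced with a small toy model. This is only a sketch of the position bookkeeping, not the plugin's or Lucene's actual code: the function names are hypothetical, the split parts are supplied by hand rather than computed, and preserve original plus catenation are assumed to be always on.

```python
def assign_positions(tokens, all_parts_at_same_position=False):
    """Toy model of token positions (hypothetical helper, not the plugin's API).

    tokens: list of (original, parts) pairs, where parts lists the sub-words
    the filter would produce (empty when the token is not split).
    """
    out = []
    pos = 0
    for original, parts in tokens:
        if len(parts) < 2:
            # Token is not split: it simply takes the next position.
            out.append((pos, original))
            pos += 1
            continue
        out.append((pos, original))  # preserved original token
        last = pos
        for i, part in enumerate(parts):
            # Default: each part after the first takes a new position.
            last = pos if all_parts_at_same_position else pos + i
            out.append((last, part))
        out.append((last, "".join(parts)))  # catenated token at the final position
        pos = last + 1
    return out


def format_positions(positioned):
    return " ".join(f"{p}:{t}" for p, t in positioned)


tokens = [("turn", []), ("wi-fi", ["wi", "fi"]), ("on", [])]
print(format_positions(assign_positions(tokens)))
# 0:turn 1:wi-fi 1:wi 2:fi 2:wifi 3:on
print(format_positions(assign_positions(tokens, all_parts_at_same_position=True)))
# 0:turn 1:wi-fi 1:wi 1:fi 1:wifi 2:on
```

With the option enabled, "on" directly follows the whole "wi-fi" group, which is what keeps positions aligned when several analyses are merged.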

Configuration

The plugin provides you with the word_delimiter_2 token filter type. It accepts the same list of parameters as the word_delimiter token filter, plus:

all_parts_at_same_position: false by default.
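
As a rough illustration, index settings using this filter type might look like the sketch below, built here as a Python dictionary and printed as JSON. The filter name my_word_delimiter_2 and the analyzer name my_analyzer are hypothetical, and the choice of the whitespace tokenizer is an assumption; preserve_original and catenate_words are parameters of the standard word_delimiter filter.

```python
import json

# Hypothetical index settings: only "word_delimiter_2" and
# "all_parts_at_same_position" come from this plugin's documentation.
settings = {
    "settings": {
        "analysis": {
            "filter": {
                "my_word_delimiter_2": {  # hypothetical filter name
                    "type": "word_delimiter_2",
                    "preserve_original": True,
                    "catenate_words": True,
                    "all_parts_at_same_position": True,
                }
            },
            "analyzer": {
                "my_analyzer": {  # hypothetical analyzer name
                    "type": "custom",
                    "tokenizer": "whitespace",  # assumed; keeps "wi-fi" as one token
                    "filter": ["my_word_delimiter_2"],
                }
            },
        }
    }
}

# The resulting JSON could be sent as the body of an index creation request.
body = json.dumps(settings, indent=2)
print(body)
```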

Versions

Version
1.2.0 Feb 14, 2014
1.1.0 Feb 28, 2013
1.0.1 Feb 1, 2013
1.0.0 Jan 29, 2013
