All Classes and Interfaces (Lucene 9.11.0 kuromoji API)

Class

Description

BaseFormAttribute

Attribute for Token.getBaseForm().

BaseFormAttributeImpl

Attribute for Token.getBaseForm().

BinaryDictionary

Base class for a binary-encoded in-memory dictionary.

BinaryDictionary.ResourceScheme

Deprecated, for removal: This API element is subject to removal in a future version.

CharacterDefinition

Character category data.

CharSequenceUtils

Utility functions for JapaneseCompletionFilter.

ConnectionCosts

N-gram connection cost data.

Dictionary

Dictionary interface for retrieving morphological data by id.

DictionaryBuilder

Tool to build dictionaries.

DictionaryBuilder.DictionaryFormat

Format of the dictionary.

GraphvizFormatter

Outputs the DOT (Graphviz) string for the Viterbi lattice.

InflectionAttribute

Attribute for Kuromoji inflection data.

InflectionAttributeImpl

Attribute for Kuromoji inflection data.

JapaneseAnalyzer

Analyzer for Japanese that uses morphological analysis.

JapaneseBaseFormFilter

Replaces term text with the BaseFormAttribute.

JapaneseBaseFormFilterFactory

Factory for JapaneseBaseFormFilter.

JapaneseCompletionAnalyzer

Analyzer for Japanese completion suggester.

JapaneseCompletionFilter

A TokenFilter that adds Japanese romanized tokens to the term attribute.

JapaneseCompletionFilter.Mode

Completion mode.

JapaneseCompletionFilterFactory

Factory for JapaneseCompletionFilter.

JapaneseHiraganaUppercaseFilter

A TokenFilter that normalizes small letters (捨て仮名) in hiragana into normal letters.

JapaneseHiraganaUppercaseFilterFactory

Factory for JapaneseHiraganaUppercaseFilter.

JapaneseIterationMarkCharFilter

Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form.

JapaneseIterationMarkCharFilterFactory

Factory for JapaneseIterationMarkCharFilter.

JapaneseKatakanaStemFilter

A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).

JapaneseKatakanaStemFilterFactory

Factory for JapaneseKatakanaStemFilter.

JapaneseKatakanaUppercaseFilter

A TokenFilter that normalizes small letters (捨て仮名) in katakana into normal letters.

JapaneseKatakanaUppercaseFilterFactory

Factory for JapaneseKatakanaUppercaseFilter.

JapaneseNumberFilter

A TokenFilter that normalizes Japanese numbers (kansūji) to regular Arabic decimal numbers in half-width characters.

JapaneseNumberFilter.NumberBuffer

Buffer that holds a Japanese number string and a position index used as a parsed-to marker.

JapaneseNumberFilterFactory

Factory for JapaneseNumberFilter.

JapanesePartOfSpeechStopFilter

Removes tokens that match a set of part-of-speech tags.

JapanesePartOfSpeechStopFilterFactory

Factory for JapanesePartOfSpeechStopFilter.

JapaneseReadingFormFilter

A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form.

JapaneseReadingFormFilterFactory

Factory for JapaneseReadingFormFilter.

JapaneseTokenizer

Tokenizer for Japanese that uses morphological analysis.

JapaneseTokenizer.Mode

Tokenization mode: this determines how the tokenizer handles compound and unknown words.

JapaneseTokenizer.Type

Token type reflecting the original source of this token.

JapaneseTokenizerFactory

Factory for JapaneseTokenizer.

KatakanaRomanizer

Converts a Katakana string to Romaji using the pre-defined Katakana-Romaji mapping rules.

PartOfSpeechAttribute

Attribute for Token.getPartOfSpeech().

PartOfSpeechAttributeImpl

Attribute for Token.getPartOfSpeech().

ReadingAttribute

Attribute for Kuromoji reading data.

ReadingAttributeImpl

Attribute for Kuromoji reading data.

Token

Analyzed token with morphological data from its dictionary.

TokenInfoDictionary

Binary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs.

TokenInfoFST

Thin wrapper around an FST with root-arc caching for Japanese.

ToStringUtil

Utility class for English translations of morphological data, used only for debugging.

UnknownDictionary

Dictionary for unknown-word handling.

UserDictionary

Class for building a User Dictionary.
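
Example (illustrative sketch)

The JapaneseKatakanaStemFilter entry above describes removing a trailing long sound character (U+30FC) from katakana terms. The plain-Java sketch below illustrates that rule on single strings. It does not use any Lucene class: the class name KatakanaStemSketch, the stem method, and the MIN_LENGTH threshold of 4 are assumptions made for this illustration, not part of the listed API, and the real filter's behavior may differ in detail.

```java
final class KatakanaStemSketch {
    // Katakana-Hiragana prolonged sound mark, the U+30FC character named in the description.
    private static final char PROLONGED_SOUND_MARK = '\u30FC';

    // Hypothetical threshold: terms shorter than this are returned unchanged.
    private static final int MIN_LENGTH = 4;

    /** Drops one trailing U+30FC when the term is all katakana and at least MIN_LENGTH chars. */
    static String stem(String term) {
        if (term.length() < MIN_LENGTH) {
            return term;
        }
        if (term.charAt(term.length() - 1) != PROLONGED_SOUND_MARK) {
            return term;
        }
        if (!isKatakana(term)) {
            return term;
        }
        return term.substring(0, term.length() - 1);
    }

    /** True when every char lies in the Unicode Katakana block (U+30A0 to U+30FF). */
    private static boolean isKatakana(String s) {
        for (int i = 0; i < s.length(); i++) {
            if (Character.UnicodeBlock.of(s.charAt(i)) != Character.UnicodeBlock.KATAKANA) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // "konpyuutaa" (computer), 7 chars ending in U+30FC: the trailing mark is removed.
        String longTerm = "\u30B3\u30F3\u30D4\u30E5\u30FC\u30BF\u30FC";
        // "kopii" (copy), 3 chars: shorter than MIN_LENGTH, so it is left as-is.
        String shortTerm = "\u30B3\u30D4\u30FC";
        System.out.println(stem(longTerm).length());
        System.out.println(stem(shortTerm).equals(shortTerm));
    }
}
```

The length threshold keeps short words from being over-stemmed. In an actual analysis chain the same idea would be applied per token by a TokenFilter rather than to whole strings.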