All Classes and Interfaces

Class
Description
Attribute for Token.getBaseForm().
Attribute for Token.getBaseForm().
Base class for a binary-encoded in-memory dictionary.
Deprecated, for removal: This API element is subject to removal in a future version.
Character category data.
Utility functions for JapaneseCompletionFilter
n-gram connection cost data
Dictionary interface for retrieving morphological data by id.
Tool to build dictionaries.
Format of the dictionary.
Outputs the dot (graphviz) string for the viterbi lattice.
Attribute for Kuromoji inflection data.
Attribute for Kuromoji inflection data.
Analyzer for Japanese that uses morphological analysis.
Replaces term text with the BaseFormAttribute.
Analyzer for Japanese completion suggester.
A TokenFilter that adds Japanese romanized tokens to the term attribute.
Completion mode
A TokenFilter that normalizes small letters (捨て仮名) in hiragana into normal letters.
Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form.
A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).
A TokenFilter that normalizes small letters (捨て仮名) in katakana into normal letters.
A TokenFilter that normalizes Japanese numbers (kansūji) to regular Arabic decimal numbers in half-width characters.
Buffer that holds a Japanese number string and a position index used as a parsed-to marker
Removes tokens that match a set of part-of-speech tags.
A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form.
Tokenizer for Japanese that uses morphological analysis.
Tokenization mode: this determines how the tokenizer handles compound and unknown words.
Token type reflecting the original source of this token
Factory for JapaneseTokenizer.
Converts a Katakana string to Romaji using the pre-defined Katakana-Romaji mapping rules.
Attribute for Kuromoji reading data
Attribute for Kuromoji reading data
Analyzed token with morphological data from its dictionary.
Binary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs.
Thin wrapper around an FST with root-arc caching for Japanese.
Utility class for english translations of morphological data, used only for debugging.
Dictionary for unknown-word handling.
Class for building a User Dictionary.