All Classes
-
All Classes Interface Summary Class Summary Class Description DefaultICUTokenizerConfig DefaultICUTokenizerConfig
that is generally applicable to many languages.ICUCollatedTermAttributeImpl Extension ofCharTermAttributeImpl
that encodes the term text as a binary Unicode collation key instead of as UTF-8 bytes.ICUCollationAttributeFactory Converts each token into itsCollationKey
, and then encodes bytes as an index term.ICUCollationDocValuesField Indexes collation keys as a single-valuedSortedDocValuesField
.ICUCollationKeyAnalyzer ConfiguresKeywordTokenizer
withICUCollationAttributeFactory
.ICUFoldingFilter A TokenFilter that applies search term folding to Unicode text, applying foldings from UTR#30 Character Foldings.ICUFoldingFilterFactory Factory forICUFoldingFilter
.ICUNormalizer2CharFilter Normalize token text with ICU'sNormalizer2
.ICUNormalizer2CharFilterFactory Factory forICUNormalizer2CharFilter
ICUNormalizer2Filter Normalize token text with ICU'sNormalizer2
ICUNormalizer2FilterFactory Factory forICUNormalizer2Filter
ICUTokenizer Breaks text into words according to UAX #29: Unicode Text Segmentation (http://www.unicode.org/reports/tr29/)ICUTokenizerConfig Class that allows for tailored Unicode Text Segmentation on a per-writing system basis.ICUTokenizerFactory Factory forICUTokenizer
.ICUTransformFilter ATokenFilter
that transforms text with ICU.ICUTransformFilterFactory Factory forICUTransformFilter
.ScriptAttribute This attribute stores the UTR #24 script value for a token of text.ScriptAttributeImpl Implementation ofScriptAttribute
that stores the script as an integer.