C D E F G H I N O R S W 

C

clear() - Method in class org.apache.lucene.analysis.icu.tokenattributes.ScriptAttributeImpl
 
combineCJ() - Method in class org.apache.lucene.analysis.icu.segmentation.DefaultICUTokenizerConfig
 
combineCJ() - Method in class org.apache.lucene.analysis.icu.segmentation.ICUTokenizerConfig
true if Han, Hiragana, and Katakana scripts should all be returned as Japanese
copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.icu.tokenattributes.ScriptAttributeImpl
 
create(TokenStream) - Method in class org.apache.lucene.analysis.icu.ICUFoldingFilterFactory
 
create(TokenStream) - Method in class org.apache.lucene.analysis.icu.ICUNormalizer2FilterFactory
 
create(TokenStream) - Method in class org.apache.lucene.analysis.icu.ICUTransformFilterFactory
 
create(AttributeSource.AttributeFactory, Reader) - Method in class org.apache.lucene.analysis.icu.segmentation.ICUTokenizerFactory
 
create(TokenStream) - Method in class org.apache.lucene.collation.ICUCollationKeyFilterFactory
Deprecated.
 
createAttributeInstance(Class<? extends Attribute>) - Method in class org.apache.lucene.collation.ICUCollationAttributeFactory
 
createComponents(String, Reader) - Method in class org.apache.lucene.collation.ICUCollationKeyAnalyzer
 

D

DefaultICUTokenizerConfig - Class in org.apache.lucene.analysis.icu.segmentation
Default ICUTokenizerConfig that is generally applicable to many languages.
DefaultICUTokenizerConfig(boolean) - Constructor for class org.apache.lucene.analysis.icu.segmentation.DefaultICUTokenizerConfig
Creates a new config.

E

end() - Method in class org.apache.lucene.analysis.icu.segmentation.ICUTokenizer
 
equals(Object) - Method in class org.apache.lucene.analysis.icu.tokenattributes.ScriptAttributeImpl
 

F

fillBytesRef() - Method in class org.apache.lucene.collation.tokenattributes.ICUCollatedTermAttributeImpl
 

G

getBreakIterator(int) - Method in class org.apache.lucene.analysis.icu.segmentation.DefaultICUTokenizerConfig
 
getBreakIterator(int) - Method in class org.apache.lucene.analysis.icu.segmentation.ICUTokenizerConfig
Return a breakiterator capable of processing a given script.
getCode() - Method in interface org.apache.lucene.analysis.icu.tokenattributes.ScriptAttribute
Get the numeric code for this script value.
getCode() - Method in class org.apache.lucene.analysis.icu.tokenattributes.ScriptAttributeImpl
 
getMultiTermComponent() - Method in class org.apache.lucene.analysis.icu.ICUFoldingFilterFactory
 
getMultiTermComponent() - Method in class org.apache.lucene.analysis.icu.ICUNormalizer2FilterFactory
 
getMultiTermComponent() - Method in class org.apache.lucene.analysis.icu.ICUTransformFilterFactory
 
getMultiTermComponent() - Method in class org.apache.lucene.collation.ICUCollationKeyFilterFactory
Deprecated.
 
getName() - Method in interface org.apache.lucene.analysis.icu.tokenattributes.ScriptAttribute
Get the full name.
getName() - Method in class org.apache.lucene.analysis.icu.tokenattributes.ScriptAttributeImpl
 
getShortName() - Method in interface org.apache.lucene.analysis.icu.tokenattributes.ScriptAttribute
Get the abbreviated name.
getShortName() - Method in class org.apache.lucene.analysis.icu.tokenattributes.ScriptAttributeImpl
 
getType(int, int) - Method in class org.apache.lucene.analysis.icu.segmentation.DefaultICUTokenizerConfig
 
getType(int, int) - Method in class org.apache.lucene.analysis.icu.segmentation.ICUTokenizerConfig
Return a token type value for a given script and BreakIterator rule status.

H

hashCode() - Method in class org.apache.lucene.analysis.icu.tokenattributes.ScriptAttributeImpl
 

I

ICUCollatedTermAttributeImpl - Class in org.apache.lucene.collation.tokenattributes
Extension of CharTermAttributeImpl that encodes the term text as a binary Unicode collation key instead of as UTF-8 bytes.
ICUCollatedTermAttributeImpl(Collator) - Constructor for class org.apache.lucene.collation.tokenattributes.ICUCollatedTermAttributeImpl
Create a new ICUCollatedTermAttributeImpl
ICUCollationAttributeFactory - Class in org.apache.lucene.collation
Converts each token into its CollationKey, and then encodes bytes as an index term.
ICUCollationAttributeFactory(Collator) - Constructor for class org.apache.lucene.collation.ICUCollationAttributeFactory
Create an ICUCollationAttributeFactory, using AttributeSource.AttributeFactory.DEFAULT_ATTRIBUTE_FACTORY as the factory for all other attributes.
ICUCollationAttributeFactory(AttributeSource.AttributeFactory, Collator) - Constructor for class org.apache.lucene.collation.ICUCollationAttributeFactory
Create an ICUCollationAttributeFactory, using the supplied Attribute Factory as the factory for all other attributes.
ICUCollationDocValuesField - Class in org.apache.lucene.collation
Indexes collation keys as a single-valued SortedDocValuesField.
ICUCollationDocValuesField(String, Collator) - Constructor for class org.apache.lucene.collation.ICUCollationDocValuesField
Create a new ICUCollationDocValuesField.
ICUCollationKeyAnalyzer - Class in org.apache.lucene.collation
ICUCollationKeyAnalyzer(Version, Collator) - Constructor for class org.apache.lucene.collation.ICUCollationKeyAnalyzer
Create a new ICUCollationKeyAnalyzer, using the specified collator.
ICUCollationKeyAnalyzer(Collator) - Constructor for class org.apache.lucene.collation.ICUCollationKeyAnalyzer
Deprecated.
Use ICUCollationKeyAnalyzer.ICUCollationKeyAnalyzer(Version, Collator) and specify a version instead. This ctor will be removed in Lucene 5.0
ICUCollationKeyFilter - Class in org.apache.lucene.collation
Deprecated.
Use ICUCollationAttributeFactory instead, which encodes terms directly as bytes. This filter will be removed in Lucene 5.0
ICUCollationKeyFilter(TokenStream, Collator) - Constructor for class org.apache.lucene.collation.ICUCollationKeyFilter
Deprecated.
 
ICUCollationKeyFilterFactory - Class in org.apache.lucene.collation
Deprecated.
ICUCollationKeyFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.collation.ICUCollationKeyFilterFactory
Deprecated.
 
ICUFoldingFilter - Class in org.apache.lucene.analysis.icu
A TokenFilter that applies search term folding to Unicode text, applying foldings from UTR#30 Character Foldings.
ICUFoldingFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.icu.ICUFoldingFilter
Create a new ICUFoldingFilter on the specified input
ICUFoldingFilterFactory - Class in org.apache.lucene.analysis.icu
Factory for ICUFoldingFilter.
ICUFoldingFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.icu.ICUFoldingFilterFactory
Creates a new ICUFoldingFilterFactory
ICUNormalizer2Filter - Class in org.apache.lucene.analysis.icu
Normalize token text with ICU's Normalizer2
ICUNormalizer2Filter(TokenStream) - Constructor for class org.apache.lucene.analysis.icu.ICUNormalizer2Filter
Create a new Normalizer2Filter that combines NFKC normalization, Case Folding, and removes Default Ignorables (NFKC_Casefold)
ICUNormalizer2Filter(TokenStream, Normalizer2) - Constructor for class org.apache.lucene.analysis.icu.ICUNormalizer2Filter
Create a new Normalizer2Filter with the specified Normalizer2
ICUNormalizer2FilterFactory - Class in org.apache.lucene.analysis.icu
ICUNormalizer2FilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.icu.ICUNormalizer2FilterFactory
Creates a new ICUNormalizer2FilterFactory
ICUTokenizer - Class in org.apache.lucene.analysis.icu.segmentation
Breaks text into words according to UAX #29: Unicode Text Segmentation (http://www.unicode.org/reports/tr29/)
ICUTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.icu.segmentation.ICUTokenizer
Construct a new ICUTokenizer that breaks text into words from the given Reader.
ICUTokenizer(Reader, ICUTokenizerConfig) - Constructor for class org.apache.lucene.analysis.icu.segmentation.ICUTokenizer
Construct a new ICUTokenizer that breaks text into words from the given Reader, using a tailored BreakIterator configuration.
ICUTokenizer(AttributeSource.AttributeFactory, Reader, ICUTokenizerConfig) - Constructor for class org.apache.lucene.analysis.icu.segmentation.ICUTokenizer
Construct a new ICUTokenizer that breaks text into words from the given Reader, using a tailored BreakIterator configuration.
ICUTokenizerConfig - Class in org.apache.lucene.analysis.icu.segmentation
Class that allows for tailored Unicode Text Segmentation on a per-writing system basis.
ICUTokenizerConfig() - Constructor for class org.apache.lucene.analysis.icu.segmentation.ICUTokenizerConfig
Sole constructor.
ICUTokenizerFactory - Class in org.apache.lucene.analysis.icu.segmentation
Factory for ICUTokenizer.
ICUTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.icu.segmentation.ICUTokenizerFactory
Creates a new ICUTokenizerFactory
ICUTransformFilter - Class in org.apache.lucene.analysis.icu
A TokenFilter that transforms text with ICU.
ICUTransformFilter(TokenStream, Transliterator) - Constructor for class org.apache.lucene.analysis.icu.ICUTransformFilter
Create a new ICUTransformFilter that transforms text on the given stream.
ICUTransformFilterFactory - Class in org.apache.lucene.analysis.icu
Factory for ICUTransformFilter.
ICUTransformFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.icu.ICUTransformFilterFactory
Creates a new ICUTransformFilterFactory
incrementToken() - Method in class org.apache.lucene.analysis.icu.ICUNormalizer2Filter
 
incrementToken() - Method in class org.apache.lucene.analysis.icu.ICUTransformFilter
 
incrementToken() - Method in class org.apache.lucene.analysis.icu.segmentation.ICUTokenizer
 
incrementToken() - Method in class org.apache.lucene.collation.ICUCollationKeyFilter
Deprecated.
 
inform(ResourceLoader) - Method in class org.apache.lucene.analysis.icu.segmentation.ICUTokenizerFactory
 
inform(ResourceLoader) - Method in class org.apache.lucene.collation.ICUCollationKeyFilterFactory
Deprecated.
 

N

name() - Method in class org.apache.lucene.collation.ICUCollationDocValuesField
 

O

org.apache.lucene.analysis.icu - package org.apache.lucene.analysis.icu
Analysis components based on ICU
org.apache.lucene.analysis.icu.segmentation - package org.apache.lucene.analysis.icu.segmentation
Tokenizer that breaks text into words with the Unicode Text Segmentation algorithm.
org.apache.lucene.analysis.icu.tokenattributes - package org.apache.lucene.analysis.icu.tokenattributes
Additional ICU-specific Attributes for text analysis.
org.apache.lucene.collation - package org.apache.lucene.collation
Unicode Collation support.
org.apache.lucene.collation.tokenattributes - package org.apache.lucene.collation.tokenattributes
Custom AttributeImpl for indexing collation keys as index terms.

R

reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.icu.tokenattributes.ScriptAttributeImpl
 
reset() - Method in class org.apache.lucene.analysis.icu.segmentation.ICUTokenizer
 

S

ScriptAttribute - Interface in org.apache.lucene.analysis.icu.tokenattributes
This attribute stores the UTR #24 script value for a token of text.
ScriptAttributeImpl - Class in org.apache.lucene.analysis.icu.tokenattributes
Implementation of ScriptAttribute that stores the script as an integer.
ScriptAttributeImpl() - Constructor for class org.apache.lucene.analysis.icu.tokenattributes.ScriptAttributeImpl
Initializes this attribute with UScript.COMMON
setCode(int) - Method in interface org.apache.lucene.analysis.icu.tokenattributes.ScriptAttribute
Set the numeric code for this script value.
setCode(int) - Method in class org.apache.lucene.analysis.icu.tokenattributes.ScriptAttributeImpl
 
setStringValue(String) - Method in class org.apache.lucene.collation.ICUCollationDocValuesField
 

W

WORD_HANGUL - Static variable in class org.apache.lucene.analysis.icu.segmentation.DefaultICUTokenizerConfig
Token type for words containing Korean hangul
WORD_HIRAGANA - Static variable in class org.apache.lucene.analysis.icu.segmentation.DefaultICUTokenizerConfig
Token type for words containing Japanese hiragana
WORD_IDEO - Static variable in class org.apache.lucene.analysis.icu.segmentation.DefaultICUTokenizerConfig
Token type for words containing ideographic characters
WORD_KATAKANA - Static variable in class org.apache.lucene.analysis.icu.segmentation.DefaultICUTokenizerConfig
Token type for words containing Japanese katakana
WORD_LETTER - Static variable in class org.apache.lucene.analysis.icu.segmentation.DefaultICUTokenizerConfig
Token type for words that contain letters
WORD_NUMBER - Static variable in class org.apache.lucene.analysis.icu.segmentation.DefaultICUTokenizerConfig
Token type for words that appear to be numbers
C D E F G H I N O R S W 

Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.