Class ICUNormalizer2FilterFactory

java.lang.Object
org.apache.lucene.analysis.AbstractAnalysisFactory
org.apache.lucene.analysis.TokenFilterFactory
org.apache.lucene.analysis.icu.ICUNormalizer2FilterFactory

public class ICUNormalizer2FilterFactory extends org.apache.lucene.analysis.TokenFilterFactory
Factory for ICUNormalizer2Filter

Supports the following attributes:

  • form: A Unicode Normalization Form, one of 'nfc','nfkc', 'nfkc_cf'. Default is nfkc_cf.
  • mode: Either 'compose' or 'decompose'. Default is compose. Use "decompose" with nfc or nfkc, to get nfd or nfkd, respectively.
  • filter: A UnicodeSet pattern. Codepoints outside the set are always left unchanged. Default is [] (the null set, no filtering).
Since:
3.1.0
See Also:
SPI Name (case-insensitive: if the name is 'htmlStrip', 'htmlstrip' can be used when looking up the service).
"icuNormalizer2"
  • Field Summary

    Fields
    Modifier and Type
    Field
    Description
    static final String
    SPI name

    Fields inherited from class org.apache.lucene.analysis.AbstractAnalysisFactory

    LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion
  • Constructor Summary

    Constructors
    Constructor
    Description
    Default ctor for compatibility with SPI
    Creates a new ICUNormalizer2FilterFactory
  • Method Summary

    Modifier and Type
    Method
    Description
    org.apache.lucene.analysis.TokenStream
    create(org.apache.lucene.analysis.TokenStream input)
     
    org.apache.lucene.analysis.TokenStream
    normalize(org.apache.lucene.analysis.TokenStream input)
     

    Methods inherited from class org.apache.lucene.analysis.TokenFilterFactory

    availableTokenFilters, findSPIName, forName, lookupClass, reloadTokenFilters

    Methods inherited from class org.apache.lucene.analysis.AbstractAnalysisFactory

    defaultCtorException, get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNames

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Field Details

  • Constructor Details

    • ICUNormalizer2FilterFactory

      public ICUNormalizer2FilterFactory(Map<String,String> args)
      Creates a new ICUNormalizer2FilterFactory
    • ICUNormalizer2FilterFactory

      public ICUNormalizer2FilterFactory()
      Default ctor for compatibility with SPI
  • Method Details

    • create

      public org.apache.lucene.analysis.TokenStream create(org.apache.lucene.analysis.TokenStream input)
      Specified by:
      create in class org.apache.lucene.analysis.TokenFilterFactory
    • normalize

      public org.apache.lucene.analysis.TokenStream normalize(org.apache.lucene.analysis.TokenStream input)
      Overrides:
      normalize in class org.apache.lucene.analysis.TokenFilterFactory