Class OpenNLPLemmatizerFilterFactory

java.lang.Object
org.apache.lucene.analysis.AbstractAnalysisFactory
org.apache.lucene.analysis.TokenFilterFactory
org.apache.lucene.analysis.opennlp.OpenNLPLemmatizerFilterFactory
All Implemented Interfaces:
org.apache.lucene.util.ResourceLoaderAware

public class OpenNLPLemmatizerFilterFactory extends org.apache.lucene.analysis.TokenFilterFactory implements org.apache.lucene.util.ResourceLoaderAware
Factory for OpenNLPLemmatizerFilter.
 <fieldType name="text_opennlp_lemma" class="solr.TextField" positionIncrementGap="100"
   <analyzer>
     <tokenizer class="solr.OpenNLPTokenizerFactory"
                sentenceModel="filename"
                tokenizerModel="filename"/>
     />
     <filter class="solr.OpenNLPLemmatizerFilterFactory"
             dictionary="filename"
             lemmatizerModel="filename"/>
   </analyzer>
 </fieldType>
Since:
7.3.0
SPI Name (case-insensitive: if the name is 'htmlStrip', 'htmlstrip' can be used when looking up the service).
"openNlpLemmatizer"
  • Field Summary

    Fields
    Modifier and Type
    Field
    Description
    static final String
     
    static final String
     
    static final String
    SPI name

    Fields inherited from class org.apache.lucene.analysis.AbstractAnalysisFactory

    LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion
  • Constructor Summary

    Constructors
    Constructor
    Description
    Default ctor for compatibility with SPI
     
  • Method Summary

    Modifier and Type
    Method
    Description
    create(org.apache.lucene.analysis.TokenStream in)
     
    void
    inform(org.apache.lucene.util.ResourceLoader loader)
     

    Methods inherited from class org.apache.lucene.analysis.TokenFilterFactory

    availableTokenFilters, findSPIName, forName, lookupClass, normalize, reloadTokenFilters

    Methods inherited from class org.apache.lucene.analysis.AbstractAnalysisFactory

    defaultCtorException, get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNames

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Field Details

  • Constructor Details

    • OpenNLPLemmatizerFilterFactory

      public OpenNLPLemmatizerFilterFactory(Map<String,String> args)
    • OpenNLPLemmatizerFilterFactory

      public OpenNLPLemmatizerFilterFactory()
      Default ctor for compatibility with SPI
  • Method Details

    • create

      public OpenNLPLemmatizerFilter create(org.apache.lucene.analysis.TokenStream in)
      Specified by:
      create in class org.apache.lucene.analysis.TokenFilterFactory
    • inform

      public void inform(org.apache.lucene.util.ResourceLoader loader) throws IOException
      Specified by:
      inform in interface org.apache.lucene.util.ResourceLoaderAware
      Throws:
      IOException