eccenca Documentation

Analyzer Examples

Using specific Analyzers

WhitespaceAnalyzer, GermanAnalyzer, SnowballAnalyzer

DataDictionary
<IndexField FieldNo="1" IndexValue="true" Name="AccessRightsField" StoreText="true" Tokenize="true" Type="Text">
  <Analyzer ClassName="org.apache.lucene.analysis.WhitespaceAnalyzer" />
</IndexField>

<IndexField FieldNo="2" IndexValue="true" Name="ContentField" StoreText="true" Tokenize="true" Type="Text">
  <Analyzer ClassName="org.apache.lucene.analysis.de.GermanAnalyzer" />
</IndexField>
SnowballAnalyzer needs at least one parameter (the language) in its constructor:
<Analyzer ClassName="org.apache.lucene.analysis.snowball.SnowballAnalyzer">
  <ParameterSet xmlns="http://www.brox.de/ParameterSet">
    <Parameter Name="language" xsi:type="String">
      <Value>English</Value>
    </Parameter>
  </ParameterSet>
</Analyzer>
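
Combined with the field definitions above, a SnowballAnalyzer could be attached to a field roughly as sketched below. The field number and the name StemmedContentField are placeholders invented for this illustration, and the xsi prefix is assumed to be declared on an enclosing element, as in the other snippets on this page.

```xml
<!-- Hypothetical field: FieldNo and Name are illustrative only -->
<IndexField FieldNo="3" IndexValue="true" Name="StemmedContentField" StoreText="true" Tokenize="true" Type="Text">
  <Analyzer ClassName="org.apache.lucene.analysis.snowball.SnowballAnalyzer">
    <ParameterSet xmlns="http://www.brox.de/ParameterSet">
      <!-- First (and only) constructor argument: the stemmer language -->
      <Parameter Name="language" xsi:type="String">
        <Value>English</Value>
      </Parameter>
    </ParameterSet>
  </Analyzer>
</IndexField>
```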

Using Analyzers with StopWordLists

Analyzer with no stopwords (an empty stopword list)
<Analyzer ClassName="org.apache.lucene.analysis.standard.StandardAnalyzer">
  <ParameterSet xmlns="http://www.brox.de/ParameterSet">
    <Parameter xsi:type="StringList" Name="stopWords">
      <Value/>
    </Parameter>
  </ParameterSet>
</Analyzer>
Analyzer with specific stopwords
<Analyzer ClassName="org.apache.lucene.analysis.standard.StandardAnalyzer">
  <ParameterSet xmlns="http://www.brox.de/ParameterSet">
    <Parameter xsi:type="StringList" Name="stopWords">
      <Value>Stopword1</Value>
      <Value>Stopword2</Value>
    </Parameter>
  </ParameterSet>
</Analyzer>
SnowballAnalyzer with a specific stopword list
<Analyzer ClassName="org.apache.lucene.analysis.snowball.SnowballAnalyzer">
  <ParameterSet xmlns="http://www.brox.de/ParameterSet">
    <Parameter Name="language" xsi:type="String">
      <Value>English</Value>
    </Parameter>
    <Parameter Name="stopWords" xsi:type="StringList">
      <Value>Stopword1</Value>
      <Value>Stopword2</Value>
    </Parameter>
  </ParameterSet>
</Analyzer>
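Hints
The order of the parameters matters: the first Parameter is passed as the first argument to the Analyzer's constructor, the second Parameter as the second argument, and so on.
The Name attribute of a Parameter has no effect; parameters are matched to constructor arguments by position only.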
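
To illustrate the two hints: going by the rules above, the following sketch should behave the same as the SnowballAnalyzer example with a stopword list, even though the Name values differ, because the language is still the first Parameter and the stopword list the second. The names arg1 and arg2 are arbitrary placeholders chosen for this illustration.

```xml
<Analyzer ClassName="org.apache.lucene.analysis.snowball.SnowballAnalyzer">
  <ParameterSet xmlns="http://www.brox.de/ParameterSet">
    <!-- Position 1: passed as the first constructor argument (language) -->
    <Parameter Name="arg1" xsi:type="String">
      <Value>English</Value>
    </Parameter>
    <!-- Position 2: passed as the second constructor argument (stopwords) -->
    <Parameter Name="arg2" xsi:type="StringList">
      <Value>Stopword1</Value>
      <Value>Stopword2</Value>
    </Parameter>
  </ParameterSet>
</Analyzer>
```

Swapping the two Parameter elements, by contrast, would hand the stopword list to the constructor as its first argument, regardless of how the parameters are named.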