Database Reference
In-Depth Information
class holds the name of the Java class to use for tokenizing and filtering the text;
for instance, "org.apache.lucene.analysis.WhitespaceAnalyzer" .
id defines the identifier for this analyzer. This is for referencing the analyzer (in
text elements using the analyzer attribute). If you don't specify an id , this
changes the default analyzer.
An analyzer definition can contain parameters to pass to the analyzer using param
elements. These parameters are passed to the constructor of the analyzer class:
<param name = string
type? = string
value? = string >
value*
</param>
name is the name of the parameter.
type is the (Java) type of the parameter. Several types are currently supported:
java.lang.String
A string that may be either a literal value, the name of a class, or the fully
qualified name of an enumeration value, depending on the parameter con‐
text.
java.io.File
A path to a file on the filesystem; it must be in the appropriate Java path syn‐
tax for the operating system in use.
java.util.Set
Assumed to be a set of java.lang.String . When this is used, we can pro‐
vide multiple values; for example:
<param name= "stopwords" type= "java.util.Set" >
<value> and </value>
<value> or </value>
<value> the </value>
<value> a </value>
<value> an </value>
<value> this </value>
<value> there </value>
</param>
java.lang.Integer (or int )
An integer.
java.lang.Boolean (or boolean )
A Boolean.
Search WWH ::




Custom Search