/AWS1/CL_SRHANALYSISOPTIONS¶
Synonyms, stopwords, and stemming options for an analysis scheme. Includes tokenization dictionary for Japanese.
CONSTRUCTOR
¶
IMPORTING¶
Optional arguments:¶
iv_synonyms
TYPE /AWS1/SRHSTRING
/AWS1/SRHSTRING
¶
A JSON object that defines synonym groups and aliases. A synonym group is an array of arrays, where each sub-array is a group of terms where each term in the group is considered a synonym of every other term in the group. The aliases value is an object that contains a collection of string:value pairs where the string specifies a term and the array of values specifies each of the aliases for that term. An alias is considered a synonym of the specified term, but the term is not considered a synonym of the alias. For more information about specifying synonyms, see Synonyms in the Amazon CloudSearch Developer Guide.
iv_stopwords
TYPE /AWS1/SRHSTRING
/AWS1/SRHSTRING
¶
A JSON array of terms to ignore during indexing and searching. For example,
["a", "an", "the", "of"]
. The stopwords dictionary must explicitly list each word you want to ignore. Wildcards and regular expressions are not supported.
iv_stemmingdictionary
TYPE /AWS1/SRHSTRING
/AWS1/SRHSTRING
¶
A JSON object that contains a collection of string:value pairs that each map a term to its stem. For example,
{"term1": "stem1", "term2": "stem2", "term3": "stem3"}
. The stemming dictionary is applied in addition to any algorithmic stemming. This enables you to override the results of the algorithmic stemming to correct specific cases of overstemming or understemming. The maximum size of a stemming dictionary is 500 KB.
iv_japanesetokizationdicti00
TYPE /AWS1/SRHSTRING
/AWS1/SRHSTRING
¶
A JSON array that contains a collection of terms, tokens, readings and part of speech for Japanese Tokenizaiton. The Japanese tokenization dictionary enables you to override the default tokenization for selected terms. This is only valid for Japanese language fields.
iv_algorithmicstemming
TYPE /AWS1/SRHALGORITHMICSTEMMING
/AWS1/SRHALGORITHMICSTEMMING
¶
The level of algorithmic stemming to perform:
none
,minimal
,light
, orfull
. The available levels vary depending on the language. For more information, see Language Specific Text Processing Settings in the Amazon CloudSearch Developer Guide
Queryable Attributes¶
Synonyms¶
A JSON object that defines synonym groups and aliases. A synonym group is an array of arrays, where each sub-array is a group of terms where each term in the group is considered a synonym of every other term in the group. The aliases value is an object that contains a collection of string:value pairs where the string specifies a term and the array of values specifies each of the aliases for that term. An alias is considered a synonym of the specified term, but the term is not considered a synonym of the alias. For more information about specifying synonyms, see Synonyms in the Amazon CloudSearch Developer Guide.
Accessible with the following methods¶
Method | Description |
---|---|
GET_SYNONYMS() |
Getter for SYNONYMS, with configurable default |
ASK_SYNONYMS() |
Getter for SYNONYMS w/ exceptions if field has no value |
HAS_SYNONYMS() |
Determine if SYNONYMS has a value |
Stopwords¶
A JSON array of terms to ignore during indexing and searching. For example,
["a", "an", "the", "of"]
. The stopwords dictionary must explicitly list each word you want to ignore. Wildcards and regular expressions are not supported.
Accessible with the following methods¶
Method | Description |
---|---|
GET_STOPWORDS() |
Getter for STOPWORDS, with configurable default |
ASK_STOPWORDS() |
Getter for STOPWORDS w/ exceptions if field has no value |
HAS_STOPWORDS() |
Determine if STOPWORDS has a value |
StemmingDictionary¶
A JSON object that contains a collection of string:value pairs that each map a term to its stem. For example,
{"term1": "stem1", "term2": "stem2", "term3": "stem3"}
. The stemming dictionary is applied in addition to any algorithmic stemming. This enables you to override the results of the algorithmic stemming to correct specific cases of overstemming or understemming. The maximum size of a stemming dictionary is 500 KB.
Accessible with the following methods¶
Method | Description |
---|---|
GET_STEMMINGDICTIONARY() |
Getter for STEMMINGDICTIONARY, with configurable default |
ASK_STEMMINGDICTIONARY() |
Getter for STEMMINGDICTIONARY w/ exceptions if field has no |
HAS_STEMMINGDICTIONARY() |
Determine if STEMMINGDICTIONARY has a value |
JapaneseTokenizationDictionary¶
A JSON array that contains a collection of terms, tokens, readings and part of speech for Japanese Tokenizaiton. The Japanese tokenization dictionary enables you to override the default tokenization for selected terms. This is only valid for Japanese language fields.
Accessible with the following methods¶
Method | Description |
---|---|
GET_JAPANESETOKIZATIONDICT00() |
Getter for JAPANESETOKIZATIONDICTIONARY, with configurable d |
ASK_JAPANESETOKIZATIONDICT00() |
Getter for JAPANESETOKIZATIONDICTIONARY w/ exceptions if fie |
HAS_JAPANESETOKIZATIONDICT00() |
Determine if JAPANESETOKIZATIONDICTIONARY has a value |
AlgorithmicStemming¶
The level of algorithmic stemming to perform:
none
,minimal
,light
, orfull
. The available levels vary depending on the language. For more information, see Language Specific Text Processing Settings in the Amazon CloudSearch Developer Guide
Accessible with the following methods¶
Method | Description |
---|---|
GET_ALGORITHMICSTEMMING() |
Getter for ALGORITHMICSTEMMING, with configurable default |
ASK_ALGORITHMICSTEMMING() |
Getter for ALGORITHMICSTEMMING w/ exceptions if field has no |
HAS_ALGORITHMICSTEMMING() |
Determine if ALGORITHMICSTEMMING has a value |