I’m in the process of migrating an Elasticsearch index to OpenSearch, and I’m having trouble getting my index search to return results.

In Elasticsearch, my index is configured as follows:
{
  "settings": {
    "analysis": {
      "analyzer": {
        "account_analyzer": {
          "tokenizer": "account_tokenizer"
        }
      },
      "tokenizer": {
        "account_tokenizer": {
          "token_chars": ["digit"],
          "type": "ngram",
          "min_gram": "3",
          "max_gram": "3"
        }
      }
    }
  },
  "mappings": {
    "properties": {
      "accounts": {
        "properties": {
          "accountNumber": {
            "type": "text",
            "analyzer": "account_analyzer"
          }
        }
      }
    }
  }
}
The index in OpenSearch is configured as follows:
{
  "settings": {
    "analysis": {
      "analyzer": {
        "account_analyzer": {
          "type": "custom",
          "tokenizer": "account_tokenizer"
        }
      },
      "tokenizer": {
        "account_tokenizer": {
          "token_chars": ["digit"],
          "type": "ngram",
          "min_gram": "3",
          "max_gram": "3"
        }
      }
    }
  },
  "mappings": {
    "properties": {
      "accounts": {
        "type": "nested",
        "properties": {
          "accountNumber": {
            "type": "text",
            "analyzer": "account_analyzer"
          }
        }
      }
    }
  }
}
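As a sanity check, I can run the same input through the `_analyze` API on each cluster and compare the emitted tokens side by side (the index name `my-index` is a placeholder for my actual index):

```json
POST /my-index/_analyze
{
  "analyzer": "account_analyzer",
  "text": "12345678"
}
```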
When I execute the query below against Elasticsearch, I get results back; against OpenSearch, I get 0 hits.
{
  "query": {
    "bool": {
      "must": [
        {
          "match": {
            "accounts.accountNumber": "12345678"
          }
        },
        {
          "match": {
            "accounts.accountLength": 8
          }
        }
      ]
    }
  }
}
The account number I’m looking for is stored in the index as 1234####. In Elasticsearch, the ngram tokenizer splits the indexed value into contiguous character groups whose lengths are bounded by the min_gram and max_gram settings, and searches match against those groups. Does this feature carry over to OpenSearch? How would I correctly configure it in OpenSearch, and would I need any plugins to get the expected behavior?