Autocomplete is a feature that predicts the rest of a word as the user types it. It is worth implementing because it can noticeably improve the user’s experience of your product.

Creating an autocomplete might sound daunting if you’ve never built one. But with the help of the features in Elasticsearch, it’s actually simple to do.

Things You Should Know

If you have little knowledge of Elasticsearch, I suggest that you read my other articles first. This is not required, but knowing how an analyzer and a text field work will definitely help you understand this article.

The article “Basics of Elasticsearch for Developer” will introduce you to Elasticsearch.

The article “Elasticsearch: Text vs. Keyword” will teach you the difference between text and keyword in Elasticsearch and will also explain how Elasticsearch’s analyzer works.

Setup

Creating the index

First, let’s create an index called autocomplete-example. We will use this index for the examples in this article.

Defining a mapping

Before indexing a document, let’s first define a mapping. We will only need one field, simple_autocomplete, with the field data type text and the standard analyzer. Since Elasticsearch uses the standard analyzer by default, we need not define it in the mapping.

Indexing a document

Let’s index a document. For the examples in this article, we will only need one document, containing the text “Hong Kong.”

Querying the Index With match Query

Let’s start with the query that we normally use, the match query.

The standard analyzer lowercases your indexed text and splits it into tokens on word boundaries before storing it in the inverted index.

By default, the match query analyzes the search text with the same analyzer that was used at index time, which here is the standard analyzer.
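The steps so far can be sketched as the requests below. The original request snippets are not part of this text, so this is a minimal reconstruction under assumptions: the document ID 1, the use of the _analyze API with the field parameter, and the helper names (analyze_request, match_request, to_console) are illustrative choices, not the author's own code. The script only builds and prints the requests in console style; it does not talk to a cluster.

```python
import json

INDEX = "autocomplete-example"
FIELD = "simple_autocomplete"

# (method, path, body) triples, in the order the article walks through them.
SETUP_REQUESTS = [
    # Create the index with default settings.
    ("PUT", f"/{INDEX}", None),
    # Map one text field; the standard analyzer is the default, so it is omitted.
    ("PUT", f"/{INDEX}/_mapping", {"properties": {FIELD: {"type": "text"}}}),
    # Index the single example document (the ID 1 is an arbitrary assumption).
    ("PUT", f"/{INDEX}/_doc/1", {FIELD: "Hong Kong"}),
]


def analyze_request(text):
    """Ask which tokens the field's analyzer produces for the given text."""
    return ("GET", f"/{INDEX}/_analyze", {"field": FIELD, "text": text})


def match_request(text):
    """Search the example field with a plain match query."""
    return ("GET", f"/{INDEX}/_search", {"query": {"match": {FIELD: text}}})


def to_console(method, path, body):
    """Render a request as a 'METHOD /path' line followed by its JSON body."""
    lines = [f"{method} {path}"]
    if body is not None:
        lines.append(json.dumps(body, indent=2))
    return "\n".join(lines)


if __name__ == "__main__":
    requests_to_show = SETUP_REQUESTS + [
        analyze_request("Hong Kong"),
        match_request("Hong"),
    ]
    for request in requests_to_show:
        print(to_console(*request))
        print()
```

Each printed request can be pasted into a console that accepts this METHOD /path plus JSON body format, or sent with any HTTP client.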
Let’s see how our “Hong Kong” text looks in the inverted index. Using the API that Elasticsearch provides for inspecting analysis, we can see that it is stored as two lowercase tokens, “hong” and “kong.”

When we search the index with a match query, we will only get a result when we type text containing either “Hong” or “Kong.” This is because Elasticsearch only returns a result when a token of the analyzed query exactly matches a token in the inverted index. If the user types “Ho,” “Kon,” or “Hon Kon,” Elasticsearch won’t return any results.

For an autocomplete, this isn’t very helpful to the user, right? At the very least, an autocomplete needs to show something even if we have not typed the full words. To fix this, we can use the match_phrase_prefix query provided by Elasticsearch.

Using match_phrase_prefix Query

The match_phrase_prefix query allows the user to get a result without typing all the words. With the usual match query, we won’t get any result from Elasticsearch if we type “Hon” or “Kon,” but with match_phrase_prefix, we can.

There is still a shortcoming of this autocomplete: If the user types “Hon Kon,” it still won’t return any result. This is because match_phrase_prefix treats only the last term as a prefix. Every earlier term must match a token exactly, and “Hon” is not a token of “Hong Kong.”

The Pros and Cons

An autocomplete with a text field data type and the standard analyzer is very simple, but it has pros and cons that you should consider before using this type of autocomplete.

Pros

Easy to no setup: You don’t even have to define a mapping because by default, if you index a string into Elasticsearch, it gets mapped as a text field with a keyword sub-field.

Fast index time: Because this type of autocomplete uses the standard analyzer, it doesn’t process your text much when saving it to the inverted index, which translates to fast index time.

Enough most of the time: Most of the time, you don’t need a complex autocomplete. This autocomplete type will be enough.

Cons

Can’t handle typos: This type of autocomplete can’t handle typos, so if the user mistypes a word, it won’t return any result.

The query can’t start from the middle of a word: The text queried with this type of autocomplete can’t start from the middle of a word. In the previous example of “Hong Kong,” if we query with the text “ong kong,” Elasticsearch won’t return anything.

Can’t handle a missing space character: If we had mistakenly typed “HongKong” in the previous example, Elasticsearch wouldn’t have returned anything with this type of autocomplete.

When to Use

I recommend an autocomplete with only the standard analyzer when you only need a simple autocomplete. You can also use this type of autocomplete if the index you want to build an autocomplete on is already in production and populated with documents. Since this autocomplete uses the default analyzer and the default mapping for text, it will work for most text documents.

Conclusion

An autocomplete built with the text field data type and the standard analyzer is the simplest and easiest autocomplete that we can build with Elasticsearch. It requires almost no setup and can usually be added to an existing index.

Even if it’s enough for most use cases, it still has many weaknesses because it can only handle simple queries. To overcome them, we can use a custom-defined analyzer or the Suggesters feature in Elasticsearch, which I plan to write about. Stay tuned!

Finally, thank you for reading this article to the end. I hope it will help you with your project.
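As a recap of the querying sections, the matching behaviour can be modelled in a few lines of plain Python. This is a rough sketch under stated assumptions, not Elasticsearch code: analyze only approximates the standard analyzer (split on non-word characters, then lowercase), match models a match query with the default OR operator, and match_phrase_prefix ignores options such as slop and max_expansions. The function names are hypothetical stand-ins chosen to mirror the query names.

```python
import re


def analyze(text):
    """Approximate the standard analyzer: split into word tokens and lowercase."""
    return [token.lower() for token in re.findall(r"\w+", text)]


def match(document, query):
    """Model a match query: a hit if any query token equals an indexed token."""
    indexed = set(analyze(document))
    return any(token in indexed for token in analyze(query))


def match_phrase_prefix(document, query):
    """Model match_phrase_prefix: earlier terms must match consecutive tokens
    exactly, and only the last term is treated as a prefix."""
    doc_tokens = analyze(document)
    query_tokens = analyze(query)
    if not query_tokens:
        return False
    *head, last = query_tokens
    width = len(query_tokens)
    for start in range(len(doc_tokens) - width + 1):
        window = doc_tokens[start:start + width]
        if window[:-1] == head and window[-1].startswith(last):
            return True
    return False


def match_phrase_prefix_body(field, text):
    """Request body for a real match_phrase_prefix search on the given field."""
    return {"query": {"match_phrase_prefix": {field: text}}}


if __name__ == "__main__":
    doc = "Hong Kong"
    for text in ["Hong", "Hon", "Kon", "Hong Kon", "Hon Kon", "ong kong", "HongKong"]:
        print(f"{text!r}: match={match(doc, text)}, "
              f"match_phrase_prefix={match_phrase_prefix(doc, text)}")
```

Running it shows the same pattern as the article: “Hon,” “Kon,” and “Hong Kon” are found only by the prefix model, while “Hon Kon,” “ong kong,” and “HongKong” are found by neither.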
References

https://opster.com/elasticsearch-glossary/elasticsearch-auto-complete-guide/
https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-match-query-phrase-prefix.html
https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-match-query.html
https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-standard-analyzer.html

Previously published at https://codecurated.com/blog/create-a-simple-autocomplete-with-elasticsearch/
