Enrich the data and rewrite the queries with the Elasticsearch percolator

This article is a transcript of the lightning talk we presented this week at Haystack, the Search and Relevance Conference. We showed a method for enriching and rewriting user queries using Wikidata and the Elasticsearch percolator.

Haystack offered us the opportunity to present a lightning talk during the conference. We presented an innovative method for rewriting user queries. Its advantage is that it requires only an Elasticsearch instance and open data from Wikidata.

With voice search, chatbots, or certain types of users and use cases, user queries can be quite verbose. A rewriting step is then needed to extract the meaningful keywords. For example, a user query like "How to prepare athletes to be present on D-Day taking into account all the complexity of the competition?" can be rewritten with just the following keywords: present, D-Day, competition.

For this task we use Wikidata, an open knowledge base maintained by the Wikimedia Foundation, and the Elasticsearch percolator, a feature that has been available in Elasticsearch since the early days but is still not well known.
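To make the idea of the percolator more concrete, here is a minimal sketch in Python of the three request bodies involved: a mapping with a `percolator` field, one stored query per known label, and a `percolate` search that asks which stored queries match the user's sentence. It only builds the JSON bodies and does not contact a cluster. The index name `query-rewriter`, the field names `text` and `label`, and the choice of one `match_phrase` query per label are assumptions made for illustration; the exact setup we used is the one described in the slides.

```python
import json

# Hypothetical index name, used only for illustration.
INDEX = "query-rewriter"


def percolator_mapping():
    """Mapping with a percolator field for the stored queries,
    plus the text field those queries will be evaluated against."""
    return {
        "mappings": {
            "properties": {
                "query": {"type": "percolator"},
                "text": {"type": "text"},
                "label": {"type": "keyword"},
            }
        }
    }


def stored_query(label):
    """One registered query per known label (assumed here to be
    a label taken from Wikidata)."""
    return {"label": label, "query": {"match_phrase": {"text": label}}}


def percolate_request(user_query):
    """Search body asking which stored queries match the user's sentence."""
    return {
        "query": {
            "percolate": {
                "field": "query",
                "document": {"text": user_query},
            }
        }
    }


if __name__ == "__main__":
    # Labels from the example in this article; in practice they would
    # be loaded from a Wikidata export (assumption).
    labels = ["present", "D-Day", "competition"]
    sentence = (
        "How to prepare athletes to be present on D-Day taking into "
        "account all the complexity of the competition?"
    )
    print(json.dumps(percolator_mapping(), indent=2))
    for label in labels:
        print(json.dumps(stored_query(label)))
    print(json.dumps(percolate_request(sentence), indent=2))
```

These bodies could then be sent with any HTTP client: the mapping when creating the index (`PUT /query-rewriter`), each stored query as a document (`POST /query-rewriter/_doc`), and the percolate body as a search (`GET /query-rewriter/_search`). The `label` values of the matching hits would then serve as the rewritten keywords.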

The following slides present in detail the process we put in place.

Enrich data and rewrite queries with the Elasticsearch percolator, by Lucian Precup

You can easily implement the same framework on your own infrastructure, but if you need assistance, do not hesitate to contact us.
