menu icon

New Search & Data meetup - E-Commerce Search and Open Source

The fifth edition of the Search and Data meetup is dedicated to e-commerce search and open source. A nice agenda to mark our return to the Meetup scene

New Search & Data meetup - E-Commerce Search and Open Source

The fifth edition of the Search and Data Meetup is dedicated to e-commerce search and open source. A nice agenda to mark our return to the Meetup scene.

For this upcoming meetup, we will have three talks and free discussions:

// René Kriegler, Director of E-commerce at OpenSource Connections, who will talk about implicit user feedback and how to use it to improve an e-commerce search engine. A presentation for data scientists but not only :-)

// Lucian Precup, CTO of all.site, who will present Elasticsearch and the adaptations to be made to integrate this open source technology into an e-commerce search engine.

// Jérémy Mésière, Manager of the Digital Factory team at Manutan, who will give a feedback on the integration of Elasticsearch in the future e-commerce search engines of Manutan International.

Thanks again to Station F for hosting us, in parallel to the France is AI conference. A big thank you to the Open Source Experience conference to which we owe René’s visit in Paris.

A few words about the logistics: to have access to the premises, it will be necessary to present a Health Pass. The Meetup registration must be completed with the registration via this form

Find bellow some additional details about the agenda.

Abstract

More than other domains, e-commerce search depends on implicit user feedback to optimise search result ranking. While search relevance is probably still a major criterion of search result quality in e-commerce, the user finally takes the buying decision based on criteria such as ‘an attractive price’ and ‘brand sympathy’ that are very hard to make explicit. On the other hand, this decision making can be observed implicitly in web tracking.

Unfortunately, e-commerce search cannot just use more generally known approaches to click modelling. Many of these models assume that the user would view the search results sequentially, top to bottom, while in e-commerce search we often use a grid layout which invites the user to go back and forth between results, especially given the heavy use of product imagery on the search result page. In addition, our model needs to consider contexts beyond the well-known position bias, such as the device type (mobile vs desktop) or the time of the day, together with the different types of signals (clicks, add-to-basket, checkout). Last but not least, many general click models ignore the fact that the number of observations can vary a lot between query-product pairs. Our greater uncertainty in the case of sparse events should be reflected in the model.

In this talk, I shall introduce an approach to using implicit user feedback that is based on Bayesian hierarchical modelling. It will provide a solution for dealing with position bias, including for grid layouts, and for dealing with further contexts, such as device types. The model will cope with varying quantities of observations and it allows to incorporate different types of events, such as clicks and checkouts.

Bio

René works as Director E-commerce at OpenSource Connections, empowering the company’s clients to make onsite search a key driver for improving their online sales.

He has worked in search for 15 years, including on projects for some of the top 10 German e-commerce sites. He is co-founder and co-organiser of MICES (Mix-Camp E-commerce Search), an event that brings together the e-commerce search community each year. His technological focus is on Solr, Elasticsearch and Lucene. He created and maintains the Querqy open source library for query rewriting. René is co-initiator of the Chorus project – an open source software stack that combines Querqy with other powerful tools to build e-commerce search and to measure and improve search quality.

Talk 2 : Elasticsearch pour le e-commerce

Abstract

Elasticsearch is an open source technology based on Apache Lucene which is the most widely used search engine library in the world. The Elasticsearch technology, and more generally the Elastic stack (ELK), is widely used and very well suited for use cases like Monitoring, Observability, Logs and Security. Areas such as Enterprise Search, Workplace Search, Site Search and especially E-Commerce Search require specific adaptations.

Lucian will present an extensive overview of about 30 points of attention when integrating Elasticsearch in an e-commerce context. Topics such as general architecture, auto-completion, relevance, facets, filters and other ways to refine searches, synonyms and taxonomies as well as personalization will be discussed.

A focus will be made on the e-commerce ecosystem and the modules, algorithms and utilities needed to have a complete, powerful and relevant software stack.

Bio

Lucian Precup, is the CTO of all.site, and develops solutions for indexing, searching and data analysis. Since 2010, Lucian built or helped building many e-commerce search engines among which Fnac.com and Carrefour.fr. His latest project, all.site, allows companies, universities and individuals organize information and make it accessible in the most relevant, secure and user friendly manner.

Talk 3 : Retour d’Expérience Manutan International

Abstract

The Manutan Group is the European leader in BtoB e-commerce. Specialized in the distribution of equipment for businesses and local authorities, Manutan is constantly innovating to improve its customers’ experience. As part of its digital transformation, Manutan has launched several projects to overhaul its various systems. The most recent is a new search engine platform for the group’s e-commerce sites, including Manutan.fr. The new search engine is based on an open and modern software stack:

  • Azure Cloud
  • Docker, Kubernetes, Ansible, Jenkins, Prometheus, Grafana on the Devops side
  • Java, Quarkus for the Backend
  • React, Next, Typescript, Jest for the Frontend
  • Cucumber, Cypress on the Q&A side
  • Apigee, MongoDB and of course ElasticSearch This presentation gives an overview of the project and its different technical challenges.
Bio

Jérémy Mésière is the Founder and Manager of the Digital Factory team at Manutan International. Created over a year ago, the Digital Factory team has set up a modern technical stack to serve the different business lines of the Manutan Group.

Scaling an online search engine to thousands of physical stores – ElasticON

10/03/2023

A summary of the talk Scaling an online search engine to thousands of physical stores by Roudy Khoury and Aline Paponaud at ElasticON 2023

Read the article

Question answering,a more human-based approach to our research on all.site.

19/01/2023

Everything about Question-Answering and how to implement it using a flask and elasticsearch.

Read the article

Feedback - Fine-tuning a VOSK model

05/01/2022

all.site is a collaborative search engine. It works like Bing or Google but it has the advantage of being able to go further by indexing for example media content and organizing data from systems like Slack, Confluence or all the information present in a company's intranet.

Read the article

Feedback - Indexing of media file transcripts

17/12/2021

all.site is a collaborative search engine. It works like Bing or Google but it has the advantage of being able to go further by indexing for example media content and organizing data from systems like Slack, Confluence or all the information present in a company's intranet.

Read the article

Shipping to Synonym Graph in Elasticsearch

21/04/2021

In this article, we explain how we moved from the old Elasticsearch synonym filters to the new Synonym Graph Token Filter.

Read the article

When queries are very verbose

22/02/2021

In this article, we present a simple method to rewrite user queries so that a keyword-based search engine can better understand them. This method is very useful in the context of a voice search or a conversation with a chatbot, context in which user queries are generally more verbose.

Read the article

Enrich the data and rewrite the queries with the Elasticsearch percolator

26/04/2019

This article is a transcript of the lightning talk we presented this week at Haystack - the Search and Relevance Conference. We showed a method allowing to enrich and rewrite user queries using Wikidata and the Elasticsearch percolator.

Read the article

A2 the engine that makes Elasticsearch great

13/06/2018

Elasticsearch is an open technology that allows integrators to build ever more innovative and powerful solutions. Elasticsearch

Read the article