A live model for improving data quality in an online shop

14:50-15:10, January 27 @ 1A

Talk/ Overview

Maintaining data quality is a major challenge when handling millions of product offers in an online shop. Given that our suppliers have their own product type hierarchies and choose to name brands and properties differently, a lot of manual effort goes into mapping and curating this data to our shop’s data model. Machine Learning can support this process and significantly speed up product offer generation. In this talk, we demonstrate a convolutional neural network model for categorizing new products into one of ~2500 product types, using images, descriptions and other attributes. Furthermore, we show how we run and retrain this model in production alongside other machine-learning assistants, and how the model affected the business workflow since its Go-Live.

Talk/ Speakers

Michael Hardegger

Lead Machine Learning Engineer, Digitec Galaxus AG

Talk/ Slides

Download the slides for this talk.Download ( PDF, 14728.52 MB)

Talk/ Highlights

20:53

A live model for improving data quality in an online shop

With Michael HardeggerPublished March 12, 2020

AMLD / Global partners