Back to previous page
Workshop Publication

Data Pipelines for Personalized Exploration of Rated Datasets

CNRS, France
Sihem Amer-Yahia
Anh Tho Le, Eric Simon
accepted in:
BIAS 2020
Yet to be published
July 23, 2021
Read full article

Rated datasets are characterized by a combination of user demographics such as age and occupation, and user actions such as rating a movie or reviewing a book. Their exploration can greatly benefit end-users in their daily life. As data consumers are being empowered, there is a need for a tool to express end-to-end data pipelines for the personalized exploration of rated datasets. Such a tool must be easy to use as several strategies need to be tested by end-users to find relevant information. In this work, we develop a framework based on mining labeled segments of interest to the data consumer. The difficulty is to find segments whose demographics and rating behavior are both relevant to the data consumer. The variety of ways to express that task fully justifies the need for a productive and effective programming environment to express various data pipelines at a logical level. We examine how to do that and validate our findings with experiments on real rated datasets.

Will be available soon to download

Get in touch

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form, try again please.