Explore-Exploit: A Framework for Interactive and Online Learning

12/01/2018
by   Honglei Liu, et al.
0

Interactive user interfaces need to continuously evolve based on the interactions that a user has (or does not have) with the system. This may require constant exploration of various options that the system may have for the user and obtaining signals of user preferences on those. However, such an exploration, especially when the set of available options itself can change frequently, can lead to sub-optimal user experiences. We present Explore-Exploit: a framework designed to collect and utilize user feedback in an interactive and online setting that minimizes regressions in end-user experience. This framework provides a suite of online learning operators for various tasks such as personalization ranking, candidate selection and active learning. We demonstrate how to integrate this framework with run-time services to leverage online and interactive machine learning out-of-the-box. We also present results demonstrating the efficiencies that can be achieved using the Explore-Exploit framework.

READ FULL TEXT

page 5

page 6

research
11/17/2017

Learning User Preferences to Incentivize Exploration in the Sharing Economy

We study platforms in the sharing economy and discuss the need for incen...
research
11/03/2011

Online Learning with Preference Feedback

We propose a new online learning model for learning with preference feed...
research
06/07/2019

Preference-based Interactive Multi-Document Summarisation

Interactive NLP is a promising paradigm to close the gap between automat...
research
06/13/2022

Scalable Exploration for Neural Online Learning to Rank with Perturbed Feedback

Deep neural networks (DNNs) demonstrate significant advantages in improv...
research
05/02/2023

Exploration of Unranked Items in Safe Online Learning to Re-Rank

Bandit algorithms for online learning to rank (OLTR) problems often aim ...
research
02/10/2023

Transactional Panorama: A Conceptual Framework for User Perception in Analytical Visual Interfaces

Many tools empower analysts and data scientists to consume analysis resu...
research
05/04/2018

Time-on-Task Estimation with Log-Normal Mixture Model

We describe a method of estimating a user's time-on-task in an online le...

Please sign up or login with your details

Forgot password? Click here to reset