HARRIS: Hybrid Ranking and Regression Forests for Algorithm Selection

10/31/2022
by   Lukas Fehring, et al.
0

It is well known that different algorithms perform differently well on an instance of an algorithmic problem, motivating algorithm selection (AS): Given an instance of an algorithmic problem, which is the most suitable algorithm to solve it? As such, the AS problem has received considerable attention resulting in various approaches - many of which either solve a regression or ranking problem under the hood. Although both of these formulations yield very natural ways to tackle AS, they have considerable weaknesses. On the one hand, correctly predicting the performance of an algorithm on an instance is a sufficient, but not a necessary condition to produce a correct ranking over algorithms and in particular ranking the best algorithm first. On the other hand, classical ranking approaches often do not account for concrete performance values available in the training data, but only leverage rankings composed from such data. We propose HARRIS- Hybrid rAnking and RegRessIon foreSts - a new algorithm selector leveraging special forests, combining the strengths of both approaches while alleviating their weaknesses. HARRIS' decisions are based on a forest model, whose trees are created based on splits optimized on a hybrid ranking and regression loss function. As our preliminary experimental study on ASLib shows, HARRIS improves over standard algorithm selection approaches on some scenarios showing that combining ranking and regression in trees is indeed promising for AS.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2021

Regression with Missing Data, a Comparison Study of TechniquesBased on Random Forests

In this paper we present the practical benefits of a new random forest a...
research
11/18/2013

Ranking Algorithms by Performance

A common way of doing algorithm selection is to train a machine learning...
research
07/20/2021

Algorithm Selection on a Meta Level

The problem of selecting an algorithm that appears most suitable for a s...
research
11/04/2021

Label Ranking through Nonparametric Regression

Label Ranking (LR) corresponds to the problem of learning a hypothesis t...
research
03/15/2018

Minimax optimal rates for Mondrian trees and forests

Introduced by Breiman (2001), Random Forests are widely used as classifi...
research
10/30/2014

A random forest system combination approach for error detection in digital dictionaries

When digitizing a print bilingual dictionary, whether via optical charac...
research
05/07/2015

DART: Dropouts meet Multiple Additive Regression Trees

Multiple Additive Regression Trees (MART), an ensemble model of boosted ...

Please sign up or login with your details

Forgot password? Click here to reset