Active Sampling of Pairs and Points for Large-scale Linear Bipartite Ranking

08/24/2017
by Wei-Yuan Shen, et al.

Bipartite ranking is a fundamental ranking problem that learns to order relevant instances ahead of irrelevant ones. The pair-wise approach to bipartite ranking constructs a quadratic number of pairs to solve the problem, which is infeasible for large-scale data sets. The point-wise approach, albeit more efficient, often results in inferior performance. That is, it is difficult to conduct bipartite ranking accurately and efficiently at the same time. In this paper, we develop a novel active sampling scheme within the pair-wise approach to conduct bipartite ranking efficiently. The scheme is inspired by active learning and can reach competitive ranking performance while focusing on only a small subset of the many pairs during training. Moreover, we propose a general Combined Ranking and Classification (CRC) framework to conduct bipartite ranking accurately. The framework unifies the point-wise and pair-wise approaches and is based simply on the idea of treating each instance point as a pseudo-pair. Experiments on 14 real-world large-scale data sets demonstrate that the proposed algorithm of Active Sampling within CRC, when coupled with a linear Support Vector Machine, usually outperforms state-of-the-art point-wise and pair-wise ranking approaches in terms of both accuracy and efficiency.
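To make the cost argument concrete, the following is a minimal sketch, not the paper's algorithm. It shows why the full pair-wise formulation grows as the product of the two class sizes, and how a sampled subset of pairs and point-derived pseudo-pairs could be built for a linear model. All function names are hypothetical. The uncertainty rule in `sample_uncertain_pairs` is a generic active-learning heuristic assumed for illustration, and pairing each point with the origin in `pseudo_pairs` is one assumed reading of "pseudo-pair"; the abstract specifies neither.

```python
import numpy as np

def all_pair_differences(X_pos, X_neg):
    # Every (positive, negative) pair as a difference vector.
    # The result has |P| * |N| rows, which is what makes the
    # full pair-wise approach infeasible on large data sets.
    d = X_pos.shape[1]
    return (X_pos[:, None, :] - X_neg[None, :, :]).reshape(-1, d)

def sample_uncertain_pairs(X_pos, X_neg, w, n_candidates, n_keep, rng):
    # ASSUMPTION: illustrative sampling rule, not the paper's scheme.
    # Draw random candidate pairs, then keep the ones the current linear
    # model w is least sure about (smallest |w . (x_pos - x_neg)|).
    i = rng.integers(0, len(X_pos), size=n_candidates)
    j = rng.integers(0, len(X_neg), size=n_candidates)
    diffs = X_pos[i] - X_neg[j]
    order = np.argsort(np.abs(diffs @ w))
    return diffs[order[:n_keep]]

def pseudo_pairs(X, y):
    # ASSUMPTION: a point with label y in {+1, -1} is paired with the
    # origin, so it contributes the signed vector y * x.
    return y[:, None] * X

def auc(w, X_pos, X_neg):
    # Fraction of (positive, negative) pairs ranked correctly by w,
    # counting ties as half correct.
    margins = (X_pos @ w)[:, None] - (X_neg @ w)[None, :]
    return float(np.mean(margins > 0) + 0.5 * np.mean(margins == 0))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X_pos = rng.normal(1.0, 1.0, size=(300, 5))
    X_neg = rng.normal(-1.0, 1.0, size=(400, 5))
    w = np.ones(5)
    print("full pairs:", all_pair_differences(X_pos, X_neg).shape[0])
    sampled = sample_uncertain_pairs(X_pos, X_neg, w, 2000, 200, rng)
    print("sampled pairs:", sampled.shape[0])
    print("AUC of w:", auc(w, X_pos, X_neg))
```

In this sketch, the sampled difference vectors and the pseudo-pairs share the same feature dimension, so they could be stacked into one training set for a linear classifier such as a linear SVM, which is the spirit of combining ranking and classification.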
