Exploring and Exploiting Data Heterogeneity in Recommendation

05/21/2023
by   Zimu Wang, et al.
0

Massive amounts of data are the foundation of data-driven recommendation models. As an inherent nature of big data, data heterogeneity widely exists in real-world recommendation systems. It reflects the differences in the properties among sub-populations. Ignoring the heterogeneity in recommendation data could limit the performance of recommendation models, hurt the sub-populational robustness, and make the models misled by biases. However, data heterogeneity has not attracted substantial attention in the recommendation community. Therefore, it inspires us to adequately explore and exploit heterogeneity for solving the above problems and assisting data analysis. In this work, we focus on exploring two representative categories of heterogeneity in recommendation data that is the heterogeneity of prediction mechanism and covariate distribution and propose an algorithm that explores the heterogeneity through a bilevel clustering method. Furthermore, the uncovered heterogeneity is exploited for two purposes in recommendation scenarios which are prediction with multiple sub-models and supporting debias. Extensive experiments on real-world data validate the existence of heterogeneity in recommendation data and the effectiveness of exploring and exploiting data heterogeneity in recommendation.

READ FULL TEXT
research
04/01/2023

Predictive Heterogeneity: Measures and Applications

As an intrinsic and fundamental property of big data, data heterogeneity...
research
05/30/2022

Towards Fair Federated Recommendation Learning: Characterizing the Inter-Dependence of System and Data Heterogeneity

Federated learning (FL) is an effective mechanism for data privacy in re...
research
09/13/2021

ARGO: Modeling Heterogeneity in E-commerce Recommendation

Nowadays, E-commerce is increasingly integrated into our daily lives. Me...
research
05/19/2021

Heterogeneous Contrastive Learning

With the advent of big data across multiple high-impact applications, we...
research
05/30/2022

On the External Validity of Average-Case Analyses of Graph Algorithms

The number one criticism of average-case analysis is that we do not actu...
research
11/25/2022

Group Buying Recommendation Model Based on Multi-task Learning

In recent years, group buying has become one popular kind of online shop...
research
03/06/2023

Towards Capacity-Aware Broker Matching: From Recommendation to Assignment

Online real estate platforms are gaining increasing popularity, where a ...

Please sign up or login with your details

Forgot password? Click here to reset