A transparent approach to data representation

04/27/2023
by Sean Deyo, et al.

We use a binary attribute representation (BAR) model to describe a data set of Netflix viewers' ratings of movies. We classify the viewers with discrete bits rather than continuous parameters, which makes the representation compact and transparent. The attributes are easy to interpret, and we need far fewer attributes than similar methods do to achieve the same level of error. We also take advantage of the nonuniform distribution of ratings among the movies in the data set to train on a small selection of movies without compromising performance on the rest of the movies.
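The idea of training on a small selection of movies can be illustrated with a minimal sketch. It assumes the selection is made by rating count (keep the most-rated movies); the abstract does not state the actual selection rule. The function name `most_rated_movies` and the toy data are hypothetical and serve only as illustration.

```python
from collections import Counter

def most_rated_movies(ratings, k):
    """Return the k movie ids with the most ratings and the share of all ratings they cover.

    `ratings` is a list of (viewer, movie, stars) tuples. Selecting by rating
    count is an assumption made for this sketch, not the paper's stated rule.
    """
    counts = Counter(movie for _viewer, movie, _stars in ratings)
    top = [movie for movie, _count in counts.most_common(k)]
    covered = sum(counts[movie] for movie in top)
    return top, covered / len(ratings)

# Toy data (hypothetical, not taken from the Netflix data set).
toy_ratings = [
    (0, "A", 5), (1, "A", 4), (2, "A", 3), (3, "A", 5),
    (0, "B", 2), (1, "B", 1), (2, "B", 2),
    (0, "C", 4),
    (3, "D", 3),
]

train_movies, share = most_rated_movies(toy_ratings, 2)
print(train_movies, round(share, 2))
```

When ratings are concentrated on a few movies, as in the toy data, a small training selection already covers most of the ratings, which is the property the abstract says the authors exploit.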


