
-
Semantic Annotation for Tabular Data
Detecting semantic concept of columns in tabular data is of particular i...
read it
-
Fair Data Integration
The use of machine learning (ML) in high-stakes societal decisions has e...
read it
-
Efficient and Effective ER with Progressive Blocking
Blocking is a mechanism to improve the efficiency of Entity Resolution (...
read it
-
Adaptive Rule Discovery for Labeling Text Data
Creating and collecting labeled data is one of the major bottlenecks in ...
read it
-
Balancing the Tradeoff Between Clustering Value and Interpretability
Graph clustering groups entities – the vertices of a graph – based on th...
read it
-
Min-Max Correlation Clustering via MultiCut
Correlation clustering is a fundamental combinatorial optimization probl...
read it
-
Lexicographically Ordered Multi-Objective Clustering
We introduce a rich model for multi-objective clustering with lexicograp...
read it
-
Connectivity in Random Annulus Graphs and the Geometric Block Model
Random geometric graphs are the simplest, and perhaps the earliest possi...
read it
-
The Geometric Block Model
To capture the inherent geometric features of many community detection p...
read it
-
Fairness Testing: Testing Software for Discrimination
This paper defines software fairness and discrimination and develops a t...
read it