Network Flow Methods for the Minimum Covariates Imbalance Problem

07/14/2020
by   Dorit S. Hochbaum, et al.
0

The problem of balancing covariates arises in observational studies where one is given a group of control samples and another group, disjoint from the control group, of treatment samples. Each sample, in either group, has several observed nominal covariates. The values, or categories, of each covariate partition the treatment and control samples to a number of subsets referred to as levels where the samples at every level share the same covariate value. We address here a problem of selecting a subset of the control group so as to balance, to the best extent possible, the sizes of the levels between the treatment group and the selected subset of control group, the min-imbalance problem. It is proved here that the min-imbalance problem, on two covariates, is solved efficiently with network flow techniques. We present an integer programming formulation of the problem where the constraint matrix is totally unimodular, implying that the linear programming relaxation to the problem has all basic solutions, and in particular the optimal solution, integral. This integer programming formulation is linked to a minimum cost network flow problem which is solvable in O(n· (n' + nlog n)) steps, for n the size of the treatment group and n' the size of the control group. A more efficient algorithm is further devised based on an alternative, maximum flow, formulation of the two-covariate min-imbalance problem, that runs in O(n'^3/2log^2n) steps.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/06/2021

Rerandomization with Diminishing Covariate Imbalance and Diverging Number of Covariates

Completely randomized experiments have been the gold standard for drawin...
research
08/12/2020

Covariate Balancing Based on Kernel Density Estimates for Controlled Experiments

Controlled experiments are widely used in many applications to investiga...
research
05/08/2019

Optimal Rerandomization via a Criterion that Provides Insurance Against Failed Experiments

We present an optimized rerandomization design procedure for a non-seque...
research
05/23/2023

Asymptotic Properties of Multi-Treatment Covariate Adaptive Randomization Procedures for Balancing Observed and Unobserved Covariates

Applications of CAR for balancing continuous covariates remain comparati...
research
05/21/2022

Conditional Balance Tests: Increasing Sensitivity and Specificity With Prognostic Covariates

Researchers often use covariate balance tests to assess whether a treatm...
research
09/17/2020

Algorithms and Complexity for Variants of Covariates Fine Balance

We study here several variants of the covariates fine balance problem wh...

Please sign up or login with your details

Forgot password? Click here to reset