Probabilistic HIV Recency Classification – A Logistic Regression without Labeled Individual Level Training Data

04/12/2021
by Ben Sheng, et al.

Accurate HIV incidence estimation based on individual recent infection status (recent vs. long-term infection) is important for monitoring the epidemic, targeting interventions to those at greatest risk of new infection, and evaluating existing prevention and treatment programs. Since 2015, the Population-based HIV Impact Assessment (PHIA) individual-level surveys have been implemented in the most-affected countries in sub-Saharan Africa. PHIA is a nationally representative, HIV-focused survey that combines household visits with key questions and cutting-edge technologies such as biomarker tests for HIV antibody and HIV viral load. These offer a unique opportunity to distinguish recent from long-term infection and to provide relevant HIV information by age, gender, and location. In this article, we propose a semi-supervised logistic regression model for estimating individual-level HIV recency status. It incorporates information from multiple data sources: the PHIA survey, where the true HIV recency status is unknown, and cohort studies from the literature, where the relationship between HIV recency status and the covariates is presented in the form of a contingency table. It also uses national-level HIV incidence estimates from the epidemiology model. Applying the proposed model to Malawi PHIA data, we demonstrate that our approach is more accurate for individual-level estimation and more appropriate for estimating HIV recency rates at aggregated levels than the current practice, the binary classification tree (BCT).


research
09/05/2023

A Likelihood Approach to Incorporating Self-Report Data in HIV Recency Classification

Estimating new HIV infections is significant yet challenging due to the ...
research
02/14/2022

Causal Structural Learning on MPHIA Individual Dataset

The Population-based HIV Impact Assessment (PHIA) is an ongoing project ...
research
01/12/2021

Evaluation of Logistic Regression Applied to Respondent-Driven Samples: Simulated and Real Data

Objective: To investigate the impact of different logistic regression es...
research
09/21/2021

A Correlated Network Scale-up Model: Finding the Connection Between Subpopulations

Aggregated relational data (ARD), formed from "How many X's do you know?...
research
03/31/2022

Improving Biomarker Based HIV Incidence Estimation in the Treatment Era

Estimating HIV-1 incidence using biomarker assays in cross-sectional sur...
research
11/30/2020

Timely Group Updating

We consider two closely related problems: anomaly detection in sensor ne...
