Volume 37, Issue 2
Special Issue Paper

Constrained binary classification using ensemble learning: an application to cost‐efficient targeted PrEP strategies

Wenjing Zheng

Corresponding Author

E-mail address: wenjing.zheng@berkeley.edu

Division of Biostatistics, School of Public Health, University of California, Berkeley, CA, U.S.A.

Laura Balzer

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, U.S.A.

Mark van der Laan

Division of Biostatistics, School of Public Health, University of California, Berkeley, CA, U.S.A.

Maya Petersen

Division of Biostatistics, School of Public Health, University of California, Berkeley, CA, U.S.A.

the SEARCH Collaboration

Division of Biostatistics, School of Public Health, University of California, Berkeley, CA, U.S.A.

First published: 06 April 2017
Citations: 14

Abstract

Binary classification problems are ubiquitous in health and social sciences. In many cases, one wishes to balance two competing optimality considerations for a binary classifier. For instance, in resource‐limited settings, a human immunodeficiency virus (HIV) prevention program based on offering pre‐exposure prophylaxis (PrEP) to select high‐risk individuals must balance the sensitivity of the binary classifier in detecting future seroconverters (and hence offering them PrEP regimens) with the total number of PrEP regimens that is financially and logistically feasible for the program. In this article, we consider a general class of constrained binary classification problems wherein the objective function and the constraint are both monotonic with respect to a threshold. These include the minimization of the rate of positive predictions subject to a minimum sensitivity, the maximization of sensitivity subject to a maximum rate of positive predictions, and the Neyman–Pearson paradigm, which minimizes the type II error subject to an upper bound on the type I error. We propose an ensemble approach to these binary classification problems based on the Super Learner methodology. This approach linearly combines a user‐supplied library of scoring algorithms, with combination weights and a discriminating threshold chosen to minimize the constrained optimality criterion. We then illustrate the application of the proposed classifier to develop an individualized PrEP targeting strategy in a resource‐limited setting, with the goal of minimizing the number of PrEP offerings while achieving a minimum required sensitivity. This proof‐of‐concept data analysis uses baseline data from the ongoing Sustainable East Africa Research in Community Health study. Copyright © 2017 John Wiley & Sons, Ltd.
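
As a rough illustration of one problem in this class (minimizing the rate of positive predictions subject to a minimum sensitivity), the sketch below combines two scoring algorithms linearly and picks a threshold on the combined score. It is not the authors' implementation: the function names, the coarse grid search over weights, and the use of in-sample rather than cross-validated estimates are all simplifying assumptions made here for brevity.

```python
import numpy as np


def min_positive_rate_threshold(scores, labels, min_sensitivity):
    """Hypothetical helper: largest threshold meeting the sensitivity floor.

    An observation is classified positive when score >= threshold. Because
    both sensitivity and the positive-prediction rate decrease as the
    threshold rises, the largest feasible threshold minimizes the rate.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=int)
    pos_scores = np.sort(scores[labels == 1])[::-1]  # descending order
    if pos_scores.size == 0:
        raise ValueError("need at least one positive label")
    # Smallest number of true positives that satisfies the constraint
    # (small tolerance guards against floating-point round-up).
    k = max(int(np.ceil(min_sensitivity * pos_scores.size - 1e-12)), 1)
    threshold = pos_scores[k - 1]
    predicted = scores >= threshold
    sensitivity = predicted[labels == 1].mean()
    positive_rate = predicted.mean()
    return threshold, sensitivity, positive_rate


def fit_two_score_ensemble(score_matrix, labels, min_sensitivity, grid_size=11):
    """Hypothetical helper: grid search over convex weights for two scorers.

    score_matrix has one column per scoring algorithm (exactly two here).
    Returns the weights and threshold with the lowest in-sample rate of
    positive predictions among those meeting the sensitivity floor.
    """
    score_matrix = np.asarray(score_matrix, dtype=float)
    best = None
    for w in np.linspace(0.0, 1.0, grid_size):
        weights = np.array([w, 1.0 - w])
        combined = score_matrix @ weights
        thr, sens, rate = min_positive_rate_threshold(
            combined, labels, min_sensitivity
        )
        if best is None or rate < best["positive_rate"]:
            best = {
                "weights": weights,
                "threshold": thr,
                "sensitivity": sens,
                "positive_rate": rate,
            }
    return best


if __name__ == "__main__":
    # Toy data (made up for illustration): 4 positives among 10 observations.
    y = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
    score_a = [0.9, 0.8, 0.3, 0.2, 0.7, 0.4, 0.1, 0.1, 0.1, 0.1]
    score_b = [0.2, 0.3, 0.9, 0.8, 0.1, 0.6, 0.1, 0.1, 0.1, 0.1]
    fit = fit_two_score_ensemble(np.column_stack([score_a, score_b]), y, 0.75)
    print(fit)
```

In this toy example each scorer ranks a different pair of positives highly, so a combination can reach the required sensitivity while flagging fewer individuals than either scorer alone. A faithful implementation would evaluate the criterion on cross-validated predictions and support an arbitrary library of scoring algorithms.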
