Modeling the Visibility Distribution for Respondent-driven Sampling with Application to Population Size Estimation

McLaughlin, Katherine R.; Johnston, Lisa G.; Jakupi, Xhevat; Gexha-Bunjaku, Dafina; Deva, Edona; Handcock, Mark S.

Citeable URL: https://ir.library.oregonstate.edu/concern/articles/nz806743p

Descriptions

Attribute Name	Values
Creator	McLaughlin, Katherine R. Johnston, Lisa G. Jakupi, Xhevat Gexha-Bunjaku, Dafina Deva, Edona Handcock, Mark S.
Abstract	Respondent-driven sampling (RDS) is used throughout the world to estimate prevalence and population size for hidden populations. Although RDS is an effective method for enrolling people from key populations in studies, it relies on a partially unknown sampling mechanism, and thus each individual’s inclusion probability is unknown. Current estimators for population prevalence, population size, and other outcomes rely on a participant’s network size (degree) to approximate their inclusion probability in the sample from the networked population. However, in most RDS studies, a participant’s network size is attained via a self-report and is subject to many types of misreporting and bias. Because design-based inclusion probabilities cannot be exactly computed, we instead use the term visibility to describe how likely a person is to be selected to participate in the study. The commonly used successive sampling population size estimation (SS-PSE) framework to estimate population sizes from RDS data relies on self-reported network sizes in the model for the sampling mechanism. We propose an enhancement of the SS-PSE framework that adds a measurement error model for visibility used in place of the self-reported network size and a model for the number of recruits an individual can enroll. Inferred visibilities are a way to smooth the degree distribution and bring in outliers as well as a mechanism to deal with missing and invalid network sizes. We demonstrate the performance of visibility SS-PSE on three populations from Kosovo sampled in 2014 using RDS. We also discuss how the visibility modeling framework could be extended to prevalence estimation.
License	All rights reserved
Resource Type	Article
DOI	http://dx.doi.org/10.1214/23-AOAS1807
Date Issued	2024-03
Journal Title	The Annals of Applied Statistics
Journal Volume	18
Journal Issue/Number	1
Academic Affiliation	Statistics
Rights Statement	In Copyright
Funding Statement (additional comments about funding)	Funding. This material is based upon work supported by the National Science Foundation Graduate Research Fellowship under Grant No. DGE-1144087.
Publisher	Institute of Mathematical Statistics
Peer Reviewed	Yes
Language	English [eng]
ISSN	1941-7330

ScholarsArchive@OSU

Modeling the Visibility Distribution for Respondent-driven Sampling with Application to Population Size Estimation

Descriptions

Relationships

Items

ScholarsArchive@OSU

Modeling the Visibility Distribution for Respondent-driven Sampling with Application to Population Size Estimation

Downloadable Content

Descriptions

Relationships

Items