The sensitivity of respondent-driven sampling
Article first published online: 18 JUL 2011
DOI: 10.1111/j.1467-985X.2011.00711.x
© 2011 Royal Statistical Society
Issue

Journal of the Royal Statistical Society: Series A (Statistics in Society)
Volume 175, Issue 1, pages 191–216, January 2012
Additional Information
How to Cite
Lu, X., Bengtsson, L., Britton, T., Camitz, M., Kim, B. J., Thorson, A. and Liljeros, F. (2012), The sensitivity of respondent-driven sampling. Journal of the Royal Statistical Society: Series A (Statistics in Society), 175: 191–216. doi: 10.1111/j.1467-985X.2011.00711.x
Publication History
- Issue published online: 17 JAN 2012
- Article first published online: 18 JUL 2011
- [Received February 2010. Final revision March 2011]
- Abstract
- Article
- References
- Cited By
Keywords:
- Directed network;
- Hidden population;
- Network;
- Respondent-driven sampling;
- Sampling;
- Sensitivity
Summary. Researchers in many scientific fields make inferences from individuals to larger groups. For many groups, however, there is no list of members from which to draw a random sample. Respondent-driven sampling (RDS) is a relatively new sampling methodology that circumvents this difficulty by using the social networks of the groups under study. The RDS method has been shown to provide unbiased estimates of population proportions given certain conditions. The method is now widely used in human immunodeficiency virus related studies among high risk populations globally. We test the RDS methodology by simulating RDS studies on the social networks of a large Lesbian, gay, bisexual and transgender Web community. The robustness of the RDS method is tested by violating, one by one, the conditions under which the method provides unbiased estimates. Simulations indicate that the bias is large if networks are directed or respondents choose to invite people on the basis of characteristics that are correlated with the study outcomes. The bias and variance increase if participants invite close as opposed to more distant friends whereas sampling in denser networks sharply reduces variance. However, the RDS method shows strong resistance to sampling without replacement, low response rates and certain errors in the participants’ reporting of their network sizes, as well as the selection criteria of seeds. The effects of network structure and the number of seeds and coupons are also discussed.

1467-985X/asset/olbannerleft.gif?v=1&s=d0aa82ec470e65ec5f60e48a22c6dd75ac35e130)
1467-985X/asset/olbannercenter.jpg?v=1&s=21d3c336b838f00aebdd912c80e387904b3f80fa)
1467-985X/asset/olbannerright.jpg?v=1&s=9f239571c70925a8f6aa24b97820137d1c749563)