Disclosure Statement: The authors report no conflicts of interests.
Performance of automated and manual coding systems for occupational data: A case study of historical records†
Article first published online: 27 DEC 2011
Copyright © 2011 Wiley Periodicals, Inc.
American Journal of Industrial Medicine
Volume 55, Issue 3, pages 228–231, March 2012
How to Cite
Patel, M. D., Rose, K. M., Owens, C. R., Bang, H. and Kaufman, J. S. (2012), Performance of automated and manual coding systems for occupational data: A case study of historical records . Am. J. Ind. Med., 55: 228–231. doi: 10.1002/ajim.22005
- Issue published online: 8 FEB 2012
- Article first published online: 27 DEC 2011
- Manuscript Accepted: 11 DEC 2011
- National Institutes of Health. Grant Number: R01-HL081627
- National Heart, Lung, and Blood Institute. Grant Numbers: HHSN268201100005C, HSN268201100006C, HHSN268201100007C, HHSN268201100008C, HSN268201100009C, HHSN268201100010C, HHSN268201100011C, HHSN268201100012C
- occupational coding;
- automatic data processing;
- computer systems;
- occupation classification;
- social class
Occupational data are a common source of workplace exposure and socioeconomic information in epidemiologic research. We compared the performance of two occupation coding methods, an automated software and a manual coder, using occupation and industry titles from U.S. historical records.
We collected parental occupational data from 1920–40s birth certificates, Census records, and city directories on 3,135 deceased individuals in the Atherosclerosis Risk in Communities (ARIC) study. Unique occupation-industry narratives were assigned codes by a manual coder and the Standardized Occupation and Industry Coding software program. We calculated agreement between coding methods of classification into major Census occupational groups.
Automated coding software assigned codes to 71% of occupations and 76% of industries. Of this subset coded by software, 73% of occupation codes and 69% of industry codes matched between automated and manual coding. For major occupational groups, agreement improved to 89% (kappa = 0.86).
Automated occupational coding is a cost-efficient alternative to manual coding. However, some manual coding is required to code incomplete information. We found substantial variability between coders in the assignment of occupations although not as large for major groups. Am. J. Ind. Med. 55:228–231, 2012. © 2011 Wiley Periodicals, Inc.