The American Cancer Society Cancer Prevention Study II Nutrition Cohort

Rationale, study design, and baseline characteristics




Large-scale, prospective cohort studies have played a critical role in discovering factors that contribute to variability in cancer risk in human populations. Epidemiologists and volunteers at the American Cancer Society (ACS) were among the first to establish such cohorts, beginning in the early 1950s and continuing through the present, and these ACS cohorts have made landmark contributions in many areas of epidemiologic research.


The Cancer Prevention Study II Nutrition Cohort was established in 1992 and was designed to investigate the relation between diet and other lifestyle factors and exposures and the risk of cancer, mortality, and survival. The cohort includes over 84,000 men and 97,000 women who completed a mailed questionnaire in 1992. New questionnaires are sent to surviving cohort members every other year to update exposure information and to ascertain new occurrences of cancer; a 90% response rate was achieved for follow-up questionnaires in 1997 and 1999. Reported cancers are verified through medical records, registry linkage, or death certificates. The cohort is followed actively for all cases of incident cancer and for all causes of death. Through a collaborative effort among ACS national and division staff, volunteers, and the American College of Surgeons, blood samples were collected from a subgroup of 40,000 cohort members and are in storage at a central repository for future investigation of dietary, hormonal, genetic, and other factors and cancer risk. Collection of DNA samples from buccal cells in an additional 50,000 cohort members is underway currently and will be completed in 2002.


This new cohort of both men and women promises to be particularly valuable for the study of cancer occurrence, mortality, and survival as they relate to obesity and weight change, physical activity at various points in life, vitamin supplement use, exogenous hormone use, other medications (such as aspirin and nonsteroidal anti-inflammatory drugs) and cancer screening modalities. Cancer 2002;94:500–11. © 2002 American Cancer Society.

The evolution of the modern cohort study has received particular attention recently.1, 2 Early cohorts typically addressed very specific exposure issues and generally were limited in their ability to control for confounding. More recently, large cohort studies have collected questionnaire information and biospecimens to assess a comprehensive array of lifestyle, medical, and other risk factors as potential determinants of a wide range of specific diseases.

Newer cohorts have taken advantage of structured dietary questionnaires developed in the 1980s and early 1990s that can be administered to large groups of people and that discriminate among people with respect to dietary intake.3 More than 30 prospective cohorts with a focus on dietary assessment have been assembled to date; taken together, these cohorts include over 3 million people, primarily from North America and Europe.3

It is interesting to note that, in contrast to early cohorts, women substantially outnumber men in most of these contemporary cohorts. Only four cohorts have enrolled 80,000 or more men; these include the EPIC multicenter cohort study conducted in nine European countries,4 the Multiethnic Cohort study conducted in African-American, Latino, Japanese, Native Hawaiian, and white populations living in Hawaii and California,5 the National Cancer Institute Diet and Health Study conducted in members of the American Association of Retired Persons,3 and the American Cancer Society (ACS) Nutrition Cohort conducted in middle-aged to elderly men and women living in 21 states in the United States.6 In this article, we present the rationale, study design, data collection, and descriptive characteristics of the ACS Nutrition Cohort.



In 1952, the ACS began its first large-scale prospective cohort study, the Hammond–Horn Study,7, 8 to examine the effect of cigarette smoking on death rates from cancer and other diseases (Table 1). This landmark research effort was one of the first cohort studies to establish the association of smoking with lung carcinoma, heart disease, and all-cause mortality; it set the methodologic foundation for the two subsequent Cancer Prevention Study cohorts (CPS-I and CPS-II).

Table 1. Characteristics of American Cancer Society Cohorts
  • CPS-II: Cancer Prevention Study II.

  • a

    Data from the Hammond–Horn cohort have been lost.

Study design
 Study participants187,7831,051,0381,185,106184,19439,200
 Volunteer recruiters22,00068,00077,00070002000
 Yrs of follow-up1952–19551959–19721982 (ongoing)1992 (ongoing)1998 (ongoing)
 No. of states925502120
 Outcome of interestMortalityMortalityMortalityIncidence and mortalityIncidence and mortality
 Women (%)057575356
 White (%)10097939798
 Black (%)02411
 Married (%)?a85838894
 College graduate (%)?a17303846
 Age at entry (median) (yrs)?a52576370
 Age range (yrs)50–6930–10830–11140–9249–96
 Current cigarette smokers (%)57292073

Compared with the Hammond–Horn Study, both CPS-I and CPS-II were designed to address a wide range of potential exposures, in addition to tobacco use, that may be associated with cancer (Table 1). CPS-I data have resulted in the publication of more than 100 scientific articles, including analyses of smoking,9–12 body weight,13–18 air pollution,19 and risk factors for breast carcinoma20–22 in relation to mortality. Whereas the Hammond–Horn data have been lost, CPS-I data continue to be analyzed.

CPS-II was launched in 1982 using volunteers to enroll approximately 1.2 million men and women in all 50 states, the District of Columbia, and Puerto Rico (Table 1).23 Participants completed a four-page confidential questionnaire that included questions on demographic characteristics, height and weight, personal and family history of cancer and other diseases, use of medicines and vitamins, reproductive history and hormone use (women), occupational exposures, brief dietary and physical activity assessments, and alcohol and tobacco use. Exposure information for the entire CPS-II cohort was collected only once at baseline in 1982. Mortality follow-up is ongoing and is accomplished through linkage with the National Death Index24; 283,600 deaths have occurred in the cohort through December 31, 1998. Over 100 publications from CPS-II address a broad range of lifestyle factors, including alcohol use,25 exogenous hormones,26–30 aspirin,31, 32 vitamin supplements,33, 34 air pollution,35 tobacco use,36–43 family history of cancer,44–46 occupation,47, 48 and adiposity.49–52 Analysis of CPS-II data in relation to mortality is ongoing. Complete citations for all publications can be viewed on the ACS website at

CPS-II Nutrition Cohort

Selection of participants

The CPS-II Nutrition Cohort was established in 1992 and 1993 as a subgroup of the larger CPS-II Cohort with three primary objectives: 1) to obtain detailed information on dietary and other exposures and to periodically update this information, 2) to identify incident cases of cancers as well as deaths, and 3) to evaluate dietary and other risk factors for cancer incidence in a prospective study of men and women large enough to study important specific cancers within a reasonable follow-up period. We estimated that a cohort size of at least 80,000 men and 80,000 women would provide excellent statistical power (> 90%) within 5 years of follow-up to detect rate ratios as low as 1.3 for associations between incident breast carcinoma (1500 expected incidents) or prostate carcinoma (2500 expected incidents) and exposure prevalence of at least 10%.53 Rate ratios of 1.5 could be detected with at least 90% power in gender specific analyses of colon carcinoma (600–900 expected incidents).53

In 1992 and 1993, a baseline questionnaire (described below) was mailed to 516,671 CPS-II men and women, ages 50–74 years, who resided in 21 states with population-based state cancer registries reported to ascertain at least 90% of incident cancer cases.54 The states included California, Connecticut, Florida, Georgia, Illinois, Iowa, Louisiana, Maryland, Massachusetts, Michigan, Minnesota, Missouri, New Jersey, New Mexico, New York, North Carolina, Pennsylvania, Utah, Virginia, Washington, and Wisconsin. These states were chosen originally to enable follow-up of cancer incidence using computerized linkage with state cancer registries.

Development of the 1992–1993 questionnaire

The dietary assessment instrument used initially in the CPS-II Nutrition Cohort was a self-administered food frequency questionnaire (FFQ) that addressed usual eating habits over the past year. The questionnaire was a modification of the brief 60-item Health Habits and History Questionnaire (HHHQ) developed by Block and colleagues at the National Cancer Institute.55 The brief HHHQ food list was developed based on the contributions of different foods to total population intake of energy and 17 other nutrients in the Second National Health and Nutrition Examination Survey (NHANES II) data and represents over 80% of dietary contribution to those intakes in the United States population.55 Three portion sizes (small, medium, and large) are associated with each food item and were derived from gender and age specific portion size distributions in the NHANES II. In addition to portion size, the HHHQ assesses frequency of consumption for each item. Eight food items derived from the full version of the HHHQ56 were added to the brief HHHQ for the Nutrition Cohort: other fruits, other vegetables, fats on vegetables, biscuits/muffins, yogurt, low-fat yogurt, tuna, and noodles/pasta. Additional questions derived from the full HHHQ or from the semiquantitative FFQ developed by Willett and colleagues3, 57 included changes in diet since 1982, degree to which red meat typically was cooked, and frequency of consumption of restaurant foods, low-fat foods, fats, and red meat. Questions pertaining to use of vitamin and mineral supplements also were included. In addition to the large section on dietary assessment, the questionnaire included briefer sections on demographics, medical history, weight history, smoking history, physical activity, medication use, occupational history, and, for women, breast and cervical screening, hormone replacement therapy, and family history of breast carcinoma.

Data collection and response

In the fall of 1992, the 10-page self-administered questionnaire was sent to 159,716 CPS-II men and their 140,780 CPS-II spouses ages 50–69 years and presumed to be living in the 21 states listed above. In response to this single mailing, we received completed, usable questionnaires from 55,105 CPS-II men and 50,937 CPS-II women who were still resident in the 21 states. To assemble both male and female cohorts of at least 80,000 individuals, we remailed the questionnaire in the fall of 1993. This mailing again was restricted to residents of the 21 selected states and was sent to all nonrespondents from the first mailing, to an additional 66,371 men ages 70–74 years and an additional 149,804 women ages 70–74 years or women ages 50–69 years who had not been included in 1992 because they were not living with a CPS-II male spouse. With this additional mailing, we received completed, usable questionnaires from a total of 86,406 men and 97,788 women ages 50–74 years (Table 1).

Validity and reproducibility of the FFQ

The FFQ used in the CPS-II Nutrition Cohort was validated in a stratified random sample of 244 men (mean age, 62 years) and 197 women (mean age, 60 years) who completed the survey prior to August 1993, were cancer free, and did not report significant changes in diet or weight in the year prior to survey completion.58 The validation study consisted of four 24-hour dietary recalls obtained by telephone for each person in the sample; telephone interviews were conducted from January 1994 to May 1995.

For total energy and most nutrients, intakes assessed by the Nutrition Cohort FFQ were lower than those from the 24-hour dietary recalls. Validity correlation coefficients for 10 food groups ranged from 0.1 for fats, oil, sweets, and snacks in women to 0.8 for alcoholic beverages in men, with a median correlation for all 10 food groups of 0.6 for both men and women. Validity coefficients for grams of energy-related macronutrients (protein, carbohydrate, ethanol, dietary fiber, and total and saturated fats) ranged from 0.3 (protein in men) to 0.8 (ethanol in men) and median values also were 0.6 for each gender group. Lower median validity correlations were observed for eight dietary vitamins and minerals: 0.4 in men and 0.3 in women. Median values for these micronutrients were influenced particularly by low correlation coefficients for zinc (< 0.2 in both men and women) and vitamin E (< 0.3 in both men and women). Higher correlations were seen for vitamin C (0.7 in men and women), calcium (0.6 in men and 0.7 in women) and folate (0.5 in men and 0.4 in women). Estimates for micronutrients from both the FFQ and the 24-hour recalls were from dietary sources only, exclusive of supplement use. Median values for reproducibility, based on a repeat administration of the FFQ, exceeded 0.7 for food groups, macronutrients and fats, and micronutrients among both men and women.

Nutrient data base

In preparation for nutritional analysis of the CPS-II Nutrition Cohort data, we estimated nutrient and food group intakes for each participant using the Dietary Analysis System (version 3.8a; Division of Cancer Prevention and Control, National Cancer Institute, Bethesda, MD).56 Modifications were made to the standard analysis configurations to account for differences between the FFQ used in the CPS-II Nutrition Cohort and the standard brief version of the HHHQ. We supplemented the nutrient data base with additional nutrient measures of interest, including glycemic load,59 vitamin D,60 and ethanol.61

Repeated exposure measurement

When the Nutrition Cohort originally was assembled in 1992 and 1993, we did not have funding to administer future questionnaires. Approval and funding for periodic resurveys were received in 1996, and we currently are recontacting the Nutrition Cohort members every 2 years using a mailed, self-administered questionnaire.

Accordingly, a new questionnaire was mailed to the 176,537 living members of the Nutrition Cohort in 1997–1998. This survey included questions on personal and family history of cancer, screening history (for breast, colorectal, cervical, and prostate carcinoma), smoking status, physical activity, weight, waist size, hormone replacement therapy, vitamins, and medications. Close to 91% of living cohort members completed and returned this questionnaire (n = 160,403 cohort members). To achieve this level of response, we mailed the questionnaire six times at 2–3-month intervals. The first four mailings sent the complete questionnaire, and the next two mailings sent a much shorter version of the questionnaire that included questions on personal history of cancer and a few key exposure variables. We used certified mail for the final mailing. Staff members at the ACS Call Center in Austin, Texas, made telephone calls to remaining nonresponders and administered a short phone survey that was identical in content to the short mailed questionnaire. Ninety percent of the 160,403 respondents completed the long mailed questionnaire, 8% completed the short mailed questionnaire, and 2% completed the short phone questionnaire.

Our most recent questionnaire was sent to the 166,773 living cohort members in 1999–2000 using the mailing strategy outlined above. This questionnaire was devoted primarily to the administration of a second FFQ to obtain a more recent and complete assessment of diet. This FFQ is based on the 130-item FFQ developed by Willett and colleagues in 1986 (Zimm et al.62 and Willet63). Just over 90% of living cohort members (n = 150,362 cohort members) completed any version of this questionnaire; 131,278 cohort members completed the long version, including the FFQ. The 2001–2003 questionnaire repeats many of the questions asked in 1997 and includes new topics, such as herbal supplement use, sun exposure, and limitations in activities of daily living.

All questionnaires for the Nutrition Cohort have been designed for self-administration and for processing by optical scanning. Mailing, receiving, and scanning of the questionnaires have been conducted by outside contractors with the capacity for high-volume data processing. All questionnaires can be viewed on the ACS website at

Cancer incidence follow-up

We are capturing information about incident cancers by requesting this information in the periodic questionnaires to the study cohort and then validating reported incidents of cancer through medical records acquisition. Cohort members provide written consent for medical records release. Pilot work linking the Nutrition Cohort members to four state cancer registries has indicated that the ability of our respondents to report a past diagnosis of cancer accurately is very high (sensitivity = 0.93 for the report of any cancer).64 False positive reports were rare (specificity > 0.99). Additional work with 11 more registries yielded a range in sensitivity of 0.92–0.97 for a match on any cancer reported by the registry (data not published). Thus, we were confident that a system that validates self-reports of cancer in this cohort could acquire additional information on up to 92–97% of all cancers without incurring the large costs potentially associated with attempting to verify many false reports. This pilot work suggested that verifying participant reports of cancer would provide a timely and uniform method of ascertaining new (incident) cases. Verification includes checking for confirmation of any cancer diagnosis; it is not restricted to the specific disease site that was reported by the participant.

An incident cancer is defined as a cancer (other than nonmelanoma skin carcinoma) that reportedly occurred during the interval since the previous questionnaire or that was recorded on a death certificate during the interval since the previous questionnaire among cohort members who did not report the cancer at the beginning of the interval. Respondents to the 1997–1998 questionnaire reported 10,272 incident cancers in the interval since baseline (1992–1993) (Table 2). We have obtained medical records for 73% of these cases. In addition, 4219 deaths from cancer (as recorded on the death certificate) occurred in the same interval among cohort members who did not respond to the 1997–1998 questionnaire and did not report the cancer in 1992–1993 (Table 2). We are linking with state cancer registries as needed for follow-up of specific reported cases of cancer or cancer deaths when medical records are not available. Through this mechanism, we have obtained registry records for an additional 14% of reported cancer cases and for 60% of the interval cancer deaths. Verification of incident cancers currently is ongoing for cancers reported during the interval from 1997–1998 to 1999–2000.

Table 2. Incident Cancers in the Cancer Prevention Study II Nutrition Cohort, 1992/1993 to 1997/1998
  • NHL: non-Hodgkin lymphoma.

  • a

    Reported on the 1997/1998 questionnaire by cohort members who did not report the cancer in 1992/1993.

  • b

    Recorded on a death certificate through December 31, 1997 for cohort members who did not complete the 1997/1998 questionnaire and who did not report the cancer in 1992/1993.


Collection of blood samples: The LifeLink Cohort

The CPS-II LifeLink Cohort was initiated in 1998 with the objective of collecting and storing blood samples from 40,000 members of the Nutrition Cohort. These donors were recruited from the approximately 142,000 surviving members of the cohort who lived in urban and suburban areas. Cohort members in more rural areas were not recruited for logistic reasons. Cohort members received an invitation by mail (nonrespondents were sent a second similar letter), and those who mailed back a form indicating their willingness to participate were telephoned to schedule a blood draw appointment at a participating hospital in their community. Blood sample collection was coordinated by ACS staff and volunteers and was performed by hospital staff at 312 community hospitals in 20 states (blood samples were not collected in Louisiana, because potential participants were dispersed geographically throughout the state). Hospitals were recruited from those approved by the American College of Surgeons (ACOS) Commission on Cancer with the assistance of the ACOS. Collection of blood samples was subject to Institutional Review Board approval in individual hospitals. Collection of blood samples for LifeLink was completed in June 2001. A total of 39,200 Nutrition Cohort members gave a single blood sample. In addition, we collected a second blood sample from 322 individuals to assess within-person variability in levels of blood analytes over time.

We collected a maximum of 43 mL of nonfasting whole blood from each participant drawn in two 15-mL tubes containing the anticoagulant ethylenediamine tetraacetic acid (EDTA) and a 13-mL serum separator tube (Fig. 1). The two 15-mL EDTA tubes yielded about 12 mL of plasma and 4 mL of red blood cells (additional red blood cell volume was discarded) as well as the buffy coat fraction, from which at least 340 μg of DNA can be extracted.65 Samples were chilled on ice or stored in a refrigerator during blood collection. Hospital staff centrifuged the serum separator tubes to separate serum from cellular blood components prior to shipping, and all samples were then shipped in foam containers with coolant packs overnight by Express Mail to a central repository in Rockville, Maryland. Shipments of blood from the field were received, unpacked, aliquoted, and frozen in liquid nitrogen vapor phase at approximately −130 °C for long-term storage. All samples at the central repository are identified by bar code only.

Figure 1.

Diagram of the LifeLink aliquoting protocol.

We collected 450 mL of blood from each of 28 volunteer donors to create several different quality control (QC) pools. These will be used to evaluate the reliability of measurements made by collaborating laboratories, both before beginning an analysis and on an ongoing basis during all analyses. Specifically, one or more pair(s) of blinded identical samples from these QC pools will be interspersed with each batch of participant samples to be analyzed, and the concordance of results from these identical QC samples will be monitored. Aliquots from the QC pools are stored in the same liquid nitrogen freezers as the participant samples.

Collection of additional DNA from buccal cells.

In January 2001, we began an initiative to collect DNA from cohort members who were unable or unwilling to give a blood sample or who were not invited to give blood because they lived in a rural (i.e., geographically dispersed) area. Collection of buccal (cheek) cells is a quick, cost-effective, and noninvasive alternative means to obtain DNA. All living members of the Nutrition Cohort who did not give a blood sample (approximately 125,000 individuals) are being invited to provide a cheek cell sample by mail. Participants receive a sample collection kit that includes mouthwash, collection cups, instructions, and a postage-paid return envelope. They are instructed to swish the mouthwash in their mouths for 30 seconds, spit the sample into the cup, and return the sample in the envelope provided. This mouthwash method will yield approximately 30 μg of high-quality human DNA per participant.66 Pilot work and early collection efforts have indicated that we will be able to collect a DNA sample from an additional 50,000 cohort members using this method. We anticipate that collection of buccal cell samples will be complete by mid-2002. Buccal cell DNA will be stored with the blood sample components at the central repository and will be identified by bar code only.

Protection of participants.

All aspects of the CPS-II protocol have been reviewed and approved by the Emory University School of Medicine Human Investigations Committee. Signed informed consent is received from participants to obtain medical records and from participants who donate blood or buccal cell samples. At the time of each mailed questionnaire, cohort members are informed that we use their identifying information to link with cancer registries and death indexes. In addition, in accordance with the provisions of section 301(d) of the Public Health Service Act (42 U.S. C. 241[d]), a Confidentiality Certificate has been issued for the CPS-II Nutrition Cohort by the National Cancer Institute. This certificate provides additional protection to study participants by helping ACS researchers avoid involuntary disclosures of participants' identifying information, which could expose them to adverse economic, psychological, and social consequences.

Characteristics of Nutrition Cohort Members

Because the majority of our prospective analyses will be conducted among individuals who were cancer free at baseline in 1992–1993, we present characteristics of cohort members after excluding 9358 men and 12,428 women who reported a history of cancer (other than nonmelanoma skin cancer). At baseline, virtually all members of this cohort were between ages 50 years and 79 years, with > 50% between ages 60 years and 69 years (Table 3). The cohort is largely white (97%), married (89%), and has a relatively high level of education (38% are college graduates). Many cohort members are former smokers, especially among the men (60%), and relatively few cohort members were current smokers at baseline (7%). The prevalence of vitamin use (any type, at least once per week) was 38% in men and 52% in women. Over half of the cohort reported body weights that are classified as overweight (40% with a body mass index 25.0–29.9) or obese (15% with a body mass index ≥ 30.0) by World Health Organization criteria.67

Table 3. Baseline Characteristics of Persons with no Previous History of Cancer in the Cancer Prevention Study II Nutrition Cohort, 1992–1993a
CharacteristicTotal % (n = 162,408)bMen % (n = 77,048)bWomen % (n = 85,360)b
  • MET: metabolic equivalent.

  • a

    Excludes persons with prevalent cancer, except nonmelanoma skin cancer (n = 9358 men; n = 12,428 women).

  • b

    Percentages that do not add up to the total reflect missing data.

  • c

    One hour of moderate paced walking = 3.5 MET hours.

Age in 1992 (yrs)
 Less than high school graduate6.78.55.2
 High school graduate31.925.437.7
 Some college22.819.725.6
 College graduate or more37.945.730.9
Marital status
 Widowed, separated, divorced9.04.013.6
 Never married1.51.11.9
Smoking history
 Never smoker44.332.654.7
 Former smoker47.959.937.1
 Current smoker6.66.66.6
Body mass index
 20 to < 2539.633.844.9
 25 to < 3039.548.831.0
 30 to < 3511.411.811.0
 ≥ 353.52.44.5
Vitamin use
Physical activity (in MET hours per week)c
 > 0 to ≤ 731.729.433.7
 > 7 to ≤ 17.529.828.131.4
 > 17.5 to ≤ 31.519.921.818.1
Hormone replacement therapy
 Never use49.8
 Former user13.1
 Current user32.8
 Ever, unknown1.4
High blood pressure34.336.732.1
Blood pressure lowering drugs26.628.724.6
High cholesterol36.033.538.3
Cholesterol lowering drugs9.410.28.6

Participants were asked to report the average time per week that they had spent doing seven leisure-time physical activities during the past year (walking, jogging/running, lap swimming, tennis or racquetball, bicycling/stationary bike, aerobics/calisthenics, and dancing). For the great majority of people, walking was the main type of physical activity. The median combined weekly energy expenditure from all of these activities was equivalent to between 2 and 3 hours of moderate walking for both men and women.68

Approximately 33% of women reported current use of hormone replacement therapy at baseline. Over 30% of both men and women reported physician-diagnosed high blood pressure, and over 75% of those reporting this condition reported taking blood pressure-lowering medication. Over 30% of both men and women also reported physician-diagnosed elevated cholesterol, but only one-third of these reported taking cholesterol-lowering drugs. Over 10% of men and 8% of women reported a history of diabetes mellitus (Table 3).

Mean values of total energy and selected nutrients from dietary sources in men and women are presented in Tables 4 and 5. For these estimates, we excluded men and women with > 15% of FFQ responses left blank and those with implausible reported energy intake (< 800 or > 4200 kcals per day for men and < 500 or > 3500 kcals per day for women). There appears to be a substantial range in intake of all nutrients in both men and women, as evidenced by the 25th and 75th percentile levels (Tables 4 and 5). For certain carotenoids, substantial differences in intake were observed by racial group (Table 6). Black men and women had considerably higher intakes of beta carotene (from sweet potatoes, collards and greens, and carrots), cryptoxanthin (from cornbread, corn, and peas), and lutein (from collards and greens and spinach) and considerably lower intake of lycopene (from tomatoes and strawberries) compared with white men and women.

Table 4. Mean and Range of Daily Intake of Selected Nutrients in Men, Cancer Prevention Study II Nutrition Cohort, 1992–1993
NutrientMen (N = 69,216)a
Mean25th Percentile75th percentile
  • a

    Exclusions include prevalent cancer, except nonmelanoma skin cancer; men with > 15% of the Food Frequency Questionnaire missing; and men with < 800 or > 4200 Kcals/day.

Total calories (Kcal/day)180513532164
Percent of calories from fat35.429.741.4
Percent of calories from carbohydrates44.739.050.0
Percent of calories from protein16.114.017.9
Total fat (g)72.647.091.7
Saturated fat (g)23.914.730.5
Cholesterol (mg)245151302
Carbohydrate (g)198149238
Dietary fiber (g)13.49.416.4
Protein (g)71.453.085.8
Alpha carotene (mcg)4401845560
Beta carotene (mcg)296115103722
Calcium (mg)8465341030
Cryptoxanthin (mcg)69.724.293.5
Iron (mg)12.08.8714.5
Folate (mcg)284208346
Linoleic acid (g)13.98.617.9
Lutein (mcg)19987112503
Lycopene (mcg)13256161729
Magnesium (mg)255189308
Niacin (mg)17.813.321
Oleic acid (g)
Phosphorus (mg)12459011504
Potassium (mg)264819963165
Retinol (mcg)676414851
Riboflavin (B2) (mg)1.871.322.27
Sodium (mg)286020943458
Thiamin (B1) (mg)1.351.021.62
Vitamin A (IU)786348749685
Vitamin A (re)12358171523
Vitamin B6 (mg)1.551.171.87
Vitamin C (mg)12376.2156
Vitamin E (α-te)9.276.6011.4
Zinc (mg)10.17.4412.2
Vitamin D (IU)190107235
Glycemic load163118197
Ethanol (g)11.20.0013.6
Table 5. Mean and Range of Daily Intake of Selected Nutrients in Women, Cancer Prevention Study II Nutrition Cohort, 1992–1993
NutrientWomen (N = 77,222)a
Mean25th Percentile75th Percentile
  • a

    Exclusions include prevalent cancer, except nonmelanoma skin cancer; women with > 15% of the Food Frequency Questionnaire missing; and women with < 500 or > 3500 Kcals/day.

Total calories (Kcal/day)136010161629
Percent of calories from fat33.827.140.5
Percent of calories from carbohydrates47.541.553.4
Percent of calories from protein16.714.518.8
Total fat (g)52.232.866.1
Saturated fat (g)16.59.920.9
Cholesterol (mg)169105209
Carbohydrate (g)160118194
Dietary fiber (g)11.68.0114.4
Protein (g)56.141.367.8
Alpha carotene (mcg)328131421
Beta carotene (mcg)230911852901
Calcium (mg)752456931
Cryptoxanthin (mcg)63.121.687.3
Iron (mg)9.816.9412.1
Folate (mcg)253169312
Linoleic acid (g)10.35.7613.4
Lutein (mcg)17106432073
Lycopene (mcg)11234661458
Magnesium (mg)210152257
Niacin (mg)14.310.217.3
Oleic acid (g)19.311.925.7
Phosphorus (mg)9387071237
Potassium (mg)218715982652
Retinol (mcg)551332707
Riboflavin (B2) (mg)1.641.072.05
Sodium (mg)220515902682
Thiamin (B1) (mg)1.160.821.41
Vitamin A (IU)627638997775
Vitamin A (re)9936531236
Vitamin B6 (mg)1.310.911.59
Vitamin C (mg)11972.5153
Vitamin E (α-te)8.695.2910.5
Zinc (mg)8.515.7310.3
Vitamin D (IU)16787.5213
Glycemic load12690.8154
Ethanol (g)4.700.004.71
Table 6. Mean and Range of Daily Intake of Selected Nutrients by Gender and Race, Cancer Prevention Study II Nutrition Cohort, 1992–1993
White (N = 67,541)Black (N = 750)White (N = 75,270)Black (N = 1043)
  • a

    Exclusions include prevalent cancer, except nonmelanoma skin cancer; persons with > 15% of the Food Frequency Questionnaire missing; men with < 800 or > 4200 Kcal/day; and women with < 500 or > 3500 Kcal/day.

Total calories (Kcal/day)1807173113611335
Percent of calories from fat35.436.133.834.8
Percent of calories from carbohydrates44.645.147.547.8
Percent of calories from protein16.115.616.716.3
Total fat (g)72.870.352.353.0
Saturated fat (g)23.923.116.516.7
Cholesterol (mg)245266168183
Carbohydrate (g)199193160157
Dietary fiber (g)13.412.711.611.2
Protein (g)71.567.056.253.7
Alpha carotene (mcg)441341329271
Beta carotene (mcg)2945446822913680
Calcium (mg)849724755656
Cryptoxanthin (mcg)69.395.562.783.3
Iron (mg)
Folate (mcg)285269253252
Linoleic acid (g)13.913.210.310.4
Lutein (mcg)1961495016694323
Lycopene (mcg)13329631129827
Magnesium (mg)256239210202
Niacin (mg)17.816.814.414.2
Oleic acid (g)27.126.619.319.9
Phosphorus (mg)124811281014931
Potassium (mg)2655238521932045
Retinol (mcg)677689552582
Riboflavin (B2) (mg)1.881.711.641.54
Sodium (mg)2863275422072122
Thiamin (B1) (mg)1.351.311.161.15
Vitamin A (IU)7865854362737130
Vitamin A (re)123613139941100
Vitamin B6 (mg)1.591.431.321.27
Vitamin C (mg)123127119130
Vitamin E (α-te)9.298.838708.81
Zinc (mg)
Vitamin D (IU)191164167148
Glycemic load163160126124
Ethanol (g)11.38.784.752.67


The Nutrition Cohort was recruited from within the larger CPS-II Cohort. One great advantage of assembling the Nutrition Cohort in this way, rather than establishing a cohort of newly identified individuals, is the availability of historic measures of exposure collected in 1982. For this subcohort, the completion of the questionnaire in 1992–1993 established a new baseline for prospective analyses of cancer outcomes. Unlike previous ACS cohorts that were restricted to mortality follow-up, the Nutrition Cohort will include follow-up of incidents of cancer, enhancing our ability to study predictors of both cancer occurrence and survival.

It is also important to note that the Nutrition Cohort is not intended to be a representative sample of the larger CPS-II Cohort. In recruiting the Nutrition Cohort, we did not attempt to maximize the participation rate but, rather, to enroll a selected cohort of a given size. Assembling a selected cohort rather than a population-based cohort has several advantages. Most noteworthy, cohort stability and ongoing, long-term participation that are often seen in selected cohorts69–72 increase the likelihood of achieving high follow-up rates essential to the internal validity of exposure-disease results from a cohort study.2, 73 In addition, a selected cohort usually offers reduced potential for confounding and may yield higher data quality, depending on the nature of the cohort.2 Finally, these factors all contribute to greatly increased cost effectiveness because of the relative ease of tracing and following cohort members.73

An often stated concern about selected cohorts is the potential loss of generalizability (external validity). To address this concern, it is critical to differentiate between the generalizability of an absolute measure of disease (or risk factor) frequency and the generalizability of measures of association. Absolute rates of disease or risk factor frequency that are calculated in a selected cohort probably will not be generalizable to a different population. However, the purpose of most large cohort studies is to study relations (associations) between exposure and disease. If the internal validity of the study is maintained, then measures of association typically are unbiased and are generalizable even if the cohort is not a random sample of a defined underlying population.2, 73–76 For example, although the British Doctors Study77, 78 was not a random sample of British physicians, let alone the general population, the associations between smoking and disease observed in that study have proved to be generalizable to many other populations. Moreover, generalizability of associations across different subgroups within a cohort can be examined carefully and quantified.

The brief FFQ that was used to capture dietary intake in 1992–1993 underestimated intake of total energy and most individual nutrients. However, if this underestimation is fairly uniform across cohort members, as we expect it is, then it should not invalidate internal comparisons between high-consuming and low-consuming groups. Also, the range of dietary intake appears sufficient to allow detection of true differences. Although median validity correlation coefficients for food groups, macronutrients, and many micronutrients were comparable to those observed in other studies,57, 62, 79 validity coefficients were low for some micronutrients (e.g., zinc and vitamin E) from dietary sources. Many of these coefficients are likely to improve when vitamin supplement use is considered. In addition, the 1999 questionnaire recaptured dietary intake in this cohort using a more detailed FFQ62, 63 than was used in 1992. The 1992 FFQ will be used to define dietary exposures for those nutrients measured with reasonable validity and for incidents of cancer occurring from 1992 to 1999. The 1999 FFQ will provide the primary dietary exposure data for all incidents of cancers occurring after 1999. With rare exception, any cumulative measures of exposure estimated using both FFQs will be based on quantile ordering of individuals and not on absolute measures of consumption, given the inherent differences in the two instruments.

Blood and buccal cell samples will be used by ACS researchers and collaborating investigators for future nested case–control studies on the correlations between hormonal, dietary, genetic, and other factors and important health outcomes, particularly breast, prostate, and colorectal carcinomas. To oversee this valuable resource, ACS will form a standing review panel comprised of intramural and extramural scientists, including epidemiologists familiar with large cohort studies and laboratory and other researchers able to integrate cancer epidemiology, biology, genetics, and laboratory research. We also are participating in a National Cancer Institute initiative to form a Consortium of Cohort Studies sufficiently large to address interactions between genetic factors and environmental exposures. No single study is currently large enough for this purpose, because, typically, several thousand cases are needed to provide statistically stable estimates of gene-environment or gene-gene interactions.

The ACS Nutrition Cohort establishes a new, large, prospective cohort of both men and women, with detailed information on their dietary and other exposure characteristics, who will be followed actively for all incidents cancer and for all causes of death. This cohort promises to be particularly valuable for the study of cancer occurrence, mortality, and survival as they relate to obesity and weight change, physical activity at various points in life, vitamin supplement use, exogenous hormone use, other medications (such as aspirin and nonsteroidal anti-inflammatory drugs), and cancer screening modalities. In addition to the ongoing collection of exposure and disease information in the future, this cohort benefits from the availability of exposure data collected 10 years before baseline, in 1982, providing a unique opportunity to study temporal changes in exposure and the subsequent impact on the occurrence of disease.


The authors acknowledge the hundreds of thousands of ACS volunteers whose trust and good will make epidemiologic research at the ACS possible. They also thank the many ACS staff members who have gone before, including Dr. E. Cuyler Hammond, Lawrence Garfinkle, and Dr. Clark W. Heath, Jr., upon whose foundation they continue to build. Finally, the authors thank the Illinois Division for generous support of the CPS-II Nutrition and LifeLink Cohorts.