Spatial clustering of malaria and associated risk factors during an epidemic in a highland area of western Kenya


Simon Brooker (corresponding author), Siân Clarke, Sarah Polack and Jonathan Cox, Department of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK. E-mail:,,,
Joseph Kiambo Njagi, Division of Malaria Control, Ministry of Health, PO Box 20750 00202 KNH, Nairobi, Kenya.
Benbolt Mugo and Eric Muchiri, Division of Vector Borne Disease, Ministry of Health, PO Box 20750, Nairobi, Kenya.
Benson Estambale, Department of Medical Microbiology, College of Health Sciences, PO Box 19676, Nairobi, Kenya.
Pascal Magnussen, Danish Bilharziasis Laboratory, Jaegersborg Allé 1D, 2920 Charlottenlund, Denmark. E-mail:


The epidemiology of malaria over small areas remains poorly understood, and this is particularly true for malaria during epidemics in highland areas of Africa, where transmission intensity is low and characterized by acute within and between year variations. We report an analysis of the spatial distribution of clinical malaria during an epidemic and investigate putative risk factors. Active case surveillance was undertaken in three schools in Nandi District, Western Kenya for 10 weeks during a malaria outbreak in May–July 2002. Household surveys of cases and age-matched controls were conducted to collect information on household construction, exposure factors and socio-economic status. Household geographical location and altitude were determined using a hand-held geographical positioning system and landcover types were determined using high spatial resolution satellite sensor data. Among 129 cases identified during the surveillance, which were matched to 155 controls, we identified significant spatial clusters of malaria cases as determined using the spatial scan statistic. Conditional multiple logistic regression analysis showed that the risk of malaria was higher in children who were underweight, who lived at lower altitudes, and who lived in households where drugs were not kept at home.


In common with most infectious diseases, malaria distribution within a geographical area is heterogeneous and can vary greatly between villages and households (Greenwood 1989; Gamage-Mendis et al. 1991; Carter et al. 2000). These patterns of malaria reflect a composite of heterogeneities in vector distribution, human–vector contact and human host factors (Greenwood 1989). Identified risk factors for malaria include distance to known mosquito breeding sites, household construction, household crowding and personal protection measures against mosquito biting (Gamage-Mendis et al. 1991; Trape et al. 1992; Adiamah et al. 1993; Koram et al. 1995; Thompson et al. 1997; Snow et al. 1998; van der Hoek et al. 1998; Ghebreyesus et al. 1999, 2000; Thomas & Lindsay 2000; Clarke et al. 2002). In turn, these factors are proximally influenced by differences in environmental landscape (Rejmankova et al. 1995; Thomas & Lindsay 2000) and socio-economic status (Koram et al. 1995; Clarke 2001).

Disentangling the influence of these different factors is frequently hindered by a lack of detailed data relating to a full range of contextual factors together, and few studies have been performed which include both household and environmental landscape factors. This is particularly true for epidemic-prone areas in highland locations, despite the increasing interest in the epidemiology of highland malaria (Lindblade et al. 1999; Shanks et al. 2000; Hay et al. 2002). In these areas, transmission is unstable and the risk of disease tends to be equal across all age groups as populations have little or no immunity against Plasmodium spp. It remains unclear, however, whether the risk of malaria during an epidemic is equal amongst all households or the degree to which risk is spatially clustered.

The investigation of infectious disease clustering is receiving renewed interest, not least because of advances in geographical information systems (GIS) and spatial statistics, which allow for the quantification of the degree of clustering of infections. Such approaches have been used to investigate the spatial clustering of dengue (Morrison et al. 1998), LaCrosse encephalitis (Kitron et al. 1997) and sleeping sickness (Fèvre et al. 2001), but their application to malaria has been limited (Schellenberg et al. 1998; Chadee & Kitron 1999; Ghebreyesus et al. 2003). An improved understanding of the spatial clustering of malaria and its determinants in highland areas may provide useful insights into local epidemic control (Carter et al. 2000).

In May–July 2002, Western Kenya experienced a number of malaria epidemics, following heavy rains earlier during May. This provided a unique opportunity to investigate the epidemiology of clinical malaria within epidemic-prone areas during an epidemic. In this study, we have mapped and analysed the household distribution of clinical malaria based on active case detection among school children in Kapkangani Location, a rural epidemic-prone area of the highlands in western Kenya. The objectives of our study are to evaluate spatial clustering of clinical malaria and to investigate putative risk factors.


Study area

The present study was conducted in Kapkangani Location (0°14′–0°22′ N, 34°54′–35°3′ E), a rural part of Nandi District, western Kenya. Investigations were undertaken in three schools: Kiborgok in Kiborgok sub-location, and Koibem and Kabaskei in Chepkomia sub-location (Figure 1). Kiborgok sub-location is situated on the Nandi Escarpment where elevation ranges from 1650 to 2050 m. Chepkomia sub-location is lower down the escarpment to the west of South Nandi Forest, running along the River Yala at elevations ranging from 1700 to 1900 m. Rainfall (annual average of 2428 mm) is seasonally bimodal, with the long rains occurring from March to May and the short rains from October to December. Average annual minimum and maximum temperatures are 12.2 and 23.6 °C, respectively (unpublished data from the adjacent Kaimosi tea estate meteorological station).

Figure 1.

Map of study area, Kapkangani Location, Nandi District in western Kenya. The locations of sampled households, schools, health centre, land classification types, River Yala and elevation contours are shown. Kiborogok school is positioned north of the health centre, Koibem is south of the health centre and Kabaskei is the most southern school.

Malaria transmission is acutely seasonal with peaks occurring 2–3 months after the peak rains in April–May, although the extent of the malaria burden varies considerably from year to year. This temporal pattern is similar to other areas of western Kenya (Hay et al. 2002). Early entomological research indicated that throughout the district, anthropophilic Anopheles gambiae s.l. (98%) was the principal malaria vector in the area, with An. funestus (2%) playing a minor role (Roberts 1964). Recent investigations report similar findings (Shililu et al. 1998; Minakawa et al. 2002a).

The population of the area consists of indigenous Kapsigi people and numerous Luhya settlers who have moved from the lowland areas of western Kenya and have purchased land during the past 30 years. The economy is primarily rural subsistence agriculture, with some families growing tea as a cash crop. Other economic opportunities include casual labour on local tea estates. This population is serviced by Kapkangani Government Health Centre, which has a catchment area of about 20 000 people and a catchment radius of about 20 km.

Selection of schools and methods of surveillance

Schools within the catchment area of the health centre were identified and classified according to ethnic mix. In order to minimize the effect of differential immunity, host genetic heterogeneity and/or travel to malaria-endemic areas on the risk of malaria, only schools where almost all pupils were of indigenous Kapsigi descent (including Nandi and Kapchukin peoples) were eligible for inclusion in the study. The three schools closest to the health centre fulfilling these ethnicity criteria were selected. A series of meetings were held with teachers, parents and community leaders to explain the purpose and methodology of the study that participation was voluntary and children were able to withdraw from the study at any time. All children in classes three to seven were enrolled in the study, after written parental consent. The incidence of malaria amongst children was monitored over a 10-week period in May–July 2002, which corresponded to the time of peak malaria transmission. Data were collected through a system of active case detection at the three schools, supplemented by continuous passive case detection at Kapkangani Health Centre.

Ethical approval was obtained from the Ethical Review Board of Kenyatta National Hospital, Nairobi and the Danish Central Ethical Committee.

Active surveillance of malaria cases

Each school was visited by the surveillance team (nurse, laboratory technician and field assistant) two to three times per week to identify children with clinical episodes of malaria. To maximize case detection, class teachers identified and recorded the names of any children who were ill or absent from school each morning. Any children reporting fever or other malaria-related symptoms, or absenteeism because of illness were notified to the surveillance team for follow-up and screening, either in school or at home. Absentees were visited at home. A morbidity questionnaire was completed to include age, sex, history and duration of fever, other presenting signs and symptoms, and whether the child had received any prior treatment. A fingerprick blood sample was taken from any child satisfying one or more of the following screening criteria; (i) one or more of the following symptoms suggestive of malaria within the previous 24 h (reported fever, chills/shivering, rigours, vomiting, malaise, or generalized body pain) or (ii) demonstrable axillary temperature ≥37.5 °C. Giemsa-stained thick and thin blood films were prepared and the number of asexual parasites per 200 leucocytes were counted. Schoolchildren with clinically diagnosed episodes were treated with sulphadoxine/pyrimethamine on the day of survey. Other conditions requiring treatment were referred to the health centre.

The active surveillance was supplemented by passive case surveillance in health centres, where treatment was provided free for schoolchildren enrolled in the study. At the end of the surveillance period, a cross-sectional survey among all schoolchildren was conducted to assess children's anthropometric status. Weight was measured to the nearest 0.1 kg using a Soehnle electronic balance (CMS Weighing Equipment; UK); height was measured to the nearest 0.1 cm using a portable fixed base stadiometer (CMS Weighing Equipment; UK). Anthropometric indices were calculated using Anthro Software (Atlanta: CDC and Geneva: WHO) which uses the NCHS reference values. Height-for-age and weight-for-age were expressed as differences from the median in SD units or z-scores. Children were classified as stunted and underweight if z-scores of height-for-age and weight-for-age respectively were <2SD below the NCHS median.

Definition of cases and community controls

A case of malaria was defined as a child with one or more of the screening symptoms and a parasite density threshold of >500 parasites/μl blood, following Bloland et al. (1999) who used geqslant R: gt-or-equal, slanted500 parasites/μl for children aged >10 years in an endemic area of western Kenya. This is likely to be a conservative case definition when applied amongst a non-immune population in whom plasmodial infection is more likely to result in symptomatic illness. Controls were randomly drawn from amongst children who during the surveillance period were either a) well or b) symptomatic but slide-negative. Cases were matched to controls by year of age and school. We had originally planned to match every case to two controls. However, this was only possible in Kiborgok. In Chepkomia, the high number of cases meant that each case could only be matched to a single control.

Household mapping and household surveys

Homes of every school child enrolled in the study area were visited and the location and elevation of all households were determined using a hand-held Trimble GeoExplorer3 global positioning system (GPS; Trimble Navigation, Sunnyvale, CA, USA), which gives a positional accuracy within 5 m.

In addition, in the homes of cases and their age-matched controls, the household head or senior wife was interviewed in the local language to obtain data on household risk factors, and the room where the child slept was visited to record information on exposure factors that may affect mosquito–human contact. Exposure data recorded included roofing material, presence of open eaves and windows, ceiling, proximity to animal sheds, whether a child slept under a bednet and the use of methods of protection against mosquitoes such as insecticides, repellents and mosquito coils during the surveillance period. A pre-tested standardized questionnaire was used to record details of household socioeconomic characteristics including building materials and ownership of selected agricultural and household assets. The list of household assets and indicators was selected based on published literature and interviews/discussions with local key informants. These features were used to derive a wealth index, using the method of Filmer and Pritchett (2001), which has been shown to reliably measure economic status on the basis of asset ownership without the necessity of direct income or expenditure information. A principal component analysis (PCA) was used to determine the weights for an index of asset variables in order to calculate the wealth index (Ah) for each household, using STATA (v. 7.0; College Station, TX, USA). Specifically, Ah = ΣFn(AnjAn)/(Sn), where Fn is the scoring factor of nth asset, Anj is the PCA score for nth asset of jth household, An and Sn are the mean and SD of the PCA score for nth asset. Variables entered into the PCA included: type of building materials used for roof and walls, presence of windows, presence of separate kitchen building, ownership of eight household assets (table, pressure lamp, mosquito net, iron, radio, clock, sofaset, bicycle) and eight agricultural assets (own land, tea bushes, cattle, sheep, donkey(s), wheelbarrow, ox-and-plough, tractor and/or other vehicle). The first principal component explained 20.7% of the variance in included variables and gave greatest weight respectively to ownership of a wheelbarrow, pressure lamp, cement walls, iron and separate kitchen building. The resultant scores were divided into quintiles, so that each household could be classified in terms of relative socio-economic status (Armstrong-Schellenberg et al. 2003).

Land use/land cover

We image-processed satellite remote sensing data to derive thematic maps of principal land use/land cover types in the study area. Digital Landsat Enhanced Thematic Mapper (ETM+) data for 5th February 2001 were acquired, representing ecological conditions in advance of the main rainy season. The ETM+ sensor measures radiation reflected from the Earth's surface in a number of discrete spectral bands. From an ecological standpoint the most useful of these (bands 1–5 and band 7) cover the visible and near infrared portions of the electromagnetic spectrum and have a spatial resolution of 30 m. ETM data for the study area were geometrically corrected with reference to GPS ground control points using ENVI image processing software (Version 3.5; RSI Inc., Boulder, CO, USA). To produce coverages of land cover type we used a standard ‘supervised’ classification approach, where a maximum likelihood classification is performed to allocate each image pixel to one of a small number of known categories. The main classes identified included tea, primary forest, cleared forest and grassland. Subsequently, the distance of each household to the nearest area of forest and tea was determined using standard GIS functionality in Arc/Info (Version 7; ESRI, CA, USA).

Spatial clustering

Spatial analysis was used to explore the spatial pattern of malaria cases and help test hypotheses relating to the processes that may have given rise to the observed distributions. All households of schoolchildren were analysed, whether they were a case, control or not included in the case-control analysis. The Kulldorf spatial scan statistic was used to test whether malaria cases were distributed randomly over space, and if not, to identify significant spatial clusters (Kulldorff & Nagarwalla 1995). For this, we used the SaTScanTM software ( This programme uses a circular window moved systematically throughout the geographic space to identify significant clustering of cases. This window is centred on each of a number of possible locations throughout the study area and for each location, and the window size varies from 0 to a pre-defined upper limit. For the current analysis, the upper limit was specified as 50% of the study population, which allows both small and large clusters to be detected, while ignoring clusters that contain more than 50% of the population. For each location and size of the scanning window, a likelihood ratio test is conducted to test the hypothesis that there is an elevated rate of disease when compared with the distribution outside. The window size and location with the maximum likelihood is defined as the ‘most likely’ cluster (i.e. least likely to have occurred by chance). The distribution and P-value of the most likely and secondary clusters are determined by conducting Monte Carlo replications of the data set. SaTScanTM uses either a Poisson based or Bernoulli model. The latter is appropriate for 0/1 event data such as cases/non-cases, where non-cases are taken to represent the background distribution population. This approach was therefore selected for current analysis.

Risk factor analysis

Risk factor analysis was restricted to comparison of case-control sets (a subset of the data). Univariate analysis of all risk factors was conducted using logistic regression to estimate odds ratios (OR), with SE adjusted to account for within-household clustering of cases. In multivariate analysis, conditional multiple-logistic regression was employed. Analysis was conducted using STATA. Whether a child had always lived in the district was originally included in the questionnaire as a proxy measure for immune status. However, few children were born outside the district (18/284) and there was very little variation within the study population. Therefore, this variable was excluded in the analysis. Only one child was reported to be sleeping under a mosquito net and therefore this variable was also excluded from the analysis.


A total of 129 incident cases were detected during a 10-week surveillance period between May and July 2002, yielding the following weekly incidence rates: 0.047/week in Koibem school; 0.032/week in Kabaskei and 0.013/week in Kiborogok. Cases were matched to 155 controls. The household distribution of malaria cases is shown in Figure 2, which indicates fewer cases occurred higher up the escarpment in Kiborgok than near the River Yala, suggesting evidence of an association between malaria and altitude.

Figure 2.

Map of the household distribution of malaria cases identified during the active case detection in Koibem and Kabaskei, and Kiborgok (insert), and the location of significant clusters of cases as identified by the Kulldorf spatial scan statistic (large open circles).

To assess whether there are distinct spatial clusters in the distribution of malaria, we applied a spatial scan statistic separately for Kiborgok and Chepkomia. In Kiborgok, a single cluster of seven cases (1.68 expected) in seven households was identified (relative risk = 4.16, P = 0.058). In Chepkomia, a larger cluster of 17 cases (7.43 expected) in nine households was identified (relative risk = 2.28, P = 0.012). The geographical locations of these clusters are depicted in Figure 2.

Frequencies of risk factors amongst cases and controls, and associated univariate odds ratios are shown in Table 1. Neither age nor sex were identified as significant risk factors. Overall, 18.2% of the schoolchildren were stunted and 25.2% of children were underweight. There was no association between risk of malaria and stunting, but underweight children were significantly more prevalent among cases than controls. Malaria risk decreased with altitude, with significantly fewer cases occurring amongst children living above 1800 m compared with children living below 1750 m. Decreased risk of malaria was associated with increased distance from the forest fringe. Household socio-economic status was lower among the cases than controls. The practice of keeping medicines in the home was less common among families of cases than among families of controls. None of the child's room factors were associated with risk of malaria.

Table 1.  Frequencies and odds ratios for variables associated with the risk of clinical malaria among schoolchildren during an epidemic in western Kenya (univariate analysis)
  1. * Significance tested using logistic regression, with adjustment for clustering by household.

  2. † Defined as height-for-age z-scores <2SD.

  3. ‡ Defined as weight-for-age z-scores <2SD.

No. of children284155129  
Child factors
 Sex (female compared with male)51.443.954.30.660.074
 Age (years)
  7–11 years 30.331.91.00 
  12–15 years 61.961.10.940.819
  16 years and over
Child's room factors
 Thatch roof28.929.028.70.980.948
 Open eaves32.429.036.41.400.204
 Smoky room52.352.
 Coils or sprays used in child's room16.216.815.50.910.758
Household factors
 Family keeps medicines at home48.955.541.10.560.012
 Socio-economic status index of household
  First quintile (poorest) 
  Second quintile19.114.924.21.310.505
  Third quintile19.
  Fourth quintile25.029.719.40.530.077
  Fifth quintile (least poor)18.823.013.70.480.071
Geographic factors
 Altitude of household
  <1750 m36.930.345.01.00 
  1750–1799 m35.232.338.80.810.436
  1800–1849 m14.
  1850 m and over13.819.
 Distance of household to forest fringe
  250 m9.96.514.01.00 
  250–499 m23.921.926.40.560.162
  500 m and over66.271.659.70.390.013

Based on the results of the univariate analysis and including variables with P-values <0.1, a conditional multiple-logistic regression model was developed in a backward stepwise fashion (Table 2). The risk of malaria was significantly reduced for children living in households at higher altitudes, whereas children who were underweight had a significantly higher risk of malaria. The practice of keeping malaria drugs at home had borderline significance. No significant interactions were detected among the factors included in the analysis.

Table 2.  Conditional logistic regression model of significant variables associated with the risk of malaria in western Kenya, matching by age and school
VariableAdjusted odds ratios (95% CI)P-values
Underweight2.18 (1.12–4.27)0.022
Family keeps medicines at home0.58 (0.32–1.04)0.069
Altitude of household
 <1750 m1.0 
 1750–1799 m0.70 (0.33–1.46)0.342
 1800–1849 m0.42 (0.17–1.05)0.064
 1850 m and over0.24 (0.07–0.77)0.017

By comparing the characteristics of cases identified in a spatial cluster – as identified by the scan-statistic with those of cases outside a cluster throughout the study area, the results of the spatial analysis provide further insights into risk factors for malaria. Households within an identified spatial cluster were positioned at lower altitudes than case households outside a cluster (1741 m vs. 1777 m, t-test: t = 3.21, P < 0.001). No other variables differed between cases identified in a cluster and cases identified outside.


Of the numerous studies investigating risk factors for malaria, few have simultaneously examined individual, household and environmental risk factors. Fewer epidemiological studies have investigated the epidemiology of malaria within the East African highlands where transmission intensity is low but characterized by acute within and between year variations (Lindblade et al. 1999; Shanks et al. 2000; Hay et al. 2002). In 2002, exceptional rainfall during May in the western highlands of Kenya led to epidemics in some districts in June and July. Using spatial statistics and GIS, we investigated the spatial distribution of incident malaria cases among schoolchildren in three highland schools and investigated putative risk factors for clinical malaria. It is important to recognize that risk factors for malaria during an epidemic may differ (in their nature and strength of association) from those affecting disease in non-epidemic periods.

Using the spatial scan statistic, our study identified significant spatial clusters where the risk of malaria was higher. Having identified spatial clustering in the distribution of malaria cases, the next step was to investigate the underlying individual, household and environmental factors that characterize both malaria risk per se and high-risk areas. Areas characterized by low altitude were strongly associated with the risk of malaria and risk of spatial clustering. Probably the most important factor that is influenced by altitude is temperature, which affects the development and survival of the vector and, more importantly, development of Plasmodium within the vector. This varies according to temperature to a point where parasite development ceases altogether. For P. falciparum, laboratory studies have estimated the critical temperature for development to be 16–19 °C (MacDonald 1957). Below this, few adult mosquitoes survive the 56 days required for completion of the sporogonic cycle, and this temperature range is often considered the threshold for stable transmission. However, at high altitudes where these temperatures are rarely reached outdoors, mosquitoes may avoid low temperatures by finding more favourable microclimates within which to rest (Garnham 1948).

Another factor which may be associated with altitude is suitability for mosquito breeding. In particular, An. gambiae tend to inhabit temporary freshwater pools (Minakawa et al. 1999), and these are typically found in cleared areas resulting from deforestation, such as the forest fringe in the present study (Figure 1). Throughout the study area there were many small streams and water-bodies and preliminary larval surveys of these bodies found anopheline larvae at all altitudes. We did not set out to systematically determine known breeding sites or presence of particular mosquito species (larvae or adults). However, along an altitudinal transect in western Kenya Minakawa et al. (2002a), found that anopheline mosquito densities declined with increasing altitude, with the maximum altitude at which An. gambiae and An. funestus were found being 1980 m. The closest site surveyed to our study area was Tindinyo (7 km along the main road) where An. gambiae was the only malaria vector found. The relationship between proximity of houses to larval habitats and/or number of mosquitoes found in the house with risk of malaria has been demonstrated in a number of studies. Working at lower altitudes in western Kenya, Minakawa et al. (2002b) showed that over 90% of adult mosquitoes are found in houses within 300 m of the nearest larval habitat, (which tend to be temporary freshwater pools). Other work in various African settings suggests that An. gambiae dispersal is <1 km (Trape et al. 1992; Takken et al. 1998), and that risk of malaria is strongly associated with distance from breeding sites (Trape et al. 1992; Thompson et al. 1997; Ghebreyesus et al. 1999; Thomas & Lindsay 2000; Clarke et al. 2002).

The lack of association of malaria risk with specific household construction features such as roof or eaves contrasts with other studies (Gamage-Mendis et al. 1991; Koram et al. 1995; Thompson et al. 1997; Ghebreyesus et al. 2000). Not only does the quality of housing affect the ease with which mosquitoes can enter a home, but housing quality is also also a key indicator of household economic status and thus a marker for other socio-economic determinants of malaria risk. However, the lack of association with housing in our study is most likely to be due to the fact that there was relatively little variation in house construction in the study area. As a result of low night-time temperatures most houses had shuttered windows and the eaves were closed. We were also unable to demonstrate any association between malaria risk and personal protection measures. This may reflect generally low usage of bed nets (3% of households reported using a net) and other protective measures against mosquito biting in this area (<20% of households used mosquito repellent strategies). The fact that mosquitoes are not generally perceived as a nuisance may explain this low usage. By contrast, in rural coastal Kenya, where mosquito densities are extremely high, the use of mosquito repellent strategies reduced the risk of severe disease (Snow et al. 1998).

Although use of personal protection was generally low, it was more common in wealthier families (Table 3). As our model also included factors that are associated with differences in socio-economic status and which directly affect exposure, such as house construction and personal protection, the observed lack of an association between malaria incidence and socio-economic status is perhaps not surprising. Recent studies in Tanzania, which also show a lack of association between reported illness, including fever, and socio-economic status (Armstrong-Schellenberg et al. 2003). However, socio-economic status in these studies was strongly related to the rate of hospital admission and the probability of receiving appropriate treatment once ill, and care-seeking behaviour was worse in the poorest families. We observed a similar trend. Poorer families were less likely to keep medicines in the home than wealthier families. Furthermore, our data suggested that, independent of socio-economic status, children from households that kept medicines in the home may have a lower malaria infection risk. Having used a system of active case detection and treatment, we were unable to investigate risk factors for severe malaria, such as differences in treatment seeking and access to drugs. Household socio-economic status could be expected to be important in the spatial patterning of severe malaria morbidity and mortality in highland areas, and warrants further investigation.

Table 3.  Frequencies and mean values of variables summarized by wealth quintiles (12 individuals had missing data for wealth index)
Variable1 (poorest)2345 (least poor)P-value
  1. * 12 households had missing data.

  2. † Significance tested using non-parametric test for trend across ordered groups (nptrend command in STATA).

  3. ‡ Significance tested using approximate chi-squared test of homogeneity of odds and a test for linear trend of odds (tabodds command in STATA).

Family uses sprays or coils10.25.811.517.739.2<0.001
Family keeps medicines at home36.736.550.‡
Distance to forest fringe68274110958238830.04†

In malaria-endemic areas, repeated exposure to malaria parasites in early life leads ultimately to the ability to limit parasite growth, and by the time a child starts school, episodes of clinical malaria have usually become both less common and less severe. In contrast, people who live in highland areas where transmission is low and unstable transmission remain non-immune and at risk of life-threatening attacks of malaria at older ages. Within our subset of schoolchildren in standards three to seven, malaria incidence was high and did not decline with age, confirming the low level of immunity within our study population. In this analysis, differences in immunity within the study population were minimized through selection of schools in which the majority of children were of indigenous highland descent. Data on history of residence further revealed that none of the children had previously lived in a malarious area. The only host factor significantly associated with malaria risk that we observed was an almost threefold increase in risk amongst children who were underweight. Whether this represents a biological difference in host response among children who are acutely malnourished and those who are not, or as marker of some other important socio-economic difference is not clear.

Potential shortcomings are inevitable in any analysis and two are recognized here. First, the spatial scan statistic only looks for clustering using a circular process centred on any given fixed point. Obviously, true clusters may not be circular, but could be any shape, including elliptic and rectangular. However, using these approaches reduces the ability to detect other shaped clusters. Hence, there is a trade-off. One attractive feature of using the circular scan statistic is that it is isotropic with respect to a rotation of the map. This is not the case with rectangular or elliptic scan statistics unless all possible angles are considered, which is difficult for computational reasons (M. Kulldorff, personal communication). For this and other reasons, the spatial scan statistic using circular process has proved useful for a variety of infectious diseases (Fèvre et al. 2001; Cousens et al. 2001; Ghebreyesus et al. 2003; Mostashari et al. 2003).

Secondly, by not including explicit geographical dependence between households, standard errors of the odds ratios may be underestimated and therefore the statistical significance of the covariates may be over-estimated. In a recent analysis of geographical risk factors of malaria in The Gambia, Thomson et al. (1999) adjusted model standard errors according to a computed semi-variogram. Comparison of this approach to a standard logistic-regression model showed that only borderline variables dropped from significance. In the present analysis however, it is unlikely that spatial autocorrelation acts sufficiently to inflate the standard error estimates of altitude to render invalid the overall results.

In epidemic-prone areas in which malaria risk can be highly focal, aiming control strategies at areas of highest risk can potentially increase the programme's effectiveness (Carter et al. 2000). In our study, the most important risk factor was altitude but this association may not be generalizable to other highland areas of Africa and it is therefore unclear whether such information can readily be exploited to target control at small spatial scales. Guaranteeing people's access to quick and effective treatment and the use of vector-control methods on a wide scale remains the cornerstone of effective malaria control. In highland areas, indoor residual spraying also offers a cost-effective method of malaria control (Guyatt et al. 2002).


We are very grateful to the children, teachers and community of Kiborgok, Koibem and Kabaskei who kindly participated in the study, and to Njau, Nderitu, Osangale, Pauline, Joanne and other fieldworkers who assisted with field data collection. Financial support for this work was provided through an award from the Bill and Melinda Gates Foundation to the Gates Malaria Partnership, DfID, London School of Hygiene and Tropical Medicine; the Danish Bilharziasis Laboratory; the World Bank; and the Wellcome Trust [through a Prize Fellowship (062692) to SB at the time of the study]. Tom Smith provided helpful advice on regression modelling. We also thank Simon Cousens, Brian Greenwood, Lydia Osorio, and Joanna Armstrong-Schellenberg and two anonymous reviewers for their contributions and suggestions.