Plasma proteomic signature of age in healthy humans

Abstract To characterize the proteomic signature of chronological age, 1,301 proteins were measured in plasma using the SOMAscan assay (SomaLogic, Boulder, CO, USA) in a population of 240 healthy men and women, 22–93 years old, who were disease‐ and treatment‐free and had no physical and cognitive impairment. Using a p ≤ 3.83 × 10−5 significance threshold, 197 proteins were positively associated, and 20 proteins were negatively associated with age. Growth differentiation factor 15 (GDF15) had the strongest, positive association with age (GDF15; 0.018 ± 0.001, p = 7.49 × 10−56). In our sample, GDF15 was not associated with other cardiovascular risk factors such as cholesterol or inflammatory markers. The functional pathways enriched in the 217 age‐associated proteins included blood coagulation, chemokine and inflammatory pathways, axon guidance, peptidase activity, and apoptosis. Using elastic net regression models, we created a proteomic signature of age based on relative concentrations of 76 proteins that highly correlated with chronological age (r = 0.94). The generalizability of our findings needs replication in an independent cohort.


| INTRODUCTION
Older age is the main risk factor for a myriad of chronic diseases, and it is invariably associated with progressive loss of function in multiple physiological systems. In some individuals, the combined effect of physiological decline and diseases leads to physical and cognitive disability. Despite its importance for health, most epidemiological research considers aging merely as a confounder, a nuance dimension to be accounted for and then discarded, under the assumption that aging is unavoidable and unchangeable (Fried & Ferrucci, 2016). This view is now changed. As the intrinsic biological mechanism of aging is slowly revealed, there is hope that interventions that slow aging and prevent or delay the onset of chronic disease and functional impairments can be discovered (Kennedy et al., 2014;Lopez-Otin, Blasco, Partridge, Serrano, & Kroemer, 2013).
A critical goal in the field of aging biomarkers is to identify molecular changes that show robust patterns of change with normal aging, with the assumption that departures from this "signature" pattern provide not only information regarding future risk of pathology and functional decline but also clues on compensatory mechanisms by which our organism counteracts the effects of aging (Sierra, Hadley, Suzman, & Hodes, 2009). Such a signature could be used both to identify individuals in the trajectory of accelerated aging and to track the effectiveness of interventions designed to slowdown biological aging.
A challenge in this field is the need to differentiate between aging and diseases. Most participants enrolled in epidemiological studies include a significant number of individuals affected by pathology or disability, and the proportion of such individuals increase with age. Thus, it is difficult to dissect changes in biomarkers of normal aging from those of disease pathology.
DNA methylation and gene expression have been used to develop molecular markers or signatures associated with chronological age (Bocklandt et al., 2011;Hannum et al., 2013;Horvath, 2013;Lin et al., 2016;Weidner et al., 2014). The "epigenetic clock," a biomarker index that combines weighted information of a subset of DNA methylation sites raised great interest because it is both strongly associated with chronological age across multiple tissues and populations and independent of age, predicts multiple health outcomes, including cardiovascular disease, cancer, and mortality (Chen et al., 2016;Levine et al., 2015;Perna et al., 2016). These findings suggest that aging is associated with stereotyped and reproducible molecular changes that can potentially be used to identify individuals who are aging faster or slower than the average population. However, the underpinnings of these molecular changes have not been fully elucidated, at least in part because the effect of methylation on DNA function, locally and distally from the methylation site, remains unclear (Declerck & Vanden Berghe, 2018).
A promising alternative to current methods may be to construct a similar aging biomarker clock based on circulating proteins. Proteins are attractive because they directly affect phenotypes and provide direct information on biological pathways that can be involved in many of the physiological and pathological manifestations of aging. However, performing discovery proteomics is challenging because of the wide dynamic range of plasma proteins and because of the interference from large, multiply charged proteins such as albumin, apolipoprotein A1, and C-reactive protein (Geyer, Holdt, Teupser, & Mann, 2017). Attempts to address this challenge by depletion of highly abundant proteins from plasma samples have generated conflicting results, with some suggestions that proteins in depleted samples are no longer representative of the those in the original sample (Bellei et al., 2011). An alternative approach is to use the SOMAscan assay, a technology that uses slow off-rate modified aptamers (SOMAmer)-based capture to quantify multiple proteins in human biological liquids, including plasma (Baird, Westwood, & Lovestone, 2015;Di Narzo et al., 2017;Menni et al., 2015). Previous studies using this approach were conducted in convenience samples originally collected for purposes other than studying aging, and included people affected by multiple diseases (Di Narzo et al., 2017).
It is not clear to what extent the results of those studies reflect age independently of disease.
To address this issue, we conducted proteomic analyses using the version of the SOMAscan assay that measured 1,301 proteins in 240 adults aged 22-93 years, free of major chronic diseases, cognitive, and functional impairment. The goal was to identify proteins associated with chronological age avoiding as much as possible the effect of clinically detectable disease, examine their association with several clinical characteristics, and further compare our results to previous proteomic profile analyses that used the same technology. We further constructed a proteomic signature of age to begin exploring to what extent the proteome can predict chronological age.

| Association of proteins with chronological age
Proteomic profiling was conducted on 240 healthy men and women between the ages of 22-93 years. The basic characteristics of the subjects are displayed in Table S1. The association of 1,301 SOMAmers with chronological age was examined. There were 217 proteins (20 negatively associated, 197 positively associated) associated with age (p < 3.83 × 10 −5 ) in the basic model adjusted for sex, study (Baltimore Longitudinal Study of Aging [BLSA] or Genetic and Epigenetic Signatures of Translational Aging Laboratory Testing [GESTALT]), race, and batch (Tables 1 and S2, Figure 1). Further adjustment for body mass index (BMI), and serum creatinine resulted in 210 (22 negative, 188 positive) age-associated proteins (Table S2). To explore whether some of the proteins had nonlinear relationship with age, we fitted a model that included an age square term (age 2 ) to account for nonlinearity. The proteins were then ranked by the variance explained by the age term for proteins that were linearly correlated with age, or the variance explained by the age plus age 2 terms for proteins that had evidence of nonlinearity (i.e., had significant age 2 term). The proteins ranks based on p-values in the linear model were highly correlated with the protein rankings based on a mix of linear and nonlinear models (r = 0.96). We concluded that overall, the linear model was adequate for our purpose. The protein with the strongest age association was GDF15 (β[SE] = 0.018[0.001], p = 7.49 × 10 −56 , Figure 2a) that showed positive association with age. To validate the result obtained with GDF15, its plasma level was measured in a subset of 88 subjects using ELISA. GDF15 level assessed by ELISA strongly correlated with age ( Figure 2b Studies have shown that the number of senescent cells increases with aging in multiple human tissues, including circulating T cells (Liu et al., 2009). Senescent cells are characterized by senescence-associated secretory phenotype (SASP) that release inflammatory mediators, proteinases and other molecules in the surrounding niche, from where they are eventually released into circulation. Of a list of SASP proteins reported in the literature, 72 were targeted by SOMAmers (Table S3), and 21 of the 72 SASP SOMAmers were significantly associated with chronological age, with an overall significant SASP enrichment (p = 0.007; Figure S1).

| Proteomic signature of age
To create a proteomic predictor of chronological age, we fitted an elastic net regression model to select a parsimonious cluster of proteins of the 1,301 proteins measured that best predicted chronological age. We started by randomly splitting the study population into two equally sized groups of 120 participants. The first group was used as a training set and the second as a validation set. From the randomly selected training set of 120 subjects, the elastic net regression selected 76 proteins (Table S4). Of the 76 proteins selected, 37 proteins were among the 217 age-associated proteins.
In the validation set, the correlation between the fitted proteomic age predictor and chronological age was r = 0.94 ( Figure 3). The correlation between predicted and observed age did not differ by sex (data not shown). To determine the minimum number of proteins required to create a meaningful a proteomic predictor, we fitted a series of models in which we constrained the maximum number of variables to be selected for the calculation of the age predictor in the elastic net regression model (Table 2). This resulted in the generation of predictors with progressively fewer proteins. A total of 13 proteomic age predictors were created ranging from a model with 76 predictor proteins to only one protein, which was the GDF15 (Tables 2 and S4). The precision of the proteomic age predictor was very high even with few proteins, with a correlation of 0.92 between predicted and observed age with as few as 8 proteins. In fact, a predictor including just GDF15 had a relative high correlation with chronological age at r = 0.82. The accuracy of the prediction, however, declined substantially when the number of proteins included in the predictor was reduced ( Table 2). With the full 76 protein predictor, the mean absolute difference between predicted and observed age was 5.7 years, while the 8-protein model had a difference of 13.1 years, and the GDF15 only model had a difference of 16.6 years ( Table 2). In the 76-protein age predictor model, only half of the proteins were among significant age-associated proteins. As the number of proteins included in the predictor decreases, higher percentage of the selected proteins was associated with age, and for predictors with <9 proteins, all the selected proteins were associated with age in univariate analysis.
To evaluate how well the proteomic age predictor reflects age, we examined the association of 76 proteins, 8 proteins, and GDF15 Model 2: Model 1 + BMI + inverse of serum creatinine.
F I G U R E 1 Associations of proteins with age. Volcano plot displaying the association of 1,301 proteins with chronological age. Protein values were log-transformed and associations with age were tested using a linear model adjusting for sex, race, study (BLSA or GESTALT), and batch. The figure displays the effect size (beta coefficient from the linear model), against significance presented as the −log 10 (p-value) proteomic age predictors with 13 age-associated clinical variables (Table 3). With a few exceptions, all three proteomic age predictors were associated with the clinical parameters in a direction consistent with the associations observed with chronological age. When the associations between proteomic age predictors and clinical parameters were adjusted for chronological age, the associations were no longer significant suggesting that the proteomic age predictors are a good proxy of chronological age.

| Sex-specific age associations
For eight proteins, the correlation with age was different between sexes (Table S4, Figure 4). Not surprisingly, half of these proteins were sex hormones (luteinizing hormone [LH], FSH, human chorionic gonadotropin, sex hormone-binding globulin [SHGB]). There was a greater positive association of FSH, LH, and SHGB with age in women compared to men (Table S5). There was a positive association of SHGB with age that was significant only in men. The association of tissue factor pathway inhibitor, vitamin K-dependent protein S, and insulin-like growth factor binding protein 7 was positively associated with age in women but not significant in men. At last, netrin-4 had a significant negative association with age in men but not women.

| Functional annotation and enrichment analysis
The 217 age-associated proteins were examined for patterns of functional enrichment. The Kyoto Encyclopedia of Gene and Genomes (KEGG) pathways enriched were "cytokine-cytokine receptor interactions," "complement coagulation cascades," and "axon guidance" (Table 4). There were many gene ontology (GO) biological process (30), cellular component (14), and molecular function (12) terms that were enriched among the 217 proteins (Table S6), many of which had shared sets of proteins.
To better understand the patterns of co-occurrence of proteins, a functional annotation clustering analysis was conducted using DAVID. There were five clusters of GO terms with enrichment scores >3 (Table S7; Figure 5). The first cluster included four GO F I G U R E 2 Correlation of GDF15 with age and validation with ELISA assay. (a) The most significant age association was observed for growth differentiation factor 15 (GDF15), which was positively associated with age (β = 0.018 ± 0.001, p = 7.5 × 10 −56 ). To validate association of GDF15 using an independent assay, GDF15 abundance was measured with ELISA on a subset of 88 subjects. (b) GDF15 abundance measured with ELISA correlated with age (β = 0.018 ± 0.002, p = 3.83 × 10 −20 ). (c) Plasma GDF15 measured by ELISA assay was correlated with the measure from SOMAscan, and a correlation of 0.821 was found F I G U R E 3 Proteomic signature of age. Using elastic net regression model, proteomic predictors of age were created with variable numbers of predictor proteins in the model. This graphs show the correlation between the predicted age on the y-axis and chronological age on the x-axis for proteomic predictors with 76 predictor proteins. The correlation between predicted age using the proteomic signature and observed age was 0.94

| DISCUSSION
In this study, we used the SOMAscan assay to examine the plasma proteomic profile of age in healthy humans. To reduce potential bias from disease and maximize the chance to capture age-related differences, we selected a sample of individuals spanning a wide agerange who were very healthy according to strict criteria originally developed for enrollment in the BLSA (Shock et al., 1984). We identified 217 proteins significantly associated with age and show that a precise proteomic predictor of age can be generated using a combination of these proteins. Of the age-associated proteins, some, such as the GDF15 and NPPB, have previously been described to increase with age, but for many others their association with age has never been previously reported. It is an interesting fact that some of the classic aging biomarkers such as IL6, TNFα, and IGF-1 were not among the top proteins significantly associated with age. This finding was surprising but may be explained by the exceptional health status of the individuals enrolled in this study. Whether these proteins are better correlated with age in a more representative population that does not exclude persons affected by diseases and disabilities should be explored in future studies.
Several proteomic studies of aging using earlier versions of the SOMAscan platform have been reported. One of these studies was conducted in a sample of women enrolled in the TwinsUK study (Menni et al., 2015). In this study, 1,129 plasma proteins were measured by SOMAscan in 206 women, and the top proteins were tested for replication in 677 subjects from AddNeuroMed, T A B L E 2 Precision and Accuracy of proteomic predictors of age Alzheimer's Research UK, and Dementia Case Registry cohorts.
There were 13 age-associated proteins in the discovery, 10 of which were replicated in the independent samples. In our study, 12 of the 13 proteins that were age-associated in the TwinsUK study were confirmed to be associated with age. Two other proteomics studies of age were performed in cerebral spinal fluid (CSF) and serum (Baird et al., 2015;Di Narzo et al., 2017). In the first study, 800 proteins were measured in CSF from 90 cognitive normal participants between 21 and 85 years old (Baird et al., 2015). Of these, 248 proteins exhibiting a signal twofold greater than the background were tested for association with age, of which 81 were found to be asso- In our study, we identified many other proteins associated with age that were not previously described using this technology. The  -10  IL1R2, CCL3, CXCL9, TNFSF15, TNFRSF8, CCL7, CXCL10, TNFRSF1A, TNFRSF1B, CCL3L1, IL10RA,  TNFRSF19, IL15RA, FAS, EGF, IL13RA1, EPO, EGFR, CCL4L1, CCL11, AMH, TNFRSF9, RELT, IFNB1 Functional enrichment analysis highlighted some key pathways that are important in aging. The GO term clusters targeted included blood coagulation, chemokine and inflammatory pathways, axon guidance, peptidase activity, and apoptosis. The two main proteins in the blood coagulation cluster were fibrinogen and fibronectin, both previously shown to increase with age, and both related with a proinflammatory state (Folsom et al., 1991;Labat-Robert, Potazman, Derouette, & Robert, 1981). A second cluster included a number of peptidases, with substantial overlap with the blood coagulation cluster, and included SERPINF2, AHSG, SERPING1, SERPINA3, and F I G U R E 5 Functional annotation clustering using Database for Annotation, Visualization and Integrated Discovery (DAVID). Pathway enrichment analysis was conducted using DAVID, and to better visualize the shared proteins between the top GO annotation terms, functional annotation clustering was conducted on GO "biological processes," "molecular function," and "cellular component" terms.
The GO terms and proteins shared among the terms for the top five clusters are displayed TANAKA ET AL.
| 7 of 13 TIMP1 suggesting that this second group of proteins taps into some different aspects of blood clotting pathways. Of note, increased levels of all proteins included in the second cluster have been associated with major age-related conditions. SERPINF2 modulates insulin sensitivity and is associated with cardiovascular diseases and diabetes (Aso et al., 2000;Uitte de Willige et al., 2011). SERPING1 modulates the complement cascade and is important in many inflammatory diseases, including macular degeneration (Ennis et al., 2008).
SERPINA3 has been identified as a specific biomarker of delirium and Alzheimer's disease (Padmanabhan, Levy, Dickson, & Potter, 2006;Poljak et al., 2014). TIMP1 has been involved in age-associated renal sclerotic and impairment kidney angiogenesis (Tan & Liu, 2012). Enrichment analysis also reveals the changes in protein levels of various CC chemokine family. For many of these chemokine proteins, there are reports that aging affect both their gene expression and protein levels Whiting et al., 2015;. One of these proteins, CCL11 or eotaxin has been proposed as an important factor in neurogenesis in parabiotic models of aging in mice models (Villeda et al., 2011). Our study provides supportive evidence that these class of proteins change with age in healthy older adults.
The main proteins that define the axon guidance cluster are ephrin proteins that are important in axonal growth during development (Fiore & Puschel, 2003). In adults, some ephrin proteins have been implicated in cancer development (Royet et al., 2017). The implication of changes in ephrin proteins in healthy proteins should be investigated further.
Consistent with the hypothesis of increase apoptosis with aging, one of the enriched functional clusters involved several proteins from the TNF receptor superfamily. The TNF receptor superfamily plays an important role in regulating cell fate, not only apoptosis but also proliferation, and morphogenesis (Aggarwal, Gupta, & Kim, 2012). The TNF receptors can be categorized as activating receptors (such as TNFRSF1B) that control the nuclear factor κB and mitogenactivated protein kinase (MAPK) pathways, and death receptors (such as FAS) that contain a death domain that induces cell death.
TNFR1 (TNFRSF1A) has both activating and death receptor functions and can affect cell metabolism, differentiation, and proliferation (Li, Yin, & Wu, 2013). Soluble TNF receptor 1 (TNFRSF1A) and 2 (TNFRSF1B) have been associated with advance age and aging pathologies such as kidney function, fractures, and cognitive performance (Cauley et al., 2016;Gao et al., 2016;Schei et al., 2017). Our study results would suggest that there may be a more coordinate change in the TNF receptor family with age that may be important determinant of healthy aging.
We sought to examine whether there was enrichment of proteins involved in important aging phenomenon that may not be annotated in established databases. There is a growing interest in understanding the role of senescence in aging. It has been hypothesized that many age-related, degenerative pathologies are driven at least in part by the accumulation of cell senescence (Campisi & Robert, 2014). An elegant study has shown that clearance of senescent cells can delay age-associated conditions such as cataract, lordokyphosis, muscle mass and function, and increase longevity in mice (Baker et al., 2011). Several studies have documented that senescence cells release a variety of bioactive molecules including interleukins, chemokines, growth factors, secreted proteases, and extracellular matrix components into the extracellular matrix.
Although a comprehensive list of SASP proteins is still not available, in our study we found an enrichment of SASP proteins that has been reported in the literature, suggesting that senescence increases with aging even in subjects who remain relatively healthy. It is possible that these blood biomarkers of age may be used to monitor the trajectories of aging.
Using data from multiple proteins, we created a proteomic signature that is tightly correlated with age. It is an interesting fact that the precision of the proteomic age predictor was not compromised by reducing the number of proteins used in the predictor; however, with fewer proteins, there was a substantial decline of accuracy. Our results suggest that there are stereotypical biological changes that occur with aging that are reflected by circulating proteins. Regardless of whether these protein modifications reflect biological aging or track compensatory mechanisms triggered by aging, similarly to the epigenetic clock, our signature accurately predict age. It is critical that the proteomic "signature" of age identified in our analysis be other key aging proteins missing from this analysis; thus, our results do not comprehensively represent the aging proteome. For example, we observed that most of age-associated proteins show increased abundance with age. This trend was also observed in the previous study of aging in plasma using the SOMAscan platform (Menni et al., 2015). It is unlikely that this is a biological phenomenon but rather an artificial observation based on the proteins that are targeted by the SOMAmers. Other proteomic aging studies in humans using technology such as two-dimensional gel electrophoresis (Byerley et al., 2010) or quantitative mass spectrometry (Waldera-Lupa et al., 2014) showed an equal number of age-associated proteins that decreased as well as increased with age. Second, the SOMAscan is not an absolute measure of proteins, and therefore, we cannot make comparisons between proteins. Third, the accuracy of the protein specificity revealed by the SOMAscan technology is still controversial (Schafer et al., 2016). While the aptamers are designed to measure proteins in their native confirmation, there is still a possibility of cross-reactivity for proteins with high similarity. We validated the measure and the age association for our top protein GDF15 by comparing values obtained with the SOMAScan with those obtained by ELISA. Nevertheless, substantial work remains to be done to validate the other proteins. Large aging proteomic studies conducted with different technologies are needed to provide a comprehensive picture of the aging proteome in addition to validate our findings. A substantial step in this field is to overcome the current limitations of LC-MS approached to study proteomics in plasma to obtain a comprehensive profile in this highly accessible biological fluid. At last, our study involved individuals that were exceptionally healthy, which is an advantage of our approach, but it can also be a limitation. As we have applied the same selection criteria across the age spectrum, it is most likely that the younger and older populations are different.
The older subjects in this study are by all accounts healthy agers, while the likelihood of the younger subjects to be as healthy in older age is not guaranteed. In addition, the healthy older subjects in this study are not generalizable to the average American population.
In summary, using a discovery proteomic approach, we identified over 200 proteins that are robustly associated with age. Our findings could provide a window to a new area of investigation with enormous potential. Future studies are needed to replicate and expand our findings in a larger population and, possibly, in representative cohorts that are followed for many years. Under the assumption that the age-proteomic profile summarizes the biological mechanisms of aging, one could anticipate that such profile would predict many of the aging phenotypes as well as multimorbidity, disability, and death.
If future studies show that longitudinal changes in the age-proteomic profile track the phenotypic manifestations of aging, plasma proteomics may shed light into the biology of aging and contribute to the development of interventions aimed at preventing the burden of disease and disability in older persons.

| Study population
This study was conducted in healthy men and women participating in the BLSA and the GESTALT studies.
The BLSA study is a population-based study aimed at depicting physiological and functional trajectories with aging and discover factors that affect those trajectories. The study evaluates contributors of healthy aging in persons 20 years old and older (Shock et al., 1984  The experimental process for proteomic assessment and data normalization has been previously described .

| Proteomic assessment
The data reported are SOMAmer Reagent abundance in relative fluorescence units (RFU). The abundance of the SOMAmer Reagent TANAKA ET AL.
| 9 of 13 represents a surrogate of protein concentration in the plasma sample.
Data normalization was conducted in three stages. First, hybridization control normalization removes individual sample variance on the basis of signaling differences between microarray or Agilent scanner. Second, median signal normalization removes intersample differences within a plate due to technical differences such as pipetting variation. At last, calibration normalization removes variance across assay runs. Further, there is an additional interplate normalization process that utilizes CHI calibrator that allows normalization across all experiments conducted at CHI laboratory . A interactive Shiny web tool was used during the CHI QC process .

| Statistical analysis
Protein RFU values were natural log-transformed and outliers outside

| Proteomic signature of age
To construct a proteomic age predictor, a penalized regression model was implemented using the R package glmnet. First, a training set was selected by stratified random sampling method selecting 24 subjects from each of the 15-year age strata (20-35, 35-50, 50-65, 65-80, 80+ years). The remaining 120 subjects were used as a validation sample. In the training dataset, chronological age was regressed on 1,301 log-transformed protein abundances. The alpha value was set to 0.5 (for elastic net regression) and a lambda of 0.8767859 was selected using a 10-fold cross-validation on the training set using the cv.glmnet function. The resulting age-prediction model from the penalized regression was applied to the validation data and the correlation between predicted and chronological age was examined.
Proteomic age predictor with varying number of predictor proteins was created to explore the minimum number of proteins needed to create a meaningful age predictor. This was carried out by constraining the maximum number of variables selected using the dfmax option in the training set. A total of 12 additional age predictors were created with 63,58,49,40,27,17,9,8,7,5,3, or 1 proteins in the model. These age predictor models were applied to the validation dataset to check for the correlation between predicted and chronological age.
The association between the 13 proteomic age predictors with 13 age-associated clinical phenotypes (IL-6, CRP, total cholesterol, fasting glucose, HBA1C, blood urea nitrogen, alkaline phosphatase, serum albumin, waist circumference, grip strength, usual walking speed, systolic blood pressure, and red blood cell distribution width) was tested using multiple linear regression adjusting for chronological sex, study (BLSA or GESTALT), and race (white, black, other).
Two clinical variables (IL-6 and CRP) were natural log-transformed to achieve near normality. For this analysis, a p-value <0.05 was considered as statistically significant.

| Functional annotation and enrichment analysis
To explore whether certain biological processes, molecular function or cellular components are enriched among the proteins that were found significantly correlated with age, a gene enrichment analysis was run on the 217 age-associated proteins using the Database for Annotation, Visualization and Integrated Discovery (DAVID https://david.ncifcrf.gov/) tool (Dennis et al., 2003). The GO, KEGG pathway enrichment, and functional annotation clustering (Huang et al., 2007) were conducted using default DAVID parameters. Enriched GO and KEGG pathways were considered significant at FDR or Benjamini-Hochberg corrected p < 0.05.

ACKNOWLEDG MENTS
The BLSA and the GESTALT study was supported by the Intramural Research Program of the National Institute on Aging. RDS is supported by NIA R01 AG027012. This research was also supported in part by the Intramural Research Program of the National Institute of Allergy and Infectious Diseases, Center for Human Immunology, Trans-NIH.

DATA AVAILABILITY
Data proteomic data generated from this study are available upon request. Please contact the corresponding author for further information.

CONF LICT OF I NTEREST
B.S. is a former SomaLogic, Inc. (Boulder, CO, USA) employee and a company shareholder. The remaining authors have no competing interests to declare.

AUTHOR CONTRIBU TI ONS
LF directed and supervised the project. SOMAscan assay was run by AB and GF and supervised by KS, BS, and YK. JC, FC, BS conducted proteomic data normalization and cleaning. TT, AZM, and MA conducted the statistical and bioinformatic analysis. RM and RDS contributed to the interpretation of data. TT prepared the manuscript and all authors have contributed to and approved the final version of the manuscript.

S U P P O R T I N G I N F O R M A T I O N
Additional supporting information may be found online in the Supporting Information section at the end of the article.