Moving towards best practice when using inverse probability of treatment weighting (IPTW) using the propensity score to estimate causal treatment effects in observational studies
Abstract
The propensity score is defined as a subject's probability of treatment selection, conditional on observed baseline covariates. Weighting subjects by the inverse probability of treatment received creates a synthetic sample in which treatment assignment is independent of measured baseline covariates. Inverse probability of treatment weighting (IPTW) using the propensity score allows one to obtain unbiased estimates of average treatment effects. However, these estimates are only valid if there are no residual systematic differences in observed baseline characteristics between treated and control subjects in the sample weighted by the estimated inverse probability of treatment. We report on a systematic literature review, in which we found that the use of IPTW has increased rapidly in recent years, but that in the most recent year, a majority of studies did not formally examine whether weighting balanced measured covariates between treatment groups. We then proceed to describe a suite of quantitative and qualitative methods that allow one to assess whether measured baseline covariates are balanced between treatment groups in the weighted sample. The quantitative methods use the weighted standardized difference to compare means, prevalences, higher‐order moments, and interactions. The qualitative methods employ graphical methods to compare the distribution of continuous baseline covariates between treated and control subjects in the weighted sample. Finally, we illustrate the application of these methods in an empirical case study. We propose a formal set of balance diagnostics that contribute towards an evolving concept of ‘best practice’ when using IPTW to estimate causal treatment effects using observational data. © 2015 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.
1 Introduction
Researchers are increasingly using observational studies to estimate the effects of treatments, exposures, and interventions on health outcomes. In randomized controlled trials, randomization ensures that, on average, treated subjects will not differ systematically from control subjects in both measured and unmeasured baseline characteristics. Therefore, the effect of treatment can be estimated by directly comparing outcomes between the treatment groups. However, non‐randomized studies of the effect of treatment on outcomes can be subject to treatment‐selection bias in which treated subjects differ systematically from control subjects. Therefore, in non‐randomized studies, the effect of treatment cannot be estimated by simply comparing outcomes between treatment groups.
Propensity score methods are being used with increasing frequency to estimate treatment effects using observational data. The propensity score is defined as the probability of treatment assignment conditional on measured baseline covariates 1-3. Rosenbaum and Rubin demonstrated a key property of the propensity score: conditional on the propensity score, treatment status is independent of measured baseline covariates 1. Thus, the propensity score is a balancing score: treated and control subjects with the same propensity score will have similar distributions of observed baseline covariates.
Four methods of using the propensity score have been described in the statistical literature: covariate adjustment using the propensity score, stratification or subclassification on the propensity score, matching on the propensity score, and inverse probability of treatment weighting (IPTW) 1, 4. Rubin argues that an advantage to the use of propensity score methods is that they allow observational studies to be designed similar to randomized experiments: the design of the study is separated from the analysis of the effect of exposure on the outcome 5. Rubin states that ‘diagnostics for the successful design of observational studies proposed on estimated propensity scores …is a critically important activity in most observational studies’ 6. Such diagnostics enable applied researchers to determine whether conditioning on the estimated propensity score has removed observed systematic differences between treated and control subjects. Diagnostics have been developed for stratification on the propensity score 2, 7, 8, matching on the propensity score 9, 10, and covariate adjustment using the propensity score 11. However, diagnostics have been less well described in the context of IPTW using the propensity score.
The objective of the current study is to describe methods to assess whether the use of IPTW has induced a weighted sample in which the distribution of measured baseline covariates is similar between treated and control subjects. These methods are based on comparing the distribution of measured baseline covariates between treated and control subjects in the sample weighted by the estimated inverse probability of treatment. We refer to these methods as balance diagnostics. The paper is structured as follows. In Section 2, we briefly review methods based on IPTW. In Section 3, we report on a review of the literature examining the frequency with which appropriate diagnostics were employed when using IPTW. In Section 4, we describe balance diagnostics for use with IPTW and methods for assessing the validity of the positivity assumption. We first describe quantitative methods to compare means, prevalences, higher‐order moments, and interactions between covariates across treatment groups in the weighted sample. We then describe how graphical methods can be used to compare the distribution of continuous covariates between treated and control subjects in the weighted sample. These graphical methods are complemented by a numerical method to compare the distribution of continuous covariates between treated and control subjects. In Section 5, we describe a case study illustrating the application of these methods. Finally, in Section 6, we discuss our proposed methods in the context of the existing literature and provide recommendations for practice.
2 Inverse probability of treatment weighting
2.1 Potential outcomes framework and average treatment effects
We consider the setting in which there is a binary or dichotomous exposure. Thus, we assume that there are two possible treatments (e.g., active treatment vs. control treatment). The potential outcomes framework assumes that each subject has a pair of potential outcomes: Yi(0) and Yi(1), the outcomes under the control treatment and the active treatment, respectively, when received under identical circumstances 12. However, each subject receives only one of the control treatment or the active treatment. Let Z denote an indicator variable denoting the treatment received (Z = 0 for control treatment vs. Z = 1 for active treatment). Thus, only one outcome, Yi, is observed for each subject: the outcome under the actual treatment received. The observed outcome is equal to Yi=ZiYi(1) + (1 − Zi)Yi(0). Thus, Yi is defined to be equal to Yi(0) if Zi=0, and to be equal to Yi(1) if Zi=1.
For each subject, the effect of treatment is defined as Yi(1) − Yi(0): the difference between the two potential outcomes. The average treatment effect (ATE) is defined to be: E[Yi(1) − Yi(0)] 13, with the expectation taken across the population of interest. The ATE is the average effect, at the population level, of moving an entire population from control to treated.
If treatment were assigned at random, we would have that E[Y|Z = 1] = E[ZY(1) + (1 − Z)Y(0)|Z = 1] = E[ZY(1)|Z = 1] + E[(1 − Z)Y(0)|Z = 1] = E[Y(1)|Z = 1] = E[Y(1)]. The last equality holds because, under randomization, treatment assignment is independent of the potential outcomes: (Y(1),Y(0))⊥⊥Z. Similarly, E[Y(0)] = E[Y|Z = 0]. Therefore, under randomization, one has that E[Yi(1) − Yi(0)] = E[Y|Z = 1] − E[Y|Z = 0]13. Thus, randomization provides an unbiased estimate of the average treatment effect. However, in an observational study, we have that, in general, E[Y(1)|Z = 1] ≠ E[Y(1)]. Thus, in an observational study simply comparing outcomes between the two treatment groups does not necessarily yield an unbiased estimate of the average treatment effect.
2.2 The propensity score and inverse probability of treatment weighting
As previously discussed, let Z denote treatment assignment (Z = 1 denoting treatment; Z = 0 denoting absence of treatment), and let X denote a vector of observed baseline covariates. The propensity score is defined as e = P(Z = 1|X): the probability of a subject receiving the treatment of interest conditional on their observed baseline covariates 1. The inverse probability of treatment weight is defined as
. Each subject's weight is equal to the inverse of the probability of receiving the treatment that the subject received 4.
Lunceford and Davidian provide a review of methods for estimating treatment effects that use weighting by the inverse of the probability of treatment 14. If Y denotes an outcome variable, the average treatment effect (ATE) can be estimated by
, where n denotes the number of subjects. An alternative estimator of the ATE is
14. When the propensity score model is correctly specified, both estimators are consistent estimators of the true treatment effect 14. However, Lunceford and Davidian found that in empirical studies, in general, the variance of the former estimator is greater than that of the latter estimator 14.
Joffe et al. describe how weighting by the inverse probability of treatment results in an artificial population in which baseline covariates are independent of treatment status 15. Furthermore, Joffe et al. describe how regression models can be combined with weighting by the inverse probability of treatment to estimate causal treatment effects. While weighting by the inverse probability of treatment allows the comparison of expectations and distributions between treated and control subjects, methods that account for the weighting must be used in estimating variances and significance levels 14, 15. For instance, Joffe et al. suggest that a robust, sandwich‐type variance estimator be used to account for the fact that the weights are estimated, rather than known with certainty. Other alternatives to variance estimation include bootstrap‐based methods.
A difficulty that can arise when using the weights described previously is that treated subjects with a very low propensity score can result in a very large weight. Similarly, a control subject with a propensity score close to one can result in a very large weight. Such weights can increase the variability of the estimated treatment effect 16. An alternative to the conventional weights described previously is to use stabilized weights:
16. Pr(Z = 1) and Pr(Z = 0) denote the marginal probability of treatment and control in the overall sample. Another alternative to address the problems that can arise with very large weights is to use trimmed or truncated weights, in which weights that exceed a specified threshold are each set to that threshold 16, 17. The threshold is often based on quantiles of the distribution of the weights (e.g., the 1st and 99th percentiles).
The weights described previously
permit estimation of the ATE. However, a different set of weights permit estimation of the average treatment effect in the treated (ATT):
18. These weights are obtained by multiplying the conventional weights by e, so that treated subjects receive a weight of one. Thus, the treated sample is being used as the reference population to which the treated and control samples are being standardized. While the current article is focused on the use of the ATE weights, the balance diagnostics discussed are equally applicable to situations in which the ATT weights are employed.
2.3 Variable selection for the propensity score model
The propensity score is defined as the probability of treatment selection conditional on measured baseline covariates. A natural question that arises is what variables should be included in the propensity score model. A reasonable suggestion would be to include those variables that influence the treatment selection process. A different answer can be obtained by remembering the primary property of the propensity score: that it is a balancing score 1. Thus, conditioning on the propensity score permits one to balance the distribution of measured baseline covariates between treated and control subjects. Accordingly, Rosenbaum suggests that one ask ‘which covariates do you wish to balance by matching on the propensity score?’ 19 (page 356). The goal of propensity score analyses should be to induce balance in measured baseline covariates between treatment groups. However, when considering balance, not all covariates are of equal importance. It is more important to balance prognostically important covariates than those covariates that influence treatment selection but have no effect on the outcome. Indeed, prior evidence has suggested that it is preferable to include either the prognostically important covariates (those related to outcomes) or the confounding covariates (those related to treatment and outcomes) in the propensity score model than to include those variables that affect the treatment‐selection process 8. In a similar vein, Myers et al. state that conditioning on instruments (i.e., variables that affect treatment‐selection but not the outcome) can result in increased bias and variance of the treatment‐effect estimate 20.
The identification of the set of variables that are prognostically important or that confound the treatment–outcome relationship can be identified using causal diagrams 21 in conjunction with a review of the subject‐matter literature and expert opinion. We suggest that statistical hypothesis testing in the analytic sample not be used to identify the requisite variables, in the spirit of separating ‘design’ from ‘analysis’ and not using the outcome data in the propensity score process 22. A further reason for this caution is the possibly low statistical power to detect all of the prognostically important or confounding covariates. Having identified the appropriate set of variables, the objective of IPTW using the propensity score is to create a weighted sample in which the distribution of these covariates is the same between treated and control subjects.
2.4 Assumptions of propensity score methods
Causal inference using the propensity score requires four assumptions: consistency, exchangeability, positivity, and no misspecification of the propensity score model 16. Consistency means that a subject's potential outcome under the treatment actually received is equal to the subject's observed outcome. Exchangeability, also known as ignorable treatment assignment, is the assumption that there are no unmeasured confounders: that one has measured and has access to all of the variables that affect treatment selection and outcomes. Positivity is the assumption that all subjects have a non‐zero probability of receiving each treatment: 0< Pr(Z = 1) < 1. Cole and Hernan note that the assumption of no unmeasured confounding cannot be formally tested 16. Instead, subject matter knowledge is required in designing the study so that all confounders are collected.
Rosenbaum and Rubin suggest that there are multiple balancing scores (of which f(X) = X is the finest and the propensity score is the coarsest) 1. The use of any of these balancing scores would induce balance on the measured baseline covariates. Given that conditioning on any balance score will induce balance, it may not be possible to assess whether the propensity score model has been correctly specified; that is, it may be that we have an ‘incorrect’ propensity score but yet have something that is still a balancing score. We would argue that while the specification of the propensity score model may be unverifiable, the important issue is whether weighting using the estimated propensity score induced balance of measured covariates between treated and control subjects. Thus, rather than assessing the accuracy of the specification of the propensity score model, we focus on assessing balance of measured covariates between treated and control subjects in the weighted sample.
3 Literature review of the use of balance diagnostic with IPTW in the applied medical literature
We conducted a review of the applied biomedical literature to determine the frequency with which IPTW methods are applied and how often appropriate balance diagnostics are employed in this context.
3.1 Methods
We conducted a literature search using the Web of Science ©(Thomson Reuters) to address two objectives. The first objective was to determine the frequency with which applied articles reported using IPTW methods. The second was to examine in detail all the articles published in a recent year to determine the frequency with which authors reported using balance diagnostics in conjunction with IPTW methods. We limited our literature search to articles published between 1987 and 2014. The start date was selected as the year in which Rosenbaum's original article on inverse probability of treatment weighting was published 4.
We searched the Web of Science Core Collection on 30 March 2015 using the following search strategy: TOPIC: (“inverse probability of treatment weight*”) AND YEAR PUBLISHED: (1987–2014) Refined by: DOCUMENT TYPES: (ARTICLE) AND [excluding] WEB OF SCIENCE CATEGORIES: (STATISTICS PROBABILITY OR MATHEMATICS INTERDISCIPLINARY APPLICATIONS OR MATHEMATICAL COMPUTATIONAL BIOLOGY OR COMPUTER SCIENCE INTERDISCIPLINARY APPLICATIONS OR MEDICAL INFORMATICS OR SOCIAL SCIENCES MATHEMATICAL METHODS OR COMPUTER SCIENCE INFORMATION SYSTEMS)
The last set of exclusions was used to exclude articles published in the methodological literature, because we were interested in the application of IPTW in the applied literature.
3.2 Results
The search identified 139 articles published between 1987 and 2014 that used IPTW. The number of identified studies published each year is reported in Figure 1. No applied studies were identified that used IPTW until 2000, when two studies used this method. From 2000 until 2007, few studies were identified as having used IPTW, with a total of 10 published studies during this 8‐year period. From 2007 onwards, the annual number of published studies that used IPTW grew in an approximately linear fashion. In 2014, 34 articles were identified that employed IPTW.

We examined in detail the 34 articles published in 2014. The most frequent research areas (using the Web of Science Research Area classification system) were ‘Cardiovascular system cardiology’ (9 articles), followed by ‘Surgery’ (6 articles), and ‘Respiratory system’ (5 articles). Upon a more detailed examination of these 34 articles, five were excluded because they used inverse probability of treatment weighting in the context of marginal structural models (MSMs) (4 articles) or considered a continuous exposure (1 article). This left 29 articles for assessment of the use of balance and weight diagnostics. Of these 29 articles, three (10.3%) presented some assessment of the distribution of weights, while 14 (48.3%) assessed the distribution of baseline covariates after implementing IPTW. Only two studies conducted an assessment of baseline covariate balance and examined the distribution of the weights 23, 24.
Of the three studies that assessed the distribution of the weights, one article computed the mean and standard deviation of the weights, as well as reported the range of the weights 24. This study conducted separate analyses using stabilized weights and trimmed weights. A second study conducted three separate analyses using conventional weights, standardized weights, and trimmed weights 23. The authors used boxplots to examine the distribution of the weights. Finally, the third study reported the range of the weights 25.
Of the 14 studies that examined the distribution of baseline covariates between treated and control subjects in the weighted sample, a range of methods were used. These included using standardized differences in the weighted sample 23, 24, 26-31, the Kolmogorov–Smirnov statistic 26, a crude comparison of baseline characteristics 32, and statistical significance testing in the weighted sample 30, 33-35. Two studies reported that comparisons were carried out but did not report the results 36, 37.
While it was not the focus of the review, we noted that several sets of authors incorrectly defined the weights as the reciprocal of the propensity score, rather the reciprocal of the probability of receiving the treatment that was actually received.
4 IPTW diagnostics
In this section, we describe both quantitative and qualitative methods for assessing balance in observed baseline covariates between treated and control subjects in a sample weighted by the inverse probability of treatment. We also describe methods for assessing the validity of the positivity assumption.
4.1 Balance diagnostics
In this sub‐section, we consider diagnostics for assessing the balance of baseline covariates between treated and control subjects in a sample weighted by the inverse probability of treatment. As noted previously, the objective of IPTW analyses is to create a weighted sample in which the distribution of either the confounding variables or the prognostically important covariates is the same between treated and control subjects.
4.1.1 Comparison of means and proportions of baseline variables
and
denote the sample mean of X in treated and control subjects, respectively, while
and
denote the sample variance of X in treated and control subjects, respectively. Similarly, for a dichotomous variable,
and
denote the sample prevalence of the variable in treated and control subjects, respectively. In an unweighted sample, the standardized difference is defined as
(1)
(2)The sample means, sample variances, and sample prevalences in formulas 1 and 2 are unweighted estimates. However, each sample estimate can be replaced by its weighted equivalent. The weighted mean is defined as
, while the weighted sample variance is defined as
, where wi is the weight assigned to the i‐th subject. In our context, the weight is the inverse probability of treatment received, as defined in Section 2. The use of standardized differences allows researchers to quantitatively compare balance in measured baseline covariates between treated and control subjects in the sample weighted by the inverse probability of treatment.
4.1.2 Comparison of interactions and higher‐order moments of continuous variables
The methods described in the previous section allow one to compare means and prevalences of continuous and dichotomous variables, respectively, between treated and control subjects in the weighted sample. However, one desires to balance not only means and prevalences but also other characteristics of the distribution. In particular, higher‐order moments and interactions between variables should be similar between treatment groups in the weighted sample. In the context of propensity‐score matching, both Ho et al. and Austin have suggested comparing interactions and higher‐order moments between treatment groups 9, 10. Therefore, we suggest that standardized differences be used to compare the mean of higher‐order moments (e.g., squares and cubes of continuous variables) and interactions between continuous variables. Comparing the mean of squares of continuous variables is equivalent to comparing the variance of that variable between treatment groups. One wants to ensure that the variance, and not only the mean, of a continuous variable is similar between treatment groups in the weighted sample.
4.1.3 Graphical comparisons of the distribution of continuous variables
Standardized differences allow for the comparison of means and higher‐order terms between treated and control subjects. However, one wants to induce balance on the entire distribution of continuous covariates, not just means and higher‐order terms of baseline variables. We now describe graphical methods that permit a broader, qualitative, comparison of the distribution of a continuous variable between two groups in a sample that has been weighted by the inverse probability of treatment.
Side‐by‐side boxplots 40 and empirical cumulative distribution functions (CDFs) 41 can be used to compare the distribution of continuous baseline covariates between treated and control subjects in the weighted sample. The use of side‐by‐side boxplots to compare the distribution of baseline covariates between treated and control subjects in the weighted sample has previously been described by Joffe et al. 15. These methods allow one to assess whether the variability of a continuous baseline variable differs between treatment groups and whether the tails of the distribution of the variable differ between treatment groups.
4.1.4 Numerical comparisons of the distribution of continuous variables
The graphical methods described in the previous sub‐section permit the analyst to compare the distribution of continuous baseline covariates between treated and control subjects, both in the original sample as well as in the weighted sample. A limitation of the graphical approach is that it relies on a subjective comparison of graphs, especially when comparing two different specifications of the propensity score model. In this section, we propose a numerical method for comparing the distribution of continuous baseline covariates between treatment groups. In the preceding section, we proposed that empirical CDFs be used to compare the distribution of continuous baseline covariates between treatment groups. The Kolmogorov–Smirnov test permits a formal comparison of the distribution of a continuous variable between two independent groups. The test statistic is defined to be the maximal vertical distance between the two empirical CDFs of the variable in the two groups 42. The use of the Kolmogorov–Smirnov test statistic permits a quantification of the difference in the distribution of a continuous baseline covariate between treated and control subjects.
We suggest that analysts restrict their use of this method to estimation of the Kolmogorov–Smirnov test statistic, rather than use it for formal hypothesis testing to detect statistically significant differences in the distribution of the covariate between treatment groups. While there is precedent for the use of statistical hypothesis testing for assessing balance in baseline covariates 2, other authors have suggested that the balance‐test fallacy precludes the use of hypothesis testing when conducting balance assessment 43. One reason for this criticism of statistical hypothesis testing is that one is interested in assessing balance in the particular analytic sample, rather than in the super‐population from which the sample was drawn. Furthermore, while the Kolmogorov–Smirnov test statistic can be computed when using weighted data, we are unaware of a statistical test that permits one to formally use the weighted test statistic to test whether the distribution is different between the two groups However, if an analyst were to conduct formal hypothesis testing using the Kolmogorov–Smirnov test, it could be possible to use a permutation‐based approach to assess the statistical significance of the difference in the distributions between treatment groups.
4.2 Diagnostics for assessing the positivity assumption
In this section, we describe methods for assessing the validity of the positivity assumption. In the context of MSMs, which use IPTW to account for time‐varying treatment and confounding, Cole and Hernan recommend that analysts should determine the mean stabilized weight (note that this recommendation pertains only to when stabilized weights are used) and the standard deviation of the stabilized weights 16. Similarly, they suggest determining the minimum and maximum weights. They suggest that if the mean of the stabilized weights is far from one or if there are very extreme values, then this can be indicative of non‐positivity or that the propensity score model has been misspecified. The standard deviation of the weights can be useful when comparing between different specifications of the propensity score model. Everything else being equal, one would select the specification that resulted in weights with the lowest standard deviation.
5 Case study
5.1 Data sources
We used data on 9107 patients who were discharged alive with an acute myocardial infarction (or heart attack) from 102 hospitals in Ontario, Canada, between 1 April 1999 and 31 March 2001. These data were collected as part of the Enhanced Feedback for Effective Cardiac Treatment (EFFECT) study, an initiative that is focused on improving the quality of care for cardiovascular disease patients in Ontario 44. Data on patient demographics, presenting signs and symptoms, classic cardiac risk factors, comorbid conditions and vascular history, vital signs on admission, and results of laboratory tests were abstracted directly from patients' medical records. The exposure of interest was whether the patient was prescribed a beta‐blocker at hospital discharge.
Overall, 6178 (67.8%) of patients received a prescription for a beta‐blocker at discharge, while 2929 (32.2%) did not receive a prescription at discharge. Table 1 compares the characteristics of patients who did and did not receive a beta‐blocker at hospital discharge. Standardized differences were used to compare the balance in measured baseline covariates between those who did and did not receive a prescription for a beta‐blocker at discharge. Eighteen of the 24 measured baseline covariates had standardized differences that exceeded 10%. While there is no consensus as to what value of a standardized difference can be taken to indicate the presence of meaningful confounding, some authors have suggested that a standardized difference in excess of 10% may be indicative of meaningful imbalance in a covariates between treated and control subjects 10, 45, 46. The largest observed standardized differences were for age (−34.1%) and respiratory rate (−33.2%).
| Variable | Beta‐blocker: No (N=2929) | Beta‐blocker: Yes (N=6178) | Standardized difference |
|---|---|---|---|
| Demographic characteristics | |||
| Age | 69.6 ± 13.5 | 65 ± 13.3 | −34.1 |
| Female | 1144 (39.1%) | 1984 (32.1%) | −14.5 |
| Presenting signs and symptoms | |||
| Cardiogenic shock | 26 (0.9%) | 32 (0.5%) | −4.4 |
| Acute congestive heart failure (CHF)/pulmonary edema | 214 (7.3%) | 224 (3.6%) | −16.2 |
| Classic cardiac risk factors | |||
| Diabetes | 842 (28.7%) | 1494 (24.2%) | −10.4 |
| Current smoker | 916 (31.3%) | 2158 (34.9%) | 7.8 |
| Hyperlipidemia | 767 (26.2%) | 2132 (34.5%) | 18.2 |
| Hypertension | 1343 (45.9%) | 2793 (45.2%) | −1.3 |
| Family history of coronary artery disease | 745 (25.4%) | 2195 (35.5%) | 22.1 |
| Comorbid conditions | |||
| Cerebrovascular disease/transient ischemic attack (CVA/TIA) | 354 (12.1%) | 493 (8%) | −13.7 |
| Angina | 975 (33.3%) | 1982 (32.1%) | −2.6 |
| Cancer | 110 (3.8%) | 154 (2.5%) | −7.3 |
| Congestive heart failure (CHF) | 189 (6.5%) | 177 (2.9%) | −17.1 |
| Renal disease | 21 (0.7%) | 26 (0.4%) | −3.9 |
| Vital signs on admission | |||
| Systolic blood pressure | 146.8 ± 31.4 | 149.9 ± 30.9 | 10.2 |
| Diastolic blood pressure | 81.8 ± 18.6 | 84.9 ± 18.3 | 17.3 |
| Heart rate | 86.9 ± 25.9 | 82.1 ± 22.7 | −19.9 |
| Respiratory rate | 22.2 ± 6.5 | 20.3 ± 4.8 | −33.2 |
| Laboratory tests | |||
| Glucose | 9.8 ± 5.2 | 9.2 ± 5.2 | −11.8 |
| White blood count | 10.6 ± 5.5 | 10 ± 4.3 | −12.4 |
| Hemoglobin | 135.2 ± 20 | 140.2 ± 17.7 | 26.1 |
| Sodium | 138.7 ± 4.2 | 139.2 ± 3.5 | 10.8 |
| Potassium | 4.1 ± 0.6 | 4.1 ± 0.5 | −12.5 |
| Creatinine | 114.2 ± 77.4 | 98.8 ± 50.3 | −23.5 |
- Note: Continuous variables are represented as mean ± standard deviation, while dichotomous variables are represented as N (%).
5.2 IPTW diagnostics
The propensity score was estimated using a logistic regression model in which treatment assignment (beta‐blocker vs. no beta‐blocker) was regressed on the 24 covariates listed in Table 1. We considered two different specifications of the propensity score model. In the first specification, each covariate entered the propensity score model as a main effect only. For the 11 continuous covariates, it was assumed that each covariate was linearly related to the log‐odds of receiving a prescription for a beta‐blocker at hospital discharge. In the second specification, restricted cubic smoothing splines with five knots were used to model the relationship between each of the continuous variables and the log‐odds of treatment 47. We refer to these two specifications as the simple and complex specifications, respectively. Stabilized weights were computed as described previously. The methods described in Section 4 were then used to assess whether, in the sample weighted by the inverse probability of treatment (using the stabilized weights), treated and control subjects had similar distributions of the covariates listed in Table 1, all of which are plausible predictors of outcomes in patients with acute myocardial infarction. By comparing diagnostics between the two different specifications of the propensity score model, we are able to assess whether one specification was preferable to the other.
5.2.1 Diagnostics based on the stabilized weights
When using the simple specification of the propensity score model, the mean stabilized weight was equal to 1.002, while the standard deviation of the stabilized weights was equal to 0.29. The minimum and maximum weights were 0.353 and 4.656, respectively. When using the complex specification of the propensity score model, the mean stabilized weight was equal to 1.001, while the standard deviation of the stabilized weights was equal to 0.30. The minimum and maximum weights were 0.361 and 3.793, respectively. There was no evidence of non‐positivity or of misspecification of the propensity score model based on an examination of the distribution of the weights derived from either specification of the propensity score model. Based on these diagnostics, neither specification was clearly preferable over the other.
5.2.2 Comparison of means and prevalences in the weighted sample
When using the simple specification of the propensity score model, the largest absolute standardized difference in the weighted sample was 2.1% (age) among the 24 baseline covariates. When using the complex specification of the propensity score model, the largest absolute standardized difference in the weighted sample was 1.7% (hyperlipidemia) among the 24 baseline covariates. In contrast, the standardized differences in the unweighted sample exceeded 10% for 18 (75%) of the 24 baseline covariates. Figure 2 reports the absolute standardized differences for each of the 24 baseline covariates in the unweighted sample and in the two samples weighted by weights derived from the simple and complex specifications of the propensity score model. We have superimposed a vertical line on this figure denoting a standardized difference of 10%, as some authors consider standardized differences below this threshold as indicative of negligible imbalance. These diagnostic assessments suggest that weighting by the inverse probability of treatment has created a sample in which the means of continuous baseline covariates and the prevalence of binary baseline variables are similar between treated and control subjects. While better balance was achieved using the complex specification of the propensity score model, differences between the two specifications were at most modest.

5.2.3 Comparison of higher‐order moments and interactions
The mean of the square of each continuous variable was compared between treatment groups in both the original unweighted sample and in each of the two weighted samples. The 11 absolute standardized differences comparing the square of the continuous variables between treatment groups in the original unweighted sample ranged from a low of 5.1% to a high of 35.3%. The 25th percentile, median, and 75th percentile were 9.6%, 14.1%, and 25.6%, respectively. In the sample weighted using the weights derived from the simple specification of the propensity score model, the 11 absolute standardized differences comparing the squares of continuous variables between treatment groups ranged from a low of 0.1% to a high of 2.5%. The 25th percentile, median, and 75th percentile were 0.2%, 0.8%, and 1.5%, respectively. In the sample weighted using the weights derived from the complex specification of the propensity score model, the 11 absolute standardized differences comparing the squares of continuous variables between treatment groups ranged from a low of 0% to a high of 1.9%. The 25th percentile, median, and 75th percentile were 0.1%, 0.5%, and 0.8%, respectively. Similar results were obtained when we compared the mean of cubes of continuous variables between treatment groups.
The 55 interactions between pairs of continuous variables were computed in the original sample. The absolute value of standardized differences comparing the mean of the product of a pair of variables between treated and control subjects in the original unweighted sample ranged from a low of 2.4% to a high of 43.3%. The 25th percentile, median, and 75th percentile were 10.7%, 18.0%, and 24.6%, respectively. Thus, in the original unweighted sample, approximately 75% of the standardized differences comparing pairs of interactions between groups exceeded 10%. In the sample weighted by the weights derived from the simple specification of the propensity score model, the absolute value of the standardized differences comparing the 55 means of products of continuous variables ranged from a low of 0% to a high of 2.3%. The 25th percentile, median, and 75th percentile were 0.5%, 1.0%, and 1.5%, respectively. In the sample weighted by the weights derived from the complex specification of the propensity score model, the absolute value of the standardized differences comparing the 55 means of products of continuous variables ranged from a low of 0% to a high of 1.4%. The 25th percentile, median, and 75th percentile were 0.3%, 0.6%, and 0.9%, respectively.
These analyses suggest that by weighting by the inverse probability of treatment, a sample has been created in which the means of higher‐order terms and interactions between continuous variables are similar between treated and control subjects. Better balance was achieved using the weights derived from the complex specification of the propensity score model than using the simple specification; however, differences between the two approaches were at most modest.
5.2.4 Graphical comparisons of the distribution of continuous covariates
Figures 3-6 display side‐by‐side boxplots and the empirical CDFs comparing the distribution of age, respiratory rate, creatinine, and haemoglobin between treated and control subjects. This is performed in both the original unweighted sample and in the two weighted samples. The three top panels display the side‐by‐side boxplots in the original unmatched sample and in the two weighted samples. The three lower panels display the empirical cumulative distribution functions in the unweighted sample and in the two weighted samples.




In examining the boxplots in the upper left panel of Figure 3, one observes that the median age is greater in patients who did not receive a beta‐blocker compared with patients who did receive a prescription for a beta‐blocker at discharge. Furthermore, the distribution of age is shifted upwards in those who did not receive a prescription compared with those who did receive a prescription. However, after weighting by the inverse probability of treatment, the two side‐by‐side boxplots appear nearly identical. Similarly, the empirical cumulative distribution is nearly identical between treated and control subjects in both weighted samples. Both graphical diagnostics indicate that the distribution of age is nearly identical between treated and control subjects in the two weighted samples. Modest improvement in balance was achieved with the complex specification of the propensity score model compared with the simple specification of the propensity score model.
Figures 4-6 depict the graphical balance diagnostics for respiratory rate, creatinine, and haemoglobin, respectively. These figures (Figure 5 in particular) illustrate a limitation to the use of boxplots. When the interquartile range is very small compared with the range of the data, the box portion of the plot can be very compressed, and it can be difficult to qualitatively compare the similarity of the two boxplots. However, the cumulative distribution plots do not suffer from this limitation. In examining the lower panels of Figures 4-6, it is evident that weighting using the inverse probability of treatment has resulted in a sample in which the distributions of respiratory rate, creatinine, and haemoglobin are nearly identical between treated and control subjects. In order to address the difficulty of interpreting the boxplots for creatinine, Figure 7 compares the distribution of the natural logarithm of creatinine (thereby reducing the influence of extreme observations on the interpretability of the boxplots) between treated and control subjects. Figure 7 reinforces the conclusion that weighting by the inverse probability of treatment has resulted in a sample in which the distribution of creatinine is very similar between treated and control subjects.

These analyses suggest that by weighting by the inverse probability of treatment, a sample has been created in which the univariate distribution of continuous variables are similar between treated and control subjects. While the specifications of the propensity score resulted in weighted samples with comparable balance between treated and control subjects, one could prefer the more complex specification because of its minor improvement in balancing the distribution of age between treated and control subjects.
5.2.5 Kolmogorov–Smirnov test statistic for comparing distribution of baseline covariates between treatment groups
The Kolmogorov–Smirnov test statistic for the 11 continuous covariates in the original unweighted sample ranged from a low of 0.050 (sodium) to a high of 0.164 (age). The Kolmogorov–Smirnov test statistic for the 11 continuous covariates in the sample weighted using the simple specification of the propensity score model ranged from a low of 0.014 (creatinine) to 0.027 (diastolic blood pressure). The Kolmogorov–Smirnov test statistic for the 11 continuous covariates in the sample weighted using the complex specification of the propensity score model ranged from a low of 0.005 (respiratory rate) to a high of 0.020 (heart rate). The test statistic for each of the 11 variables was higher in the sample weighted by the simple specification of the propensity score than it was in the sample weighted by the complex specification of the propensity score (differences ranged from a low of 0.003 to a high of 0.017). As with the graphical tests considered in the previous sub‐section, one would express a slight preference for the more complex specification of the propensity score, as it resulted in slight improvements in balance compared with that observed for the simple specification of the propensity score model.
6 Discussion
We have described methods to assess the adequacy of assumptions necessary for making causal inferences when using IPTW using the propensity score. We described a comprehensive suite of diagnostics to assess whether weighting the sample by the inverse probability of treatment received induced a sample in which the distribution of measured baseline covariates is the same between treated and control subjects. Diagnostics for assessing balance have not been developed formally in the context of IPTW, nor has their use been forcefully advocated. Likely in part because of this, applied researchers commonly do not use appropriate diagnostics. We propose that the weighted standardized difference be used to compare means and prevalences of continuous and binary variables, respectively, between treated and control subjects in the weighted sample and also to compare higher‐order moments and interactions between continuous variables. Furthermore, the use of cumulative distribution functions and side‐by‐side boxplots allows researchers to qualitatively compare the distribution of continuous variables between treated and control subjects in the sample weighted by the inverse probability of treatment. The Kolmogorov–Smirnov test statistic permits a numerical comparison of the distribution of continuous baseline covariates between treatment groups in the weighted sample. If, in the sample weighted by the estimated inverse probability of treatment, systematic differences persist between treated and control subjects, this may be an indication that the specification of the propensity score model requires modification. If this occurs, an iterative approach to developing the propensity score model, similar to the one suggested by Rosenbaum and Rubin, may be required 2. Finally, when using stabilized weights, examining the distribution of the weights (in particular the mean weight) permits an examination of whether the propensity score model had been correctly specified and whether the positivity assumption had been violated.
Diagnostics for the adequacy of the specification of the propensity score model have been developed in the context of propensity score matching, stratification on the propensity score, and covariate adjustment using the propensity score. Methods have been developed to assess whether matching on the propensity score has resulted in a matched sample in which the distribution of measured baseline covariates are similar between treated and control subjects. Ho et al. suggest comparing higher‐order moments and important two‐way interactions between treated and control subjects 9. Furthermore, Austin described a comprehensive set of methods, including graphical approaches, for assessing the comparability of treated and control subjects in the matched sample 10. Similarly, balance diagnostics have been proposed for when stratification on the quintiles of the propensity score is employed. In the first application of propensity score methods, Rosenbaum and Rubin used two‐way analysis of variance models to regress each measured baseline covariate on propensity score quintile (as a five‐level categorical variable), an indicator variable for treatment selection, and the two‐way interaction between these two factors 2. The significance of either the treatment indicator or the interaction variable was used to infer that the mean of that baseline covariate differed between treated and control subjects within at least one quintile of the propensity score. Other authors have proposed the use of within‐quintile standardized differences to compare the distribution of baseline covariates between treated and control subjects 8, 48. In the context of covariate adjustment using the propensity score, weighted conditional absolute standardized differences and quantile regression have been proposed to assess the balance in measured baseline covariates between treated and control subjects with the same propensity score 11. While several applied studies have reported the area under the receiver operating characteristic curve of the propensity score model (equivalent to the model c‐statistic), recent research has indicated that this does not serve as a test of whether the propensity score model has been correctly specified 10, 49. Similarly, in the context of propensity score matching, comparing the empirical distribution of the propensity score does not serve as an indication of whether the propensity score model has been correctly specified 10.
In our literature review, we observed that of the 29 studies published in 2014 that used IPTW methods, fewer than half described a comparison of baseline covariates between treated and control subjects in the weighted sample (regardless of the quality or appropriateness of the method for comparison). Further, only a small minority of studies (10.3%) reported examining the distribution of weights. These observations can be compared with those from a previous review of the use of propensity score matching in studies published in the medical literature between 1996 and 2003 50. Of the 47 articles that employed propensity score matching, in only eight (17%) was there no reported assessment of the comparability of the treatment groups. In the large majority of papers (83%), some form of balance assessment was conducted and reported. Thus, it appears that when analysts use propensity score matching, there is a general awareness of the need to compare the distribution of measured baseline covariates between treatment groups. However, there is a much more limited recognition that this is equally important when using IPTW. These observations highlight the need for guidance on optimal statistical practice when using IPTW. As our review also demonstrated, the use of IPTW has grown substantially in recent years. Given the increasing interest in this method for estimating causal effects and the poor statistical practice that is evident when this method is used, it is imperative that information on best statistical practice be disseminated to a wide audience.
We acknowledge that the balance diagnostics described in this paper are not particularly novel. Indeed, they are all weighted versions of methods that have been proposed for examining covariate balance in the context of propensity score matching. Furthermore, some of the proposed methods have been used previously in the literature. In a case study comparing the effect of Catholic schooling versus public school on student outcomes, Morgan and Todd used standardized differences to compare means of covariates between the two educational systems in the weighted sample 30. They compared the variance of continuous variables between groups. Similarly, Joffe et al. used side‐by‐side boxplots to compare the distribution of continuous variables between treatment groups in the weighted sample 15. Thus, some of the diagnostics that we have described have been previously employed in the causal inference literature. By having a paper focussed on balance diagnostics in the context of IPTW, our objective is to contribute towards the evolvement of what constitutes best practice when using IPTW. Morgan and Todd suggest that a comparison of characteristics between treated and control subjects should be a step in an analysis of causal treatment effects using IPTW 18. Our hope is that formal balance diagnostics will become a well‐accepted and formal step in any analysis that uses propensity score weighting. The methods described in this paper can serve as a template on how to conduct this step. Our literature review indicated that the use of IPTW has become increasingly popular in recent years and that the use of appropriate balance and weight diagnostics are frequently omitted from published studies. This suggests that there is an urgent need for the dissemination of appropriate diagnostics to complement the reporting of effect estimates.
As noted in the Introduction, there are four primary methods of using the propensity score to estimate treatment effects: matching, stratification (or subclassification), weighting, and covariate adjustment using the propensity score. Of these, the last two use the propensity score directly in estimating the effect of treatment, while the first two use the propensity score only for grouping subjects but not in estimating the effect of treatment. Rubin has suggested that for this reason, the latter two methods may be more sensitive to misspecification of the propensity score model than the first two methods 6. For this reason, it is imperative that balance and weight diagnostics accompany analyses that use IPTW.
The interpretation of balance diagnostics is, to a certain extent, inherently subjective. The degree of imbalance that is acceptable likely depends on the magnitude of the effect of the covariate on the outcome. Thus, greater imbalance may be acceptable for covariates that are weakly prognostic than for covariates that are strongly prognostic. Furthermore, the analyst may be faced with a situation in which one specification of the propensity score model results in better balance of a given covariate and worse balance of a different covariate compared with a different specification of the propensity score model. In such a setting, the analyst would need to consider the relative effects of each covariate on the outcome when deciding which specification to use in the final analyses.
There are avenues for future extensions of the proposed diagnostics. Our discussion of diagnostics for use with IPTW using the propensity score has been in the context of a binary or dichotomous treatment. While not considered in the current paper, these methods can be extended to settings with polytomous exposures. Furthermore, we have restricted our focus to settings in which propensity score methods are most commonly used: cohort designs in which there is a point‐exposure that is applied and defined at baseline. MSMs are a family of models for use with longitudinal studies in which there is both time‐varying treatment and time‐varying confounding 15, 51-53. The parameters of these models are often estimated using IPTW. While more complex, the methods described in the current paper should be modifiable for use with MSMs.
In summary, we found that the use of IPTW is increasing rapidly in the applied literature. However, many published studies omit the crucial step of assessing the comparability of the treated and control groups in the weighted sample. To address this weakness in published studies, we have described diagnostics for assessing the balance of baseline covariates between treatment groups in the sample weighted by the inverse probability of treatment. Our hope is that increased use of these diagnostics will improve the practice of IPTW for estimating average treatment effects using observational data. Adherence to the methods described in this paper may contribute towards the evolvement of what is considered ‘best practice’ when using IPTW to estimate causal treatment effects.
Acknowledgements
This study was supported by the Institute for Clinical Evaluative Sciences (ICES), which is funded by an annual grant from the Ontario Ministry of Health and Long‐Term Care (MOHLTC). Dr. Austin is supported in part by a Career Investigator award from the Heart and Stroke Foundation of Ontario. This study was supported in part by an operating grant from the Canadian Institutes of Health Research (CIHR) (funding number: MOP 86508). Dr. Stuart's time was supported by the National Institute of Mental Health, R01MH099010. The EFFECT data used in the study was funded by a CIHR team grant in Cardiovascular Outcomes Research. These datasets were linked using unique, encoded identifiers and analyzed at the Institute for Clinical Evaluative Sciences (ICES).
Disclaimer
The opinions, results, and conclusions reported in this paper are those of the authors and are independent from the funding sources. No endorsement by ICES or the Ontario MOHLTC is intended or should be inferred.
References
Citing Literature
Number of times cited according to CrossRef: 817
- Daniel Higgins, Aslihan Arslan, Paul Winters, What role can small-scale irrigation play in promoting inclusive rural transformation? Evidence from smallholder rice farmers in the Philippines, Agricultural Water Management, 10.1016/j.agwat.2020.106437, 243, (106437), (2021).
- Adam J. Olszewski, James N. Butera, John L. Reagan, Jorge J. Castillo, Outcomes of bendamustine‐ or cyclophosphamide‐based first‐line chemotherapy in older patients with indolent B‐cell lymphoma, American Journal of Hematology, 10.1002/ajh.25707, 95, 4, (354-361), (2020).
- Avirup Guha, Amit Kumar Dey, Shuab Omer, William T. Abraham, Guilherme Attizzani, Hani Jneid, Daniel Addison, Contemporary Trends and Outcomes of Percutaneous and Surgical Mitral Valve Replacement or Repair in Patients with Cancer, The American Journal of Cardiology, 10.1016/j.amjcard.2020.01.047, (2020).
- Peter Alping, Johan Askling, Joachim Burman, Katharina Fink, Anna Fogdell‐Hahn, Martin Gunnarsson, Jan Hillert, Annette Langer‐Gould, Jan Lycke, Petra Nilsson, Jonatan Salzer, Anders Svenningsson, Magnus Vrethem, Tomas Olsson, Fredrik Piehl, Thomas Frisell, Cancer Risk for Fingolimod, Natalizumab, and Rituximab in Multiple Sclerosis Patients, Annals of Neurology, 10.1002/ana.25701, 87, 5, (688-699), (2020).
- Maria K. Pomponio, Laura Burkbauer, Macy Goldbach, Susanna M. Nazarian, Fei Xie, Amy S. Clark, Jennifer M. Matro, Kevin R. Fox, Lawrence N. Shulman, Luke J. Keele, Julia Tchou, Refining the indications for neoadjuvant chemotherapy for patients with HER2+ breast cancer: A single institution experience, Journal of Surgical Oncology, 10.1002/jso.25814, 121, 3, (447-455), (2020).
- Kylee L. Martens, Parisa R. Khalighi, Shan Li, Andrew A. White, Emily Silgard, Deborah Frieze, Eli Estey, David A. Garcia, Sangeeta Hingorani, Ang Li, Comparative Effectiveness of Rasburicase versus Allopurinol for Cancer Patients with Renal Dysfunction and Hyperuricemia, Leukemia Research, 10.1016/j.leukres.2020.106298, (106298), (2020).
- Lawrence P. O. Were, Edwin Were, Richard Wamai, Joseph Hogan, Omar Galarraga, Effects of social health insurance on access and utilization of obstetric health services: results from HIV+ pregnant women in Kenya, BMC Public Health, 10.1186/s12889-020-8186-y, 20, 1, (2020).
- Mark Broadwin, Patrick E. Grant, Michael P. Robich, Monica L. Palmeri, F.L. Lucas, Joseph Rappold, Robert S. Kramer, Comparison of intraoperative tranexamic acid and epsilon-aminocaproic acid in cardiopulmonary bypass patients, JTCVS Open, 10.1016/j.xjon.2020.05.003, (2020).
- Bénédicte Delcoigne, Ali Manouchehrinia, Christian Barro, Pascal Benkert, Zuzanna Michalak, Ludwig Kappos, David Leppert, Jon A. Tsai, Tatiana Plavina, Bernd C. Kieseier, Jan Lycke, Lars Alfredsson, Ingrid Kockum, Jens Kuhle, Tomas Olsson, Fredrik Piehl, Blood neurofilament light levels segregate treatment effects in multiple sclerosis, Neurology, 10.1212/WNL.0000000000009097, 94, 11, (e1201-e1212), (2020).
- Philip H. Pucher, Saqib A. Rahman, Robert C. Walker, Ben L. Grace, Andrew Bateman, Tim Iveson, Andrew Jackson, Charlotte Rees, James P. Byrne, Jamie J. Kelly, Fergus Noble, Timothy J. Underwood, Outcomes and survival following neoadjuvant chemotherapy versus neoadjuvant chemoradiotherapy for cancer of the esophagus: Inverse propensity score weighted analysis, European Journal of Surgical Oncology, 10.1016/j.ejso.2020.06.038, (2020).
- Konrad Salata, Faraj W. Abdallah, Mohamad A. Hussain, Charles de Mestral, Elisa Greco, Badr Aljabri, Muhammad Mamdani, C. David Mazer, Thomas L. Forbes, Subodh Verma, Mohammed Al-Omran, Short-term outcomes of combined neuraxial and general anaesthesia versus general anaesthesia alone for elective open abdominal aortic aneurysm repair: retrospective population-based cohort study†, British Journal of Anaesthesia, 10.1016/j.bja.2020.01.018, (2020).
- Kathryn M. Leifheit, Gabriel L. Schwartz, Craig E. Pollack, Maureen M. Black, Kathryn J. Edin, Keri N. Althoff, Jacky M. Jennings, Eviction in early childhood and neighborhood poverty, food security, and obesity in later childhood and adolescence: Evidence from a longitudinal birth cohort, SSM - Population Health, 10.1016/j.ssmph.2020.100575, (100575), (2020).
- Hadrien Alric, Aurélien Amiot, Julien Kirchgesner, Xavier Tréton, Matthieu Allez, Yoram Bouhnik, Laurent Beaugerie, Franck Carbonnel, Antoine Meyer, The effectiveness of either ustekinumab or vedolizumab in 239 patients with Crohn's disease refractory to anti‐tumour necrosis factor, Alimentary Pharmacology & Therapeutics, 10.1111/apt.15706, 51, 10, (948-957), (2020).
- A. S. Foulkes, R. Balasubramanian, J. Qian, M. P. Reilly, Non-random sampling leads to biased estimates of transcriptome association, Scientific Reports, 10.1038/s41598-020-62575-x, 10, 1, (2020).
- Te‐yuan Chyou, Revathi Nishtala, Prasad S. Nishtala, Comparative risk of Parkinsonism associated with olanzapine, risperidone and quetiapine in older adults‐a propensity score matched cohort study, Pharmacoepidemiology and Drug Safety, 10.1002/pds.5007, 29, 6, (692-700), (2020).
- Sophie Rym Hamada, Romain Pirracchio, Jocelyn Beauchesne, Mohammed Nadjib Benlaldj, Eric Meaudre, Marc Leone, Julien Pottecher, Paer Selim Abback, Tobias Gauss, Mathieu Boutonnet, Fabrice Cook, Delphine Garrigue, Frédéric Lesache, Josse Julie, Alexandra Rouquette, Jacques Duranteau, Effect of fibrinogen concentrate administration on early mortality in traumatic hemorrhagic shock, Journal of Trauma and Acute Care Surgery, 10.1097/TA.0000000000002624, 88, 5, (661-670), (2020).
- E. A. Vega, C. Conrad, Author response to: Comment on: Comparison of oncological outcomes after open and laparoscopic re‐resection of incidental gallbladder cancer, BJS (British Journal of Surgery), 10.1002/bjs.11548, 107, 6, (770-771), (2020).
- Koji Matsuo, David J. Nusbaum, Hiroko Machida, Yongmei Huang, Varun Khetan, Shinya Matsuzaki, Maximilian Klar, Brendan H. Grubbs, Lynda D. Roman, Jason D. Wright, Populational trends and outcomes of postoperative radiotherapy for high-risk early-stage cervical cancer with lymph node metastasis: concurrent chemo-radiotherapy versus radiotherapy alone, American Journal of Obstetrics and Gynecology, 10.1016/j.ajog.2019.10.010, 222, 5, (484.e1-484.e15), (2020).
- Hideo Yasunaga, Introduction to Applied Statistics—Chapter 1 Propensity Score Analysis, Annals of Clinical Epidemiology, 10.37737/ace.2.2_33, 2, 2, (33-37), (2020).
- Stephen Rocks, Mina Fazel, Apostolos Tsiachristas, Impact of transforming mental health services for young people in England on patient access, resource use and health: a quasi-experimental study, BMJ Open, 10.1136/bmjopen-2019-034067, 10, 1, (e034067), (2020).
- Shun Kohsaka, Jun Katada, Kumiko Saito, Aaron Jenkins, Benjamin Li, Jack Mardekian, Yasuo Terayama, Safety and effectiveness of non-vitamin K oral anticoagulants versus warfarin in real-world patients with non-valvular atrial fibrillation: a retrospective analysis of contemporary Japanese administrative claims data, Open Heart, 10.1136/openhrt-2019-001232, 7, 1, (e001232), (2020).
- Prasad S. Nishtala, Heather Allore, Ling Han, Hamish A. Jamieson, Sarah N. Hilmer, Te-yuan Chyou, Impact of Anticholinergic Burden on Cognitive Performance: A Cohort Study of Community-Dwelling Older Adults, Journal of the American Medical Directors Association, 10.1016/j.jamda.2020.03.027, (2020).
- Tak Kyu Oh, Jung-Hee Ryu, Hye-Min Sohn, Young-Tae Jeon, Intraoperative Hypothermia Is Associated with Reduced Acute Kidney Injury After Spine Surgery Under General Anesthesia, Journal of Neurosurgical Anesthesiology, 10.1097/ANA.0000000000000552, 32, 1, (63-69), (2020).
- Yu Wu, Claire Boulogne, Stefan Carle, Maria Podinovskaia, Holger Barth, Anne Spang, Jean‐Christophe Cintrat, Daniel Gillet, Julien Barbier, Regulation of endo‐lysosomal pathway and autophagic flux by broad‐spectrum antipathogen inhibitor ABMA, The FEBS Journal, 10.1111/febs.15201, 287, 15, (3184-3199), (2020).
- Matteo Daverio, Filippo Patrucco, Francesco Gavelli, Chiara Airoldi, Giusy Sciortino, Claudia Chiaramonte, Ottavio Rena, Piero E. Balbo, Renzo L. Boldorini, Comparative analysis of programmed death ligand 1 expression in paired cytologic and histologic specimens of non–small cell lung cancer, Cancer Cytopathology, 10.1002/cncy.22292, 128, 8, (580-588), (2020).
- Rachel Jug, Wen‐Chi Foo, Claudia Jones, Sara Ahmadi, Xiaoyin “Sara” Jiang, High‐risk and intermediate‐high–risk results from the ThyroSeq v2 and v3 thyroid genomic classifier are associated with neoplasia: Independent performance assessment at an academic institution, Cancer Cytopathology, 10.1002/cncy.22283, 128, 8, (563-569), (2020).
- Yuan Wu, Xiaofei Wang, Jiaxing Lin, Beilin Jia, Kouros Owzar, Predictive accuracy of markers or risk scores for interval censored survival data, Statistics in Medicine, 10.1002/sim.8547, 39, 18, (2437-2446), (2020).
- Claudio Bellevicine, Elena Vigliar, Giancarlo Troncone, Thyroid FNA in the time of coronavirus: The interventional cytopathologist point of view, Cancer Cytopathology, 10.1002/cncy.22294, 128, 8, (589-589), (2020).
- Swati P. Satturwar, Joshua L. Pantanowitz, Christopher D. Manko, Lindsey Seigh, Sara E. Monaco, Liron Pantanowitz, Ki‐67 proliferation index in neuroendocrine tumors: Can augmented reality microscopy with image analysis improve scoring?, Cancer Cytopathology, 10.1002/cncy.22272, 128, 8, (535-544), (2020).
- Deeksha Tripathi, Sashi Kant, Saurabh Pandey, Nasreen Z. Ehtesham, Resistin in metabolism, inflammation, and disease, The FEBS Journal, 10.1111/febs.15322, 287, 15, (3141-3149), (2020).
- Ricardo Pastorello, Ediel Valerio, Alberto Lobo, Antonio Maia, Mauro Saieg, Do thyroid nodules that arise in the isthmus have a higher risk of malignancy?, Cancer Cytopathology, 10.1002/cncy.22260, 128, 8, (520-522), (2020).
- Alessia Munzone, Bilal El Kerdi, Mathieu Fanuel, Hélène Rogniaux, David Ropartz, Marius Réglier, Antoine Royant, A. Jalila Simaan, Christophe Decroos, Characterization of a bacterial copper‐dependent lytic polysaccharide monooxygenase with an unusual second coordination sphere, The FEBS Journal, 10.1111/febs.15203, 287, 15, (3298-3314), (2020).
- Andriy Derkach, Steven C. Moore, Simina M. Boca, Joshua N. Sampson, Group testing in mediation analysis, Statistics in Medicine, 10.1002/sim.8546, 39, 18, (2423-2436), (2020).
- Csaba Szabo, The re‐emerging pathophysiological role of the cystathionine‐β‐synthase ‐ hydrogen sulfide system in Down syndrome, The FEBS Journal, 10.1111/febs.15214, 287, 15, (3150-3160), (2020).
- Qiaoqiao Zhou, Cheng Cheng, Yujuan Wei, Jing Yang, Wanzhu Zhou, Qiuyi Song, Mengxiang Ke, Wanyao Yan, Ling Zheng, Yu Zhang, Kun Huang, USP15 potentiates NF‐κB activation by differentially stabilizing TAB2 and TAB3, The FEBS Journal, 10.1111/febs.15202, 287, 15, (3165-3183), (2020).
- Carolann Risley, Megan A. Clarke, Kim R. Geisinger, Mary W. Stewart, Lei Zhang, Kim W. Hoover, Laree M. Hiser, Kenyata Owens, Maria DeMarco, Mark Schiffman, Nicolas Wentzensen, Racial differences in HPV type 16 prevalence in women with ASCUS of the uterine cervix, Cancer Cytopathology, 10.1002/cncy.22267, 128, 8, (528-534), (2020).
- Xiaochun Li, Changyu Shen, Doubly Robust Estimation of Causal Effect, Circulation: Cardiovascular Quality and Outcomes, 10.1161/CIRCOUTCOMES.119.006065, 13, 1, (2020).
- Elizabeth Alwers, Lina Jansen, Hendrik Bläker, Matthias Kloor, Katrin E. Tagscherer, Wilfried Roth, Daniel Boakye, Esther Herpel, Carsten Grüllich, Jenny Chang‐Claude, Hermann Brenner, Michael Hoffmeister, Microsatellite instability and survival after adjuvant chemotherapy among stage II and III colon cancer patients: results from a population‐based study, Molecular Oncology, 10.1002/1878-0261.12611, 14, 2, (363-372), (2020).
- Hidetaka Kawamura, Toshitaka Morishima, Akira Sato, Michitaka Honda, Isao Miyashiro, Effect of adjuvant chemotherapy on survival benefit in stage III colon cancer patients stratified by age: a Japanese real-world cohort study, BMC Cancer, 10.1186/s12885-019-6508-1, 20, 1, (2020).
- Lin Han, Hedong Han, Liang Wang, Yiming Ruan, Xin Wei, Jia He, Xuhua Lu, Prior bariatric surgery is associated with lower complications, in-hospital mortality and healthcare utilization following elective spine fusion surgery, Surgery for Obesity and Related Diseases, 10.1016/j.soard.2019.12.027, (2020).
- Pajaree Mongkhon, Laura Fanning, Wallis C.Y. Lau, Gary Tse, Kui Kai Lau, Li Wei, Chuenjid Kongkaew, Ian C.K. Wong, Oral Anticoagulant and Reduced Risk of Dementia in Patients with Atrial Fibrillation: A Population-Based Cohort Study, Heart Rhythm, 10.1016/j.hrthm.2020.01.007, (2020).
- Maria Pomponio, Laura Burkbauer, Macy Goldbach, Luke Keele, Kelly C. Allison, Yun R. Li, Susanna M. Nazarian, Julia Tchou, Is there an association between body mass index and 21-gene recurrence score?, Surgical Oncology, 10.1016/j.suronc.2020.01.007, (2020).
- Sandra Chyderiotis, Tarik Benmarhnia, François Beck, Stanislas Spilka, Stéphane Legleye, Does e-cigarette experimentation increase the transition to daily smoking among young ever-smokers in France?, Drug and Alcohol Dependence, 10.1016/j.drugalcdep.2020.107853, (107853), (2020).
- Soonil Kwon, So-Ryoung Lee, Eue-Keun Choi, Won-Seok Choe, Euijae Lee, Jin-Hyung Jung, Kyung-Do Han, Seil Oh, Gregory Y.H. Lip, Non-vitamin K antagonist Oral anticoagulants in very elderly east Asians with atrial fibrillation: A Nationwide population-based study, American Heart Journal, 10.1016/j.ahj.2020.08.006, (2020).
- Ajinkya Pawar, Rishi J Desai, Nileesa Gautam, Seoyoung C Kim, Risk of admission to hospital for serious infection after initiating tofacitinib versus biologic DMARDs in patients with rheumatoid arthritis: a multidatabase cohort study, The Lancet Rheumatology, 10.1016/S2665-9913(19)30137-7, (2020).
- Hideyuki Mouri, Taisuke Jo, Hiroki Matsui, Kiyohide Fushimi, Hideo Yasunaga, Effect of Sugammadex on Postoperative Myasthenic Crisis in Myasthenia Gravis Patients, Anesthesia & Analgesia, 10.1213/ANE.0000000000004239, 130, 2, (367-373), (2020).
- Julie Fiset-Laniel, Mamadou Diop, Sylvie Provost, Erin C. Strumpf, The Impact of Team-Based Primary Care on Guideline-Recommended Disease Screening, American Journal of Preventive Medicine, 10.1016/j.amepre.2019.10.019, (2020).
- Tarik K. Yuce, Ryan J. Ellis, Jeanette Chung, Ryan P. Merkow, Anthony D. Yang, Nathaniel J. Soper, Edward J. Tanner, Edward M. Schaeffer, Karl Y. Bilimoria, Gregory B. Auffenberg, Association between surgical approach and survival following resection of abdominopelvic malignancies, Journal of Surgical Oncology, 10.1002/jso.25841, 121, 4, (620-629), (2020).
- Doo Sun Sim, Dae Young Hyun, Myung Ho Jeong, Hyo Soo Kim, Kiyuk Chang, Dong Ju Choi, Kyoo Rok Han, Tae Hoon Ahn, Jang Hwan Bae, Si Wan Choi, Jong Seon Park, Seung Ho Hur, Jei Keon Chae, Seok Kyu Oh, Kwang Soo Cha, Jin Yong Hwang, Effect of Low-Dose Nebivolol in Patients with Acute Myocardial Infarction: A Multi-Center Observational Study, Chonnam Medical Journal, 10.4068/cmj.2020.56.1.55, 56, 1, (55), (2020).
- Lasse H. Jakobsen, Fredrik Ellin, Knut B. Smeland, Tove Wästerlid, Jacob H. Christensen, Judit M. Jørgensen, Pär L. Josefsson, Andreas K. Øvlisen, Harald Holte, Yngvild N. Blaker, Jacob H. Grauslund, Jon Bjørn, Daniel Molin, Ingemar Lagerlöf, Karin E. Smedby, Katherine Colvin, Gita Thanarajasingam, Matthew J. Maurer, Thomas M. Habermann, Kevin W. Song, Katie Y. Zhu, Alina S. Gerrie, Chan Y. Cheah, Tarec C. El‐Galaly, Minimal relapse risk and early normalization of survival for patients with Burkitt lymphoma treated with intensive immunochemotherapy: an international study of 264 real‐world patients, British Journal of Haematology, 10.1111/bjh.16425, 189, 4, (661-671), (2020).
- Lara A Kimmel, Pamela M Simpson, Anne E Holland, Elton R Edwards, Peter A Cameron, Richard S Steiger, Richard S Page, Raphael Hau, Andrew Bucknill, Jessica Kasza, Belinda J Gabbe, Discharge destination and patient‐reported outcomes after inpatient treatment for isolated lower limb fractures, Medical Journal of Australia, 10.5694/mja2.50485, 212, 6, (263-270), (2020).
- Zhongkai Hao, Qi Chen, Wenrui Dai, Yinjuan Ren, Yin Zhou, Jinlin Yang, Sijie Xie, Yanbin Shen, Jihong Wu, Wei Chen, Guo Qin Xu, Oxygen‐Deficient Blue TiO2 for Ultrastable and Fast Lithium Storage, Advanced Energy Materials, 10.1002/aenm.201903107, 10, 10, (2020).
- María Bárbara Postillone, Gustavo Martínez, Gustavo Flensborg, Cristina B. Dejean, First analysis of mitochondrial lineages from the eastern Pampa–Patagonia transition during the final late Holocene, American Journal of Physical Anthropology, 10.1002/ajpa.24016, 171, 4, (659-670), (2020).
- Cynthia A. Jackevicius, Harlan M. Krumholz, Joseph S. Ross, Maria Koh, Alice Chong, Peter C. Austin, Therese A. Stukel, Paymon Azizi, Dennis T. Ko, Clinical Outcomes with Beta-Blocker Use in Patients with Prior History of Myocardial Infarction, Canadian Journal of Cardiology, 10.1016/j.cjca.2020.01.024, (2020).
- Jaewhan Kim, Steven Simper, Rodrick McKinlay, Daniel Cottam, Amit Surve, Ted Adams, Healthcare Cost and Utilization of Bariatric Surgical Patients with and without Pre-operative Mental Health Diagnoses, Surgery for Obesity and Related Diseases, 10.1016/j.soard.2020.01.035, (2020).
- Rachel S. Mandelbaum, Marcia A. Ciccone, David J. Nusbaum, Mahdi Khoshchehreh, Heena Purswani, Elise B. Morocco, Meghan B. Smith, Shinya Matsuzaki, Christina E. Dancz, Begum Ozel, Lynda D. Roman, Richard J. Paulson, Koji Matsuo, Progestin therapy for obese women with complex atypical hyperplasia: levonorgestrel-releasing intrauterine device vs systemic therapy, American Journal of Obstetrics and Gynecology, 10.1016/j.ajog.2019.12.273, (2020).
- Shannon Doocy, Martin Busingye, Emily Lyles, Elizabeth Colantouni, Bridget Aidam, George Ebulu, Kevin Savage, Cash and voucher assistance and children's nutrition status in Somalia, Maternal & Child Nutrition, 10.1111/mcn.12966, 16, 3, (2020).
- Courtenay M. Holscher, Tanveen Ishaque, Christine E. Haugen, Kyle R. Jackson, Jacqueline M. Garonzik Wang, Yifan Yu, Fawaz Al Ammary, Dorry L. Segev, Allan B. Massie, Association Between Living Kidney Donor Postdonation Hypertension and Recipient Graft Failure, Transplantation, 10.1097/TP.0000000000002832, 104, 3, (583-590), (2020).
- Marco Zimarino, Emanuele Barbato, Sunao Nakamura, Francesco Radico, Marta Di Nicola, Carlo Briguori, Robert J. Gil, Vojko Kanic, Matteo Perfetti, Mariano Pellicano, Kristina Mairic, Goran Stankovic, The impact of the extent of side branch disease on outcomes following bifurcation stenting, Catheterization and Cardiovascular Interventions, 10.1002/ccd.28842, 96, 1, (E84-E92), (2020).
- RS Mandelbaum, MB Smith, CJ Violette, S Matsuzaki, K Matsushima, M Klar, LD Roman, RJ Paulson, K Matsuo, Conservative surgery for ovarian torsion in young women: perioperative complications and national trends, BJOG: An International Journal of Obstetrics & Gynaecology, 10.1111/1471-0528.16179, 127, 8, (957-965), (2020).
- Amy L. Petts, It’s All in the Definition: Color‐Blind Interpretations of School Diversity, Sociological Forum, 10.1111/socf.12590, 35, 2, (465-487), (2020).
- Thai‐Son Tang, Peter C. Austin, Keith A. Lawson, Antonio Finelli, Olli Saarela, Constructing inverse probability weights for institutional comparisons in healthcare, Statistics in Medicine, 10.1002/sim.8657, 39, 23, (3156-3172), (2020).
- Zefu Liu, Jialing Huang, Xiangdong Li, Chaowen Huang, Yunlin Ye, Jinxin Zhang, Zhouwei Liu, The effectiveness of chemotherapy for patients with pT3N0M0 renal pelvic urothelial carcinomas: An inverse probability of treatment weighting comparison using Surveillance, Epidemiology, and End Results data, Cancer Medicine, 10.1002/cam4.3238, 9, 16, (5756-5766), (2020).
- Jian Xi Zhang, Peipei Li, Zhibin Chen, Huogui Lin, Zhezhen Cai, Weijia Liao, Zirong Pan, Impact of liver fibrosis score on prognosis after common therapies for intrahepatic cholangiocarcinoma: a propensity score matching analysis, BMC Cancer, 10.1186/s12885-020-07051-5, 20, 1, (2020).
- Association between Aspirin and Hepatocellular Carcinoma, New England Journal of Medicine, 10.1056/NEJMc2009497, 382, 25, (2480-2482), (2020).
- Lovekesh Mehta, Rakhi Dhankhar, Pooja Gulati, Rajeev Kumar Kapoor, Aparajita Mohanty, Sanjay Kumar, Natural and grafted cyclotides in cancer therapy: An insight, Journal of Peptide Science, 10.1002/psc.3246, 26, 4-5, (2020).
- Jingrui Liu, Sien Zhan, Yan Jia, Youran Li, Ying Liu, Ying Dong, Guodong Tang, Lin Li, Yanhong Zhai, Zheng Cao, Retinol and α‐tocopherol in pregnancy: Establishment of reference intervals and associations with CBC, Maternal & Child Nutrition, 10.1111/mcn.12975, 16, 3, (2020).
- Jian Ma, Yijuan Qiao, Pei Zhao, Wei Li, Peter T. Katzmarzyk, Jean‐Philippe Chaput, Mikael Fogelholm, Rebecca Kuriyan, Estelle V. Lambert, Carol Maher, Jose Maia, Victor Matsudo, Timothy Olds, Vincent Onywera, Olga L. Sarmiento, Martyn Standage, Mark S. Tremblay, Catrine Tudor‐Locke, Gang Hu, Breastfeeding and childhood obesity: A 12‐country study, Maternal & Child Nutrition, 10.1111/mcn.12984, 16, 3, (2020).
- Hui‐Jie Lee, John B. Wong, Beilin Jia, Xinyue Qi, Elizabeth R. DeLong, Empirical use of causal inference methods to evaluate survival differences in a real‐world registry vs those found in randomized clinical trials, Statistics in Medicine, 10.1002/sim.8581, 39, 22, (3003-3021), (2020).
- Megan S. Schuler, Sara E. Heins, Rosanna Smart, Beth Ann Griffin, David Powell, Elizabeth A. Stuart, Bryce Pardo, Sierra Smucker, Stephen W. Patrick, Rosalie Liccardo Pacula, Bradley D. Stein, The state of the science in opioid policy research, Drug and Alcohol Dependence, 10.1016/j.drugalcdep.2020.108137, 214, (108137), (2020).
- Tracey G. Simon, Ann-Sofi Duberg, Soo Aleman, Raymond T. Chung, Andrew T. Chan, Jonas F. Ludvigsson, Association of Aspirin with Hepatocellular Carcinoma and Liver-Related Mortality, New England Journal of Medicine, 10.1056/NEJMoa1912035, 382, 11, (1018-1028), (2020).
- Yan Li, Chris Delcher, Yu‐Jung Jenny Wei, Gary M. Reisfield, Joshua D. Brown, Patrick Tighe, Almut G. Winterstein, Risk of Opioid Overdose Associated With Concomitant Use of Opioids and Skeletal Muscle Relaxants: A Population‐Based Cohort Study, Clinical Pharmacology & Therapeutics, 10.1002/cpt.1807, 108, 1, (81-89), (2020).
- So-Ryoung Lee, Eue-Keun Choi, Soonil Kwon, Jin-Hyung Jung, Kyung-Do Han, Myung-Jin Cha, Seil Oh, Gregory Y.H. Lip, Effectiveness and Safety of Direct Oral Anticoagulants in Relation to Temporal Changes in Their Use, Circulation: Cardiovascular Quality and Outcomes, 10.1161/CIRCOUTCOMES.119.005894, 13, 3, (2020).
- Naoki Fujita, Shingo Hatakeyama, Masaki Momota, Yuki Tobisawa, Tohru Yoneyama, Hayato Yamamoto, Atsushi Imai, Hiroyuki Ito, Takahiro Yoneyama, Yasuhiro Hashimoto, Kazuaki Yoshikawa, Yasushi Mariya, Chikara Ohyama, Safety and feasibility of radiation therapy to the primary tumor in patients with metastatic castration-resistant prostate cancer, Clinical Genitourinary Cancer, 10.1016/j.clgc.2020.03.009, (2020).
- Tetsuji Ota, Pichdara Lonn, Nobuya Mizoue, A country scale analysis revealed effective forest policy affecting forest cover changes in Cambodia, Land Use Policy, 10.1016/j.landusepol.2020.104597, 95, (104597), (2020).
- Mohammad Movahedi, Elliot Hepworth, Reza Mirza, Angela Cesta, Maggie Larche, Claire Bombardier, Discontinuation of Biologic Therapy due to Lack/loss of Response and Adverse Events is Similar Between TNFi and non-TNFi Class: Results from a Real-World Rheumatoid Arthritis Cohort, Seminars in Arthritis and Rheumatism, 10.1016/j.semarthrit.2020.06.020, (2020).
- Brennan H. Baker, Haotian Wu, Hannah E. Laue, Amélie Boivin, Virginie Gillet, Marie-France Langlois, Jean-Philippe Bellenger, Andrea A. Baccarelli, Larissa Takser, Methylparaben in meconium and risk of maternal thyroid dysfunction, adverse birth outcomes, and Attention-Deficit Hyperactivity Disorder (ADHD), Environment International, 10.1016/j.envint.2020.105716, 139, (105716), (2020).
- Géric Maura, Marc Bardou, Cécile Billionnet, Alain Weill, Jérôme Drouin, Anke Neumann, Oral anticoagulants and risk of acute liver injury in patients with nonvalvular atrial fibrillation: a propensity-weighted nationwide cohort study, Scientific Reports, 10.1038/s41598-020-68304-8, 10, 1, (2020).
- Claude Samy Elayi, Azadeh Shohoudi, Erica Moodie, Farshid Etaee, Maya Guglin, Denis Roy, Paul Khairy, Digoxin, mortality, and cardiac hospitalizations in patients with atrial fibrillation and heart failure with reduced ejection fraction and atrial fibrillation: An AF-CHF analysis, International Journal of Cardiology, 10.1016/j.ijcard.2020.04.047, (2020).
- Samuel Z. Goldhaber, Walter Ageno, Ivan B. Casella, Kok Han Chee, Sebastian Schellong, Daniel E. Singer, Marc Desch, Paul A. Reilly, Esther Donado, Wenbo Tang, Isabelle Voccia, Sam Schulman, Profile of Patients Diagnosed with Acute Venous Thromboembolism in Routine Clinical Practice: The RE-COVERY DVT/PE™ Study, The American Journal of Medicine, 10.1016/j.amjmed.2020.03.036, (2020).
- Amy K. Rosen, Megan E. Vanneman, William J. O'Brien, Suzann Pershing, Todd H. Wagner, Erin Beilstein‐Wedel, Jeanie Lo, Qi Chen, Glenn C. Cockerham, Michael Shwartz, Comparing cataract surgery complication rates in veterans receiving VA and community care, Health Services Research, 10.1111/1475-6773.13320, 55, 5, (690-700), (2020).
- Joanne Salas, John E. Morley, Jeffrey F. Scherrer, James S. Floyd, Susan A. Farr, Max Zubatsky, Doug Barthold, Sascha Dublin, Risk of incident dementia following metformin initiation compared with noninitiation or delay of antidiabetic medication therapy, Pharmacoepidemiology and Drug Safety, 10.1002/pds.5014, 29, 6, (623-634), (2020).
- Jonathan D. Baghdadi, Robert H. Brook, Daniel Z. Uslan, Jack Needleman, Douglas S. Bell, William E. Cunningham, Mitchell D. Wong, Association of a Care Bundle for Early Sepsis Management With Mortality Among Patients With Hospital-Onset or Community-Onset Sepsis, JAMA Internal Medicine, 10.1001/jamainternmed.2020.0183, 180, 5, (707), (2020).
- Byeong Yeob Choi, Chen‐Pin Wang, Jonathan Gelfond, Machine learning outcome regression improves doubly robust estimation of average causal effects, Pharmacoepidemiology and Drug Safety, 10.1002/pds.5074, 29, 9, (1120-1133), (2020).
- Jose W. Valdez, Arthropods as vertebrate predators: A review of global patterns, Global Ecology and Biogeography, 10.1111/geb.13157, 29, 10, (1691-1703), (2020).
- Ádám Nagy, Enikő Holndonner-Kirst, Csaba Eke, Balázs Szécsi, András Szabó, Marie-Jo Plamondon, Levente Fazekas, Miklós Polos, Kálmán Benke, Zoltán Szabolcs, István Hartyánszky, Béla Merkely, János Gál, Andrea Székely, Perioperative Low Tetraiodothyronine Levels and Adverse Outcomes After Heart Transplantation: A Retrospective, Observational Study, Journal of Cardiothoracic and Vascular Anesthesia, 10.1053/j.jvca.2020.03.052, (2020).
- Jeffrey F. Scherrer, Joanne Salas, F. David Schneider, Matthew J. Friedman, Carissa van den Berk-Clark, Kathleen M. Chard, Sonya B. Norman, Patrick J. Lustman, Peter Tuerk, Paula P. Schnurr, Beth E. Cohen, PTSD improvement and incident cardiovascular disease in more than 1000 veterans, Journal of Psychosomatic Research, 10.1016/j.jpsychores.2020.110128, 134, (110128), (2020).
- Layla Parast, Beth Ann Griffin, Quantifying the bias due to observed individual confounders in causal treatment effect estimates, Statistics in Medicine, 10.1002/sim.8549, 39, 18, (2447-2476), (2020).
- Wael Sabbah, Gary D. Slade, Anne E. Sanders, Eduardo Bernabé, Denture wearing and mortality risk in edentulous American adults: A propensity score analysis, Journal of Dentistry, 10.1016/j.jdent.2020.103360, (103360), (2020).
- Michael J. Green, Linsay Gray, Helen Sweeting, Youth vaping and smoking and parental vaping: a panel survey, BMC Public Health, 10.1186/s12889-020-09228-w, 20, 1, (2020).
- Victoria Divino, Girishanthy Krishnarajah, Stephen I. Pelton, Joaquin Mould-Quevedo, Vamshi Ruthwik Anupindi, Mitch DeKoven, Maarten J. Postma, A real-world study evaluating the relative vaccine effectiveness of a cell-based quadrivalent influenza vaccine compared to egg-based quadrivalent influenza vaccine in the US during the 2017–18 influenza season, Vaccine, 10.1016/j.vaccine.2020.07.023, (2020).
- Nikhil P. Mankuzhy, Muayad F. Almahariq, Zaid A. Siddiqui, Andrew B. Thompson, Inga S. Grills, Thomas M. Guerrero, Kuei C. Lee, Craig W. Stevens, Thomas J. Quinn, The Role of Post-operative Radiation Therapy for pN2 Non-Small Cell Lung Cancer, Clinical Lung Cancer, 10.1016/j.cllc.2020.07.008, (2020).
- Kenichiro Ishida, Yusuke Katayama, Tetsuhisa Kitamura, Tomoya Hirose, Shunichiro Nakao, Yutaka Umemura, Takeyuki Kiguchi, Tasuku Matsuyama, Kosuke Kiyohara, Takeshi Shimazu, Mitsuo Ohnishi, Abdominal angiography is associated with reduced in-hospital mortality among pediatric patients with blunt splenic and hepatic injury: a propensity-score-matching study from the national trauma registry in Japan, Journal of Pediatric Surgery, 10.1016/j.jpedsurg.2020.07.029, (2020).
- Alexandru Costea, Laura Goldstein, Sonia Maccioni, Iftekhar Kalsekar, Rahul Khanna, Real-world outcomes comparison among adults with atrial fibrillation undergoing catheter ablation with a contact force porous tip catheter versus a second-generation cryoballoon catheter: a retrospective analysis of multihospital US database, BMJ Open, 10.1136/bmjopen-2019-035499, 10, 8, (e035499), (2020).
- Wanwan Qi, Xiaoyi Wang, Lu Gan, Yunhai Li, Hongyuan Li, Qiao Cheng, The effect of reduced RDI of chemotherapy on the outcome of breast cancer patients, Scientific Reports, 10.1038/s41598-020-70187-8, 10, 1, (2020).
- Ozan M. Demir, Enrico Poletti, Francesca Lombardo, Alessandra Laricchia, Alessandro Beneduce, Davide Maccagni, Alberto Cappelletti, Antonio Colombo, Barbara Bellini, Marco B. Ancona, Mauro Carlino, Alaide Chieffo, Matteo Montorfano, Lorenzo Azzalini, Rotational atherectomy-based percutaneous coronary intervention and the risk of contrast-induced nephropathy, Minerva Cardioangiologica, 10.23736/S0026-4725.20.05099-9, 68, 2, (2020).
- Alessio Signori, Francesco Saccà, Roberta Lanzillo, Giorgia Teresa Maniscalco, Elisabetta Signoriello, Anna Maria Repice, Pietro Annovazzi, Damiano Baroncini, Marinella Clerico, Eleonora Binello, Raffaella Cerqua, Giorgia Mataluni, Paola Perini, Simona Bonavita, Luigi Lavorgna, Ignazio Roberto Zarbo, Alice Laroni, Lorena Pareja-Gutierrez, Sara La Gioia, Barbara Frigeni, Valeria Barcella, Jessica Frau, Eleonora Cocco, Giuseppe Fenu, Valentina Torri Clerici, Arianna Sartori, Sarah Rasia, Cinzia Cordioli, Maria Laura Stromillo, Alessia Di Sapio, Simona Pontecorvo, Roberta Grasso, Stefania Barone, Caterina Barrilà, Cinzia Valeria Russo, Sabrina Esposito, Domenico Ippolito, Doriana Landi, Andrea Visconti, Maria Pia Sormani, Cladribine vs other drugs in MS, Neurology - Neuroimmunology Neuroinflammation, 10.1212/NXI.0000000000000878, 7, 6, (e878), (2020).
- Joy D. Scheidell, Lorna E. Thorpe, Adaora A. Adimora, Ellen C. Caniglia, Carl W. Lejuez, Andrea B. Troxel, Maria R. Khan, Perceived Stress, Sexually Transmitted Infection, and Pelvic Inflammatory Disease: Examination of Differences in Associations Among Black and White Women, Sexually Transmitted Diseases, 10.1097/OLQ.0000000000001232, 47, 9, (617-624), (2020).
- Amgad Mentias, Mary Vaughan Sarrazin, Marwan Saad, Sidakpal Panaich, Samir Kapadia, Phillip A. Horwitz, Saket Girotra, Long-Term Outcomes of Coronary Stenting With and Without Use of Intravascular Ultrasound, JACC: Cardiovascular Interventions, 10.1016/j.jcin.2020.04.052, 13, 16, (1880-1890), (2020).
- Zoe M. Weinstein, Debbie M. Cheng, Maria J. D’Amico, Leah S. Forman, Danny Regan, Alexandra Yurkovic, Jeffrey H. Samet, Alexander Y. Walley, Inpatient addiction consultation and post-discharge 30-day acute care utilization, Drug and Alcohol Dependence, 10.1016/j.drugalcdep.2020.108081, (108081), (2020).
- See more




