BACKGROUND: When exposure is infrequent, propensity-score matching results in reduced precision because it discards a large proportion of unexposed patients. To our knowledge, the relative performance of propensity-score stratification in these circumstances has not been examined.
METHODS: Using an empirical example of the association of first trimester statin exposure (prevalence = 0.04%) with risk of congenital malformations and 1,000 simulated cohorts (n = 20,000) with eight combinations of exposure prevalence (0.5%, 1%, 5%, 10%) and outcome risk (3.5%, 10%), we compared four propensity-score-based approaches to confounding adjustment: (1) matching (1:1, 1:5, full), (2) stratification in 10, 50, and 100 strata by entire cohort propensity-score distribution, (3) stratification in 10, 50, and 100 strata by exposed group propensity-score distribution, (4) standardized mortality ratio (SMR) weighting. Weighted generalized linear models were used to derive effect estimates after weighting unexposed according to the distribution of the exposed in their stratum for the stratification approaches.
RESULTS: In the empirical example, propensity-score stratification (cohort) approaches resulted in greater imbalances in covariate distributions between statin-exposed and unexposed compared with propensity-score stratification (exposed) and matching. In simulations, propensity-score stratification (exposed) resulted in smaller relative bias than the cohort approach with 10 and 50 strata, and greater precision than matching and SMR weighting at 0.5% and 1% exposure prevalence, but similar performance at 5% and 10%.
CONCLUSION: For exposures with prevalence under 5%, propensity-score stratification with fine strata, based on the exposed group propensity-score distribution, produced the best results. For more common exposures, all approaches were equivalent.