Q: "What if the batter had swung?"
"What-if" questions require us to think causally and imagine unobserved potential outcomes
Inability to observe all potential outcomes
However, using the gold standard RCT may not be possible in real world settings
How to estimate causal effects without randomization?
Utilize propensity score method (Rosenbaum and Rubin, 1983)
Propensity score = Probability of receiving exposure given set of confounders
Pr
Estimation
Bias Removal
High obesity rates of low-income 2-5 year olds in Los Angeles County (PHFE WIC, 2010)
Estimate causal effects of non-randomly assigned obesity interventions on WIC preschool-aged child obesity rates
Intervention Period: 2010-2016
Exposure:
Outcome:
# | Name | Group | n (%) |
---|---|---|---|
1 | Government Policies | Macro | 1 (3%) |
2 | Institutional Polices | Macro | 4 (12%) |
3 | Infrastructure Investments | Macro | 3 (9%) |
4 | Business Practices | Macro | 4 (12%) |
5 | Group Education | Micro | 21 (66%) |
6 | Counseling | Micro | 14 (44%) |
7 | Health Communication | Micro | 17 (53%) |
8 | Home Visitation | Micro | 8 (25%) |
9 | Screening and Referral | Micro | 14 (44%) |
SS\in[1,9] = strength score:
Estimated for each strategy and then summed by group, macro and micro
Adapted IDI construction example
Micro
Macro
C_{ijk}: k^{th} confounder of the j^{th} exposure for the i^{th} individual
Observed data
\begin{align} Y_{i}(\mathbf{d})\perp \!\!\! \perp\mathbf{D}_{i}\mid \mathcal{C} \quad \forall \quad \mathbf{d}\in\mathcal{D}. \end{align}
Replace \mathcal{C} with f_{\mathbf{D}\mid\mathcal{C}}
Difficult to rationalize
"Weak" refers to conditional independence for each level of exposure not joint distribution of exposures
\begin{align} 0<f_{\mathbf{D}\mid\mathcal{C}}(\mathbf{D}=\mathbf{d}\mid \mathcal{C})<1 \quad \forall \quad \mathbf{d}\in\mathcal{D} \end{align}
Grid or convex hull
Trimmed or untrimmed
The potential outcome of each unit does not depend on the exposure that other units receive and that there exists only one version of each exposure
Assumes there is no interference between units
No errors in defining the potential outcomes caused by multiple versions of the exposure
Y_{i} = Y_{i}(\mathbf{d})
\begin{align} f(\mathbf{D}|\mathbf{C}_{1},\dots,\mathbf{C}_{m}) \end{align}
\begin{align} w=\frac{f(\mathbf{D})}{f(\mathbf{D}|\mathbf{C}_{1},\dots,\mathbf{C}_{m})}, \end{align}
Returns consistent estimate of average dose-response function
Covariance between each exposure, D_{j} and confounder, C_{jk}, is zero
Weights are normalized and maintain marginal moments
Sample size : n=200
Repetitions: B=1000
Compare mvGPS to other univariate alternative methods
Simulate bivariate normal exposure
Effect of trimming weights (Kang and Schafer, 2007)
M1: No common confounding
M2: Partially common confounding
M3: Common confounding
American Community Survey
WIC Admin
NETS (National Establishment Time-Series)
Max Abs. Corr. | Avg. Abs. Corr. | ESS | Method |
---|---|---|---|
0.12 | 0.04 | 637 | mvGPS |
0.18 | 0.08 | 580 | CBGPS (Macro) |
0.22 | 0.07 | 659 | PS (Macro) |
0.25 | 0.08 | 541 | Entropy (Macro) |
0.35 | 0.12 | 779 | Entropy (Micro) |
0.37 | 0.14 | 828 | PS (Micro) |
0.49 | 0.16 | 679 | CBGPS (Micro) |
0.41 | 0.19 | 1079 | Unweighted |
Balance
Significant imbalance prior to weighting
ESS for all methods are reduced but power is still reasonably high
Dose-Response
Unweighted estimate was monotonic plane where effectiveness increased as you increased either exposure dose
mvGPS estimate was a saddle with optimal effectiveness using high micro and low macro
Doyle, W. R. (2011). "Effect of increased academic momentum on transfer rates: an application of the generalized propensity score". In: Economics of Education Review 30.1, pp. 191-200.
Flores, C. A., A. Flores-Lagunes, A. Gonzalez, and T. C. Neumann (2012). "Estimating the effects of length of exposure to instruction in a training program: The case of job corps". In: Review of Economics and Statistics 94.1, pp. 153-171.
Fong, C., C. Hazlett, and K. Imai (2018). "Covariate balancing propensity score for a continuous treatment: application to the efficacy of political advertisements". In: Annals of Applied Statistics 12.1, pp. 156-177.
Hirano, K. and G. W. Imbens (2004). "The propensity score with continuous treatments". In: Applied Bayesian Modeling and Causal Inference from Incomplete-Data Perspectives. Ed. by A. Gelman and X. Meng. Hoboken, N.J.: John Wiley & Sons, pp. 73-84.
Holland, P. W. (1986). "Statistics and Causal Inference". In: Journal of the American Statistical Association 81.396, pp. 945-960.
Imai, K. and D. A. Van Dyk (2004). "Causal inference with general treatment regimes: Generalizing the propensity score". In: Journal of the American Statistical Association 99.467, pp. 854-866.
Imbens, G. W. (2000). "The role of the propensity score in estimating dose-response functions". In: Biometrika 87.3, pp. 706-710.
Jiang, M. and E. M. Foster (2013). "Duration of breastfeeding and childhood obesity: a generalized propensity score approach". In: Health Services Research 48.2.1, pp. 628-651.
Kang, J. D. and J. L. Schafer (2007). "Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data". In: Statistical Science 22.4, pp. 523-539.
Kluve, J., H. Schneider, A. Uhlendorff, and Z. Zhao (2012). "Evaluating continuous training programmes by using the generalized propensity score". In: Journal of the Royal Statistical Society: Series A 175.2, pp. 587-617.
Kreif, N., R. Grieve, I. Díaz, and D. Harrison (2015). "Evaluation of the effect of a continuous treatment: A machine learning approach with an application to treatment for traumatic brain injury". In: Health Economics 24.9, pp. 1213-1228.
Pearl, J. and D. Mackenzie (2018). The Book of Why: The New Science of Cause and Effect. New York: Basic Books.
PHFE WIC (2010). WIC data 2003-2009: A report on low income families with young children in Los Angeles County. Technical Report. Special Supplemental Nutrition Assistance Program for Women, Infants and Children.
Rosenbaum, P. R. and D. B. Rubin (1983). "The central role of the propensity score in observational studies for causal effects". In: Biometrika 70.1, pp. 41-55.
Sanders, J. M., M. L. Monogue, T. Z. Jodlowski, and J. B. Cutrell (2020). "Pharmacologic Treatments for Coronavirus Disease 2019 (COVID-19): A Review". In: JAMA 323.18, pp. 1824-1836.
Tübbicke, S. (2020). "Entropy Balancing for Continuous Treatments". In: arXiv preprint arXiv:2001.06281. arXiv: 2001.06281 [econ.EM].
Vegetabile, B. G., B. A. Griffin, D. L. Coffman, M. Cefalu, et al. (2020). "Nonparametric Estimation of Population Average Dose-Response Curves using Entropy Balancing Weights for Continuous Exposures". In: arXiv preprint arXiv:2003.02938. arXiv: 2003.02938 [stat.ME].
Wang, M. C., C. M. Crespi, L. H. Jiang, T. Nobari, et al. (2018). "Developing an index of dose of exposure to early childhood obesity community interventions". In: Preventive Medicine 111, pp. 135-141.
Zhu, Y., D. L. Coffman, and D. Ghosh (2015). "A boosting algorithm for estimating generalized propensity scores with continuous treatments". In: Journal of Causal Inference 3.1, pp. 25-40.
Radius truncation
Micro exposure inverse distance weighting
Overlapping catchment area adjustment
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
o | Tile View: Overview of Slides |
Esc | Back to slideshow |
Q: "What if the batter had swung?"
"What-if" questions require us to think causally and imagine unobserved potential outcomes
Inability to observe all potential outcomes
However, using the gold standard RCT may not be possible in real world settings
How to estimate causal effects without randomization?
Utilize propensity score method (Rosenbaum and Rubin, 1983)
Propensity score = Probability of receiving exposure given set of confounders
\begin{align} \Pr(D=1\mid C) \end{align}
Estimation
Bias Removal
High obesity rates of low-income 2-5 year olds in Los Angeles County (PHFE WIC, 2010)
Estimate causal effects of non-randomly assigned obesity interventions on WIC preschool-aged child obesity rates
Intervention Period: 2010-2016
Exposure:
Outcome:
# | Name | Group | n (%) |
---|---|---|---|
1 | Government Policies | Macro | 1 (3%) |
2 | Institutional Polices | Macro | 4 (12%) |
3 | Infrastructure Investments | Macro | 3 (9%) |
4 | Business Practices | Macro | 4 (12%) |
5 | Group Education | Micro | 21 (66%) |
6 | Counseling | Micro | 14 (44%) |
7 | Health Communication | Micro | 17 (53%) |
8 | Home Visitation | Micro | 8 (25%) |
9 | Screening and Referral | Micro | 14 (44%) |
SS\in[1,9] = strength score:
Estimated for each strategy and then summed by group, macro and micro
Adapted IDI construction example
Micro
Macro
C_{ijk}: k^{th} confounder of the j^{th} exposure for the i^{th} individual
Observed data
\begin{align} Y_{i}(\mathbf{d})\perp \!\!\! \perp\mathbf{D}_{i}\mid \mathcal{C} \quad \forall \quad \mathbf{d}\in\mathcal{D}. \end{align}
Replace \mathcal{C} with f_{\mathbf{D}\mid\mathcal{C}}
Difficult to rationalize
"Weak" refers to conditional independence for each level of exposure not joint distribution of exposures
\begin{align} 0<f_{\mathbf{D}\mid\mathcal{C}}(\mathbf{D}=\mathbf{d}\mid \mathcal{C})<1 \quad \forall \quad \mathbf{d}\in\mathcal{D} \end{align}
Grid or convex hull
Trimmed or untrimmed
The potential outcome of each unit does not depend on the exposure that other units receive and that there exists only one version of each exposure
Assumes there is no interference between units
No errors in defining the potential outcomes caused by multiple versions of the exposure
Y_{i} = Y_{i}(\mathbf{d})
\begin{align} f(\mathbf{D}|\mathbf{C}_{1},\dots,\mathbf{C}_{m}) \end{align}
\begin{align} w=\frac{f(\mathbf{D})}{f(\mathbf{D}|\mathbf{C}_{1},\dots,\mathbf{C}_{m})}, \end{align}
Returns consistent estimate of average dose-response function
Covariance between each exposure, D_{j} and confounder, C_{jk}, is zero
Weights are normalized and maintain marginal moments
Sample size : n=200
Repetitions: B=1000
Compare mvGPS to other univariate alternative methods
Simulate bivariate normal exposure
Effect of trimming weights (Kang and Schafer, 2007)
M1: No common confounding
M2: Partially common confounding
M3: Common confounding
American Community Survey
WIC Admin
NETS (National Establishment Time-Series)
Max Abs. Corr. | Avg. Abs. Corr. | ESS | Method |
---|---|---|---|
0.12 | 0.04 | 637 | mvGPS |
0.18 | 0.08 | 580 | CBGPS (Macro) |
0.22 | 0.07 | 659 | PS (Macro) |
0.25 | 0.08 | 541 | Entropy (Macro) |
0.35 | 0.12 | 779 | Entropy (Micro) |
0.37 | 0.14 | 828 | PS (Micro) |
0.49 | 0.16 | 679 | CBGPS (Micro) |
0.41 | 0.19 | 1079 | Unweighted |
Balance
Significant imbalance prior to weighting
ESS for all methods are reduced but power is still reasonably high
Dose-Response
Unweighted estimate was monotonic plane where effectiveness increased as you increased either exposure dose
mvGPS estimate was a saddle with optimal effectiveness using high micro and low macro
Doyle, W. R. (2011). "Effect of increased academic momentum on transfer rates: an application of the generalized propensity score". In: Economics of Education Review 30.1, pp. 191-200.
Flores, C. A., A. Flores-Lagunes, A. Gonzalez, and T. C. Neumann (2012). "Estimating the effects of length of exposure to instruction in a training program: The case of job corps". In: Review of Economics and Statistics 94.1, pp. 153-171.
Fong, C., C. Hazlett, and K. Imai (2018). "Covariate balancing propensity score for a continuous treatment: application to the efficacy of political advertisements". In: Annals of Applied Statistics 12.1, pp. 156-177.
Hirano, K. and G. W. Imbens (2004). "The propensity score with continuous treatments". In: Applied Bayesian Modeling and Causal Inference from Incomplete-Data Perspectives. Ed. by A. Gelman and X. Meng. Hoboken, N.J.: John Wiley & Sons, pp. 73-84.
Holland, P. W. (1986). "Statistics and Causal Inference". In: Journal of the American Statistical Association 81.396, pp. 945-960.
Imai, K. and D. A. Van Dyk (2004). "Causal inference with general treatment regimes: Generalizing the propensity score". In: Journal of the American Statistical Association 99.467, pp. 854-866.
Imbens, G. W. (2000). "The role of the propensity score in estimating dose-response functions". In: Biometrika 87.3, pp. 706-710.
Jiang, M. and E. M. Foster (2013). "Duration of breastfeeding and childhood obesity: a generalized propensity score approach". In: Health Services Research 48.2.1, pp. 628-651.
Kang, J. D. and J. L. Schafer (2007). "Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data". In: Statistical Science 22.4, pp. 523-539.
Kluve, J., H. Schneider, A. Uhlendorff, and Z. Zhao (2012). "Evaluating continuous training programmes by using the generalized propensity score". In: Journal of the Royal Statistical Society: Series A 175.2, pp. 587-617.
Kreif, N., R. Grieve, I. Díaz, and D. Harrison (2015). "Evaluation of the effect of a continuous treatment: A machine learning approach with an application to treatment for traumatic brain injury". In: Health Economics 24.9, pp. 1213-1228.
Pearl, J. and D. Mackenzie (2018). The Book of Why: The New Science of Cause and Effect. New York: Basic Books.
PHFE WIC (2010). WIC data 2003-2009: A report on low income families with young children in Los Angeles County. Technical Report. Special Supplemental Nutrition Assistance Program for Women, Infants and Children.
Rosenbaum, P. R. and D. B. Rubin (1983). "The central role of the propensity score in observational studies for causal effects". In: Biometrika 70.1, pp. 41-55.
Sanders, J. M., M. L. Monogue, T. Z. Jodlowski, and J. B. Cutrell (2020). "Pharmacologic Treatments for Coronavirus Disease 2019 (COVID-19): A Review". In: JAMA 323.18, pp. 1824-1836.
Tübbicke, S. (2020). "Entropy Balancing for Continuous Treatments". In: arXiv preprint arXiv:2001.06281. arXiv: 2001.06281 [econ.EM].
Vegetabile, B. G., B. A. Griffin, D. L. Coffman, M. Cefalu, et al. (2020). "Nonparametric Estimation of Population Average Dose-Response Curves using Entropy Balancing Weights for Continuous Exposures". In: arXiv preprint arXiv:2003.02938. arXiv: 2003.02938 [stat.ME].
Wang, M. C., C. M. Crespi, L. H. Jiang, T. Nobari, et al. (2018). "Developing an index of dose of exposure to early childhood obesity community interventions". In: Preventive Medicine 111, pp. 135-141.
Zhu, Y., D. L. Coffman, and D. Ghosh (2015). "A boosting algorithm for estimating generalized propensity scores with continuous treatments". In: Journal of Causal Inference 3.1, pp. 25-40.
Radius truncation
Micro exposure inverse distance weighting
Overlapping catchment area adjustment