Causal inference and counterfactual prediction in machine learning for actionable healthcare



Prosperi, Mattia, Guo, Yi, Sperrin, Matt, Koopman, James S, Min, Jae S, He, Xing, Rich, Shannan, Wang, Mo, Buchan, Iain E ORCID: 0000-0003-3392-1650 and Bian, Jiang
(2020) Causal inference and counterfactual prediction in machine learning for actionable healthcare. Nature Machine Intelligence, 2 (7). pp. 369-375.

[img] Text
causal_ai_in_healthcare_final.pdf - Author Accepted Manuscript

Download (327kB) | Preview

Abstract

Big data, high-performance computing, and (deep) machine learning are increasingly becoming key to precision medicine—from identifying disease risks and taking preventive measures, to making diagnoses and personalizing treatment for individuals. Precision medicine, however, is not only about predicting risks and outcomes, but also about weighing interventions. Interventional clinical predictive models require the correct specification of cause and effect, and the calculation of so-called counterfactuals, that is, alternative scenarios. In biomedical research, observational studies are commonly affected by confounding and selection bias. Without robust assumptions, often requiring a priori domain knowledge, causal inference is not feasible. Data-driven prediction models are often mistakenly used to draw causal effects, but neither their parameters nor their predictions necessarily have a causal interpretation. Therefore, the premise that data-driven prediction models lead to trustable decisions/interventions for precision medicine is questionable. When pursuing intervention modelling, the bio-health informatics community needs to employ causal approaches and learn causal structures. Here we discuss how target trials (algorithmic emulation of randomized studies), transportability (the licence to transfer causal effects from one population to another) and prediction invariance (where a true causal model is contained in the set of all prediction models whose accuracy does not vary across different settings) are linchpins to developing and testing intervention models.

Item Type: Article
Uncontrolled Keywords: Education, Machine learning, Outcomes research, Research management
Depositing User: Symplectic Admin
Date Deposited: 05 Oct 2020 07:51
Last Modified: 15 Mar 2024 15:22
DOI: 10.1038/s42256-020-0197-y
Related URLs:
URI: https://livrepository.liverpool.ac.uk/id/eprint/3103363