Smoke and mirrors in causal downstream tasks

Cadei R, Lindorfer L, Cremer S, Schmid C, Locatello F. 2024. Smoke and mirrors in causal downstream tasks. ICML 2024 Workshop AI4Science. ICML: International Conference on Machine Learning vol. 38.

Download
OA 2024_ICML_Cadei.pdf 4.45 MB [Published Version]
Conference Paper | Published | English

Scopus indexed

Corresponding author has ISTA affiliation

Abstract
Machine Learning and AI have the potential to transform data-driven scientific discovery, enabling accurate predictions for several scientific phenomena. As many scientific questions are inherently causal, this paper looks at the causal inference task of treatment effect estimation, where the outcome of interest is recorded in high-dimensional observations in a Randomized Controlled Trial (RCT). Despite being the simplest possible causal setting and a perfect fit for deep learning, we theoretically find that many common choices in the literature may lead to biased estimates. To test the practical impact of these considerations, we recorded ISTAnt, the first real-world benchmark for causal inference downstream tasks on high-dimensional observations as an RCT studying how garden ants (Lasius neglectus) respond to microparticles applied onto their colony members by hygienic grooming. Comparing 6 480 models fine-tuned from state-of-the-art visual backbones, we find that the sampling and modeling choices significantly affect the accuracy of the causal estimate, and that classification accuracy is not a proxy thereof. We further validated the analysis, repeating it on a synthetically generated visual data set controlling the causal model. Our results suggest that future benchmarks should carefully consider real downstream scientific questions, especially causal ones. Further, we highlight guidelines for representation learning methods to help answer causal questions in the sciences.
Publishing Year
Date Published
2024-09-25
Proceedings Title
ICML 2024 Workshop AI4Science
Publisher
Curran Associates
Acknowledgement
We thank Piersilvio De Bartolomeis, and the full Causal Learning and Artificial Intelligence (CLAI) group at ISTA for the extremely helpful discussions. Riccardo Cadei was supported by a Google Research Scholar Award and a Google Initiated Gift to Francesco Locatello. We thank the Social Immunity team at ISTA particularly Michaela Hönigsberger and Wilfrid Jean Louis, for supporting the ecological experiment and Farnaz Beikzadeh Abbasi, Luisa Fiebig and Martin Estermann for annotating ant behavior in ISTAnt.
Volume
38
Conference
ICML: International Conference on Machine Learning
Conference Date
2024-07-26 – 2024-07-26
IST-REx-ID

Cite this

Cadei R, Lindorfer L, Cremer S, Schmid C, Locatello F. Smoke and mirrors in causal downstream tasks. In: ICML 2024 Workshop AI4Science. Vol 38. Curran Associates; 2024.
Cadei, R., Lindorfer, L., Cremer, S., Schmid, C., & Locatello, F. (2024). Smoke and mirrors in causal downstream tasks. In ICML 2024 Workshop AI4Science (Vol. 38). Curran Associates.
Cadei, Riccardo, Lukas Lindorfer, Sylvia Cremer, Cordelia Schmid, and Francesco Locatello. “Smoke and Mirrors in Causal Downstream Tasks.” In ICML 2024 Workshop AI4Science, Vol. 38. Curran Associates, 2024.
R. Cadei, L. Lindorfer, S. Cremer, C. Schmid, and F. Locatello, “Smoke and mirrors in causal downstream tasks,” in ICML 2024 Workshop AI4Science, 2024, vol. 38.
Cadei R, Lindorfer L, Cremer S, Schmid C, Locatello F. 2024. Smoke and mirrors in causal downstream tasks. ICML 2024 Workshop AI4Science. ICML: International Conference on Machine Learning vol. 38.
Cadei, Riccardo, et al. “Smoke and Mirrors in Causal Downstream Tasks.” ICML 2024 Workshop AI4Science, vol. 38, Curran Associates, 2024.
All files available under the following license(s):
Creative Commons Attribution 4.0 International Public License (CC-BY 4.0):
Main File(s)
File Name
Access Level
OA Open Access
Date Uploaded
2025-01-27
MD5 Checksum
beedf05388bbdb7ddda81ec3d5ec7026


Material in ISTA:
Research Data

Export

Marked Publications

Open Data ISTA Research Explorer

Sources

arXiv 2405.17151

Search this title in

Google Scholar