Smoke and mirrors in causal downstream tasks
Cadei R, Lindorfer L, Cremer S, Schmid C, Locatello F. Smoke and mirrors in causal downstream tasks. arXiv, 2405.17151.
Download (ext.)
https://doi.org/10.48550/arXiv.2405.17151
[Preprint]
Preprint
| Submitted
| English
Author
Cadei, RiccardoISTA;
Lindorfer, LukasISTA;
Cremer, SylviaISTA ;
Schmid, Cordelia;
Locatello, FrancescoISTA
Corresponding author has ISTA affiliation
Department
Abstract
Machine Learning and AI have the potential to transform data-driven
scientific discovery, enabling accurate predictions for several scientific
phenomena. As many scientific questions are inherently causal, this paper looks
at the causal inference task of treatment effect estimation, where the outcome
of interest is recorded in high-dimensional observations in a Randomized
Controlled Trial (RCT). Despite being the simplest possible causal setting and
a perfect fit for deep learning, we theoretically find that many common choices
in the literature may lead to biased estimates. To test the practical impact of
these considerations, we recorded ISTAnt, the first real-world benchmark for
causal inference downstream tasks on high-dimensional observations as an RCT
studying how garden ants (Lasius neglectus) respond to microparticles applied
onto their colony members by hygienic grooming. Comparing 6 480 models
fine-tuned from state-of-the-art visual backbones, we find that the sampling
and modeling choices significantly affect the accuracy of the causal estimate,
and that classification accuracy is not a proxy thereof. We further validated
the analysis, repeating it on a synthetically generated visual data set
controlling the causal model. Our results suggest that future benchmarks should
carefully consider real downstream scientific questions, especially causal
ones. Further, we highlight guidelines for representation learning methods to
help answer causal questions in the sciences.
Publishing Year
Date Published
2024-05-27
Journal Title
arXiv
Acknowledgement
We thank Piersilvio De Bartolomeis, and the full Causal Learning and Artificial Intelligence (CLAI) group at ISTA for the extremely helpful discussions. Riccardo Cadei was supported by a Google Research Scholar Award and a Google Initiated Gift to Francesco Locatello. We thank the Social Immunity team at ISTA particularly Michaela Hönigsberger and Wilfrid Jean Louis, for supporting the ecological experiment and Farnaz Beikzadeh Abbasi, Luisa Fiebig and Martin Estermann for annotating ant behavior in ISTAnt.
Article Number
2405.17151
IST-REx-ID
Cite this
Cadei R, Lindorfer L, Cremer S, Schmid C, Locatello F. Smoke and mirrors in causal downstream tasks. arXiv. doi:10.48550/arXiv.2405.17151
Cadei, R., Lindorfer, L., Cremer, S., Schmid, C., & Locatello, F. (n.d.). Smoke and mirrors in causal downstream tasks. arXiv. https://doi.org/10.48550/arXiv.2405.17151
Cadei, Riccardo, Lukas Lindorfer, Sylvia Cremer, Cordelia Schmid, and Francesco Locatello. “Smoke and Mirrors in Causal Downstream Tasks.” ArXiv, n.d. https://doi.org/10.48550/arXiv.2405.17151.
R. Cadei, L. Lindorfer, S. Cremer, C. Schmid, and F. Locatello, “Smoke and mirrors in causal downstream tasks,” arXiv. .
Cadei R, Lindorfer L, Cremer S, Schmid C, Locatello F. Smoke and mirrors in causal downstream tasks. arXiv, 2405.17151.
Cadei, Riccardo, et al. “Smoke and Mirrors in Causal Downstream Tasks.” ArXiv, 2405.17151, doi:10.48550/arXiv.2405.17151.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Link(s) to Main File(s)
Access Level
Open Access
Export
Marked PublicationsOpen Data ISTA Research Explorer
Sources
arXiv 2405.17151