Demystifying amortized causal discovery with transformers
Montagna F, Cairney-Leeming MT, Sridhar D, Locatello F. 2025. Demystifying amortized causal discovery with transformers. Transactions on Machine Learning Research.
Journal Article | Published | English
Scopus indexed
Author
Montagna, Francesco (ISTA);
Cairney-Leeming, Maximilian T (ISTA);
Sridhar, Dhanya;
Locatello, Francesco (ISTA)
Corresponding author has ISTA affiliation
Department
Series Title
TMLR
Abstract
Supervised learning for causal discovery from observational data often achieves competitive performance despite seemingly avoiding the explicit assumptions that traditional methods require for identifiability. In this work, we analyze CSIvA (Ke et al., 2023), a transformer architecture for amortized inference that promises to train on synthetic data and transfer to real data, on bivariate causal models. First, we bridge the gap with identifiability theory, showing that the training distribution implicitly defines a prior on the causal model of the test observations: consistent with classical approaches, good performance is achieved when we have a good prior on the test data and the underlying model is identifiable. Second, we find that CSIvA cannot generalize to classes of causal models unseen during training: to overcome this limitation, we theoretically and empirically analyze when training CSIvA on datasets generated by multiple identifiable causal models with different structural assumptions improves its generalization at test time. Overall, we find that amortized causal discovery still adheres to identifiability theory, contradicting the previous hypothesis from Lopez-Paz et al. (2015) that supervised learning methods could overcome its restrictions.
Publishing Year
2025
Date Published
2025-12-18
Journal Title
Transactions on Machine Learning Research
Publisher
ML Research Press
eISSN
IST-REx-ID
Cite this
Montagna F, Cairney-Leeming MT, Sridhar D, Locatello F. Demystifying amortized causal discovery with transformers. Transactions on Machine Learning Research. 2025.
Montagna, F., Cairney-Leeming, M. T., Sridhar, D., & Locatello, F. (2025). Demystifying amortized causal discovery with transformers. Transactions on Machine Learning Research. ML Research Press.
Montagna, Francesco, Maximilian T Cairney-Leeming, Dhanya Sridhar, and Francesco Locatello. “Demystifying Amortized Causal Discovery with Transformers.” Transactions on Machine Learning Research. ML Research Press, 2025.
F. Montagna, M. T. Cairney-Leeming, D. Sridhar, and F. Locatello, “Demystifying amortized causal discovery with transformers,” Transactions on Machine Learning Research. ML Research Press, 2025.
Montagna, Francesco, et al. “Demystifying Amortized Causal Discovery with Transformers.” Transactions on Machine Learning Research, ML Research Press, 2025.
All files available under the following license(s):
Creative Commons Attribution 4.0 International Public License (CC BY 4.0)
Main File(s)
File Name
2025_PMLR_Montagna.pdf
1.03 MB
Access Level
Open Access
Date Uploaded
2026-01-05
MD5 Checksum
968c471bb1f682cf823b2d4cadea8a3f
Sources
arXiv 2405.16924
