Identifiable object-centric representation learning via probabilistic slot attention

Kori A, Locatello F, Santhirasekaram A, Toni F, Glocker B, De Sousa Ribeiro F. 2024. Identifiable object-centric representation learning via probabilistic slot attention. 38th Conference on Neural Information Processing Systems. NeurIPS: Neural Information Processing Systems, NeurIPS, vol. 38.

Download
OA 2024_NeurIPS_Kori.pdf 6.94 MB [Published Version]
Conference Paper | Published | English
Author
Kori, Avinash; Locatello, Francesco (ISTA); Santhirasekaram, Ainkaran; Toni, Francesca; Glocker, Ben; De Sousa Ribeiro, Fabio
Department
Series Title
NeurIPS
Abstract
Learning modular object-centric representations is crucial for systematic generalization. Existing methods show promising object-binding capabilities empirically, but theoretical identifiability guarantees remain relatively underdeveloped. Understanding when object-centric representations can theoretically be identified is crucial for scaling slot-based methods to high-dimensional images with correctness guarantees. To that end, we propose a probabilistic slot-attention algorithm that imposes an aggregate mixture prior over object-centric slot representations, thereby providing slot identifiability guarantees without supervision, up to an equivalence relation. We provide empirical verification of our theoretical identifiability result using both simple 2-dimensional data and high-resolution imaging datasets.
Publishing Year
2024
Date Published
2024-12-01
Proceedings Title
38th Conference on Neural Information Processing Systems
Publisher
Curran Associates
Acknowledgement
A. Kori is supported by UKRI (grant number EP/S023356/1), as part of the UKRI Centre for Doctoral Training in Safe and Trusted AI. B. Glocker and F.D.S. Ribeiro acknowledge the support of the UKRI AI programme, and the Engineering and Physical Sciences Research Council, for CHAI - EPSRC Causality in Healthcare AI Hub (grant number EP/Y028856/1).
Volume
38
Conference
NeurIPS: Neural Information Processing Systems
Conference Location
Vancouver, Canada
Conference Date
2024-12-16 – 2024-12-16
IST-REx-ID

Cite this

Kori A, Locatello F, Santhirasekaram A, Toni F, Glocker B, De Sousa Ribeiro F. Identifiable object-centric representation learning via probabilistic slot attention. In: 38th Conference on Neural Information Processing Systems. Vol 38. Curran Associates; 2024.
Kori, A., Locatello, F., Santhirasekaram, A., Toni, F., Glocker, B., & De Sousa Ribeiro, F. (2024). Identifiable object-centric representation learning via probabilistic slot attention. In 38th Conference on Neural Information Processing Systems (Vol. 38). Vancouver, Canada: Curran Associates.
Kori, Avinash, Francesco Locatello, Ainkaran Santhirasekaram, Francesca Toni, Ben Glocker, and Fabio De Sousa Ribeiro. “Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention.” In 38th Conference on Neural Information Processing Systems, Vol. 38. Curran Associates, 2024.
A. Kori, F. Locatello, A. Santhirasekaram, F. Toni, B. Glocker, and F. De Sousa Ribeiro, “Identifiable object-centric representation learning via probabilistic slot attention,” in 38th Conference on Neural Information Processing Systems, Vancouver, Canada, 2024, vol. 38.
Kori A, Locatello F, Santhirasekaram A, Toni F, Glocker B, De Sousa Ribeiro F. 2024. Identifiable object-centric representation learning via probabilistic slot attention. 38th Conference on Neural Information Processing Systems. NeurIPS: Neural Information Processing Systems, NeurIPS, vol. 38.
Kori, Avinash, et al. “Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention.” 38th Conference on Neural Information Processing Systems, vol. 38, Curran Associates, 2024.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Main File(s)
File Name
2024_NeurIPS_Kori.pdf
Access Level
OA Open Access
Date Uploaded
2025-02-05
MD5 Checksum
d27b3c7102adc28e798fe41001f0b919


Sources

arXiv 2406.07141
