Please note that ISTA Research Explorer no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

82 Publications


2025 | Published | Conference Paper | IST-REx-ID: 20033 | OA
High-dimensional analysis of knowledge distillation: Weak-to-Strong generalization and scaling laws
M. Emrullah Ildiz, H.A. Gozeten, E.O. Taga, M. Mondelli, S. Oymak, in:, 13th International Conference on Learning Representations, ICLR, 2025, pp. 2967–3006.
[Published Version] View | Files available | arXiv
 

2025 | Published | Conference Paper | IST-REx-ID: 20035 | OA
Wide neural networks trained with weight decay provably exhibit neural collapse
A. Jacot, P. Súkeník, Z. Wang, M. Mondelli, in:, 13th International Conference on Learning Representations, ICLR, 2025, pp. 1905–1931.
[Published Version] View | Files available | arXiv
 

2025 | Epub ahead of print | Journal Article | IST-REx-ID: 20081 | OA
Sibson α-mutual information and its variational representations
A.R. Esposito, M. Gastpar, I. Issa, IEEE Transactions on Information Theory (2025).
[Preprint] View | DOI | Download Preprint (ext.) | arXiv
 

2025 | Published | Conference Paper | IST-REx-ID: 20300 | OA
Learning Pareto manifolds in high dimensions: How can regularization help?
T. Wegel, F. Kovačević, A. Ţifrea, F. Yang, in:, The 28th International Conference on Artificial Intelligence and Statistics, ML Research Press, 2025, pp. 4591–4599.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2025 | Published | Conference Paper | IST-REx-ID: 20667
Mean estimation in high-dimensional binary timeinhomogeneous Markov Gaussian mixture models
A. El Latif Kadry, Y. Zhang, N. Weinberger, in:, 2025 IEEE International Symposium on Information Theory Proceedings, IEEE, 2025.
View | DOI
 

2025 | Published | Journal Article | IST-REx-ID: 20734 | OA | PlanS
Spectral estimators for structured generalized linear models via approximate message passing
Y. Zhang, H.C. Ji, R. Venkataramanan, M. Mondelli, Mathematical Statistics and Learning 8 (2025) 193–304.
[Published Version] View | Files available | DOI
 

2025 | Published | Journal Article | IST-REx-ID: 18986 | OA
Information limits and Thouless-Anderson-Palmer equations for spiked matrix models with structured noise
J. Barbier, F. Camilli, Y. Xu, M. Mondelli, Physical Review Research 7 (2025).
[Published Version] View | Files available | DOI | arXiv
 

2025 | Published | Journal Article | IST-REx-ID: 19065 | OA | PlanS
Efficient identification of wide shallow neural networks with biases
M. Fornasier, T. Klock, M. Mondelli, M. Rauchensteiner, Applied and Computational Harmonic Analysis 77 (2025).
[Published Version] View | Files available | DOI | WoS
 

2025 | Published | Conference Paper | IST-REx-ID: 19281 | OA
Tight bounds on list-decodable and list-recoverable zero-rate codes
N. Resch, C. Yuan, Y. Zhang, in:, 16th Innovations in Theoretical Computer Science Conference, Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2025.
[Published Version] View | Files available | DOI | WoS | arXiv
 

2025 | Published | Journal Article | IST-REx-ID: 19627 | OA
Privacy for free in the overparameterized regime
S. Bombari, M. Mondelli, Proceedings of the National Academy of Sciences 122 (2025).
[Published Version] View | Files available | DOI | WoS | PubMed | Europe PMC | arXiv
 

2025 | Published | Conference Paper | IST-REx-ID: 21324 | OA
Spurious correlations in high dimensional regression: The roles of regularization, simplicity bias and over-parameterization
S. Bombari, M. Mondelli, in:, Proceedings of the 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 4839–4873.
[Published Version] View | Files available | arXiv
 

2025 | Published | Conference Paper | IST-REx-ID: 21325 | OA
Test-time training provably improves transformers as in-context learners
H.A. Gozeten, M.E. Ildiz, X. Zhang, M. Soltanolkotabi, M. Mondelli, S. Oymak, in:, Proceedings of the 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 20266–20295.
[Published Version] View | Files available | PubMed | Europe PMC
 

2025 | Published | Conference Paper | IST-REx-ID: 21326 | OA
Neural collapse beyond the unconstrained features model: Landscape, dynamics, and generalization in the mean-field regime
D. Wu, M. Mondelli, in:, Proceedings of the 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 67499–67536.
[Published Version] View | Files available | arXiv
 

2025 | Published | Conference Paper | IST-REx-ID: 21328 | OA
Spectral estimators for multi-index models: Precise asymptotics and optimal weak recovery
F. Kovačević, Z. Yihan, M. Mondelli, in:, Proceedings of 38th Conference on Learning Theory, ML Research Press, 2025, pp. 3354–3404.
[Published Version] View | Files available | arXiv
 

2024 | Published | Journal Article | IST-REx-ID: 14665 | OA
Multiple packing: Lower bounds via error exponents
Y. Zhang, S. Vatedka, IEEE Transactions on Information Theory 70 (2024) 1008–1039.
[Preprint] View | DOI | Download Preprint (ext.) | WoS | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 17893 | OA
Properties of the strong data processing constant for Rényi divergence
L. Jin, A.R. Esposito, M. Gastpar, in:, Proceedings of the 2024 IEEE International Symposium on Information Theory, Institute of Electrical and Electronics Engineers, 2024, pp. 3178–3183.
[Preprint] View | DOI | Download Preprint (ext.) | WoS | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 17894
Variational characterizations of Sibson's α-mutual information
A.R. Esposito, M. Gastpar, I. Issa, in:, Proceedings of the 2024 IEEE International Symposium on Information Theory , Institute of Electrical and Electronics Engineers, 2024, pp. 2110–2115.
View | DOI | WoS
 

2024 | Published | Conference Paper | IST-REx-ID: 17895
Computationally efficient codes for strongly Dobrushin-Stambler nonsymmetrizable oblivious AVCs
B.K. Dey, S. Jaggi, M. Langberg, A.D. Sarwate, Y. Zhang, in:, Proceedings of the 2024 IEEE International Symposium on Information Theory , Institute of Electrical and Electronics Engineers, 2024, pp. 1586–1591.
View | DOI | WoS
 

2024 | Published | Journal Article | IST-REx-ID: 18652
Codes for adversaries: Between worst-case and average-case jamming
B.K. Dey, S. Jaggi, M. Langberg, A.D. Sarwate, Y. Zhang, Foundations and Trends in Communications and Information Theory 21 (2024) 300–588.
View | DOI
 

2024 | Published | Conference Paper | IST-REx-ID: 18890 | OA
Average gradient outer product as a mechanism for deep neural collapse
D. Beaglehole, P. Súkeník, M. Mondelli, M. Belkin, in:, 38th Annual Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
[Preprint] View | Download Preprint (ext.) | arXiv
 

Filters and Search Terms

department=MaMo

Search

Filter Publications

Display / Sort

Export / Embed