Eldar Kurtic
10 Publications
2025 |
Published |
Conference Paper |
IST-REx-ID: 20684 |
“Give me BF16 or give me death”? Accuracy-performance trade-offs in LLM quantization
E. Kurtic, A. Marques, S. Pandit, M. Kurtz, D.-A. Alistarh, in:, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, 2025, pp. 26872–26886.
[Published Version]
View
| Files available
| arXiv
E. Kurtic, A. Marques, S. Pandit, M. Kurtz, D.-A. Alistarh, in:, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, 2025, pp. 26872–26886.
2025 |
Published |
Conference Paper |
IST-REx-ID: 20820 |
EvoPress: Accurate dynamic model compression via evolutionary search
O. Sieberling, D. Kuznedelev, E. Kurtic, D.-A. Alistarh, in:, 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 55556–55590.
[Published Version]
View
| Files available
| arXiv
O. Sieberling, D. Kuznedelev, E. Kurtic, D.-A. Alistarh, in:, 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 55556–55590.
2025 |
Published |
Book Chapter |
IST-REx-ID: 21257 |
Sparse Fine-Tuning for Inference Acceleration of Large Language Models
E. Kurtic, D. Kuznedelev, E. Frantar, M. Goinv, S. Pandit, A. Agarwalla, T. Nguyen, A. Marques, M. Kurtz, D.-A. Alistarh, in:, P. Passban, A. Way, M. Rezagholizadeh (Eds.), Enhancing LLM Performance. Efficacy, Fine-Tuning, and Inference Techniques, Springer Nature, 2025, pp. 83–97.
[Preprint]
View
| DOI
| Download Preprint (ext.)
| arXiv
E. Kurtic, D. Kuznedelev, E. Frantar, M. Goinv, S. Pandit, A. Agarwalla, T. Nguyen, A. Marques, M. Kurtz, D.-A. Alistarh, in:, P. Passban, A. Way, M. Rezagholizadeh (Eds.), Enhancing LLM Performance. Efficacy, Fine-Tuning, and Inference Techniques, Springer Nature, 2025, pp. 83–97.
2024 |
Published |
Conference Paper |
IST-REx-ID: 18975 |
Error feedback can accurately compress preconditioners
I.-V. Modoranu, A. Kalinov, E. Kurtic, E. Frantar, D.-A. Alistarh, in:, 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 35910–35933.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
I.-V. Modoranu, A. Kalinov, E. Kurtic, E. Frantar, D.-A. Alistarh, in:, 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 35910–35933.
2024 |
Published |
Conference Paper |
IST-REx-ID: 19510 |
MICROADAM: Accurate adaptive optimization with low space overhead and provable convergence
I.-V. Modoranu, M. Safaryan, G. Malinovsky, E. Kurtic, T. Robert, P. Richtárik, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
[Preprint]
View
| Files available
| Download Preprint (ext.)
| arXiv
I.-V. Modoranu, M. Safaryan, G. Malinovsky, E. Kurtic, T. Robert, P. Richtárik, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
2024 |
Published |
Conference Paper |
IST-REx-ID: 15011 |
How to prune your language model: Recovering accuracy on the "Sparsity May Cry" benchmark
E. Kurtic, T. Hoefler, D.-A. Alistarh, in:, Proceedings of Machine Learning Research, ML Research Press, 2024, pp. 542–553.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
E. Kurtic, T. Hoefler, D.-A. Alistarh, in:, Proceedings of Machine Learning Research, ML Research Press, 2024, pp. 542–553.
2023 |
Published |
Conference Paper |
IST-REx-ID: 14460 |
SparseProp: Efficient sparse backpropagation for faster training of neural networks at the edge
M. Nikdan, T. Pegolotti, E.B. Iofinova, E. Kurtic, D.-A. Alistarh, in:, Proceedings of the 40th International Conference on Machine Learning, ML Research Press, 2023, pp. 26215–26227.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
M. Nikdan, T. Pegolotti, E.B. Iofinova, E. Kurtic, D.-A. Alistarh, in:, Proceedings of the 40th International Conference on Machine Learning, ML Research Press, 2023, pp. 26215–26227.
2023 |
Published |
Conference Paper |
IST-REx-ID: 13053 |
CrAM: A Compression-Aware Minimizer
A. Krumes, A. Vladu, E. Kurtic, C. Lampert, D.-A. Alistarh, in:, 11th International Conference on Learning Representations , OpenReview, 2023.
[Published Version]
View
| Files available
| Download Published Version (ext.)
| arXiv
A. Krumes, A. Vladu, E. Kurtic, C. Lampert, D.-A. Alistarh, in:, 11th International Conference on Learning Representations , OpenReview, 2023.
2022 |
Published |
Conference Paper |
IST-REx-ID: 17088 |
The optimal BERT surgeon: Scalable and accurate second-order pruning for large language models
E. Kurtic, D. Campos, T. Nguyen, E. Frantar, M. Kurtz, B. Fineran, M. Goin, D.-A. Alistarh, in:, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2022, pp. 4163–4181.
[Published Version]
View
| Files available
| DOI
| arXiv
E. Kurtic, D. Campos, T. Nguyen, E. Frantar, M. Kurtz, B. Fineran, M. Goin, D.-A. Alistarh, in:, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2022, pp. 4163–4181.
2021 |
Published |
Conference Paper |
IST-REx-ID: 11463 |
M-FAC: Efficient matrix-free approximations of second-order information
E. Frantar, E. Kurtic, D.-A. Alistarh, in:, 35th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2021, pp. 14873–14886.
[Published Version]
View
| Download Published Version (ext.)
| arXiv
E. Frantar, E. Kurtic, D.-A. Alistarh, in:, 35th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2021, pp. 14873–14886.
Grants
10 Publications
2025 |
Published |
Conference Paper |
IST-REx-ID: 20684 |
“Give me BF16 or give me death”? Accuracy-performance trade-offs in LLM quantization
E. Kurtic, A. Marques, S. Pandit, M. Kurtz, D.-A. Alistarh, in:, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, 2025, pp. 26872–26886.
[Published Version]
View
| Files available
| arXiv
E. Kurtic, A. Marques, S. Pandit, M. Kurtz, D.-A. Alistarh, in:, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, 2025, pp. 26872–26886.
2025 |
Published |
Conference Paper |
IST-REx-ID: 20820 |
EvoPress: Accurate dynamic model compression via evolutionary search
O. Sieberling, D. Kuznedelev, E. Kurtic, D.-A. Alistarh, in:, 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 55556–55590.
[Published Version]
View
| Files available
| arXiv
O. Sieberling, D. Kuznedelev, E. Kurtic, D.-A. Alistarh, in:, 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 55556–55590.
2025 |
Published |
Book Chapter |
IST-REx-ID: 21257 |
Sparse Fine-Tuning for Inference Acceleration of Large Language Models
E. Kurtic, D. Kuznedelev, E. Frantar, M. Goinv, S. Pandit, A. Agarwalla, T. Nguyen, A. Marques, M. Kurtz, D.-A. Alistarh, in:, P. Passban, A. Way, M. Rezagholizadeh (Eds.), Enhancing LLM Performance. Efficacy, Fine-Tuning, and Inference Techniques, Springer Nature, 2025, pp. 83–97.
[Preprint]
View
| DOI
| Download Preprint (ext.)
| arXiv
E. Kurtic, D. Kuznedelev, E. Frantar, M. Goinv, S. Pandit, A. Agarwalla, T. Nguyen, A. Marques, M. Kurtz, D.-A. Alistarh, in:, P. Passban, A. Way, M. Rezagholizadeh (Eds.), Enhancing LLM Performance. Efficacy, Fine-Tuning, and Inference Techniques, Springer Nature, 2025, pp. 83–97.
2024 |
Published |
Conference Paper |
IST-REx-ID: 18975 |
Error feedback can accurately compress preconditioners
I.-V. Modoranu, A. Kalinov, E. Kurtic, E. Frantar, D.-A. Alistarh, in:, 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 35910–35933.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
I.-V. Modoranu, A. Kalinov, E. Kurtic, E. Frantar, D.-A. Alistarh, in:, 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 35910–35933.
2024 |
Published |
Conference Paper |
IST-REx-ID: 19510 |
MICROADAM: Accurate adaptive optimization with low space overhead and provable convergence
I.-V. Modoranu, M. Safaryan, G. Malinovsky, E. Kurtic, T. Robert, P. Richtárik, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
[Preprint]
View
| Files available
| Download Preprint (ext.)
| arXiv
I.-V. Modoranu, M. Safaryan, G. Malinovsky, E. Kurtic, T. Robert, P. Richtárik, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
2024 |
Published |
Conference Paper |
IST-REx-ID: 15011 |
How to prune your language model: Recovering accuracy on the "Sparsity May Cry" benchmark
E. Kurtic, T. Hoefler, D.-A. Alistarh, in:, Proceedings of Machine Learning Research, ML Research Press, 2024, pp. 542–553.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
E. Kurtic, T. Hoefler, D.-A. Alistarh, in:, Proceedings of Machine Learning Research, ML Research Press, 2024, pp. 542–553.
2023 |
Published |
Conference Paper |
IST-REx-ID: 14460 |
SparseProp: Efficient sparse backpropagation for faster training of neural networks at the edge
M. Nikdan, T. Pegolotti, E.B. Iofinova, E. Kurtic, D.-A. Alistarh, in:, Proceedings of the 40th International Conference on Machine Learning, ML Research Press, 2023, pp. 26215–26227.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
M. Nikdan, T. Pegolotti, E.B. Iofinova, E. Kurtic, D.-A. Alistarh, in:, Proceedings of the 40th International Conference on Machine Learning, ML Research Press, 2023, pp. 26215–26227.
2023 |
Published |
Conference Paper |
IST-REx-ID: 13053 |
CrAM: A Compression-Aware Minimizer
A. Krumes, A. Vladu, E. Kurtic, C. Lampert, D.-A. Alistarh, in:, 11th International Conference on Learning Representations , OpenReview, 2023.
[Published Version]
View
| Files available
| Download Published Version (ext.)
| arXiv
A. Krumes, A. Vladu, E. Kurtic, C. Lampert, D.-A. Alistarh, in:, 11th International Conference on Learning Representations , OpenReview, 2023.
2022 |
Published |
Conference Paper |
IST-REx-ID: 17088 |
The optimal BERT surgeon: Scalable and accurate second-order pruning for large language models
E. Kurtic, D. Campos, T. Nguyen, E. Frantar, M. Kurtz, B. Fineran, M. Goin, D.-A. Alistarh, in:, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2022, pp. 4163–4181.
[Published Version]
View
| Files available
| DOI
| arXiv
E. Kurtic, D. Campos, T. Nguyen, E. Frantar, M. Kurtz, B. Fineran, M. Goin, D.-A. Alistarh, in:, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2022, pp. 4163–4181.
2021 |
Published |
Conference Paper |
IST-REx-ID: 11463 |
M-FAC: Efficient matrix-free approximations of second-order information
E. Frantar, E. Kurtic, D.-A. Alistarh, in:, 35th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2021, pp. 14873–14886.
[Published Version]
View
| Download Published Version (ext.)
| arXiv
E. Frantar, E. Kurtic, D.-A. Alistarh, in:, 35th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2021, pp. 14873–14886.