4 Publications

[4]
2024 | Conference Paper | IST-REx-ID: 15011 | OA
E. Kurtic, T. Hoefler, and D.-A. Alistarh, “How to prune your language model: Recovering accuracy on the ‘Sparsity May Cry’ benchmark,” in Proceedings of Machine Learning Research, Hong Kong, China, 2024, vol. 234, pp. 542–553.
 
[3]
2023 | Conference Paper | IST-REx-ID: 13053 | OA
E.-A. Peste, A. Vladu, E. Kurtic, C. Lampert, and D.-A. Alistarh, “CrAM: A Compression-Aware Minimizer,” in 11th International Conference on Learning Representations, Kigali, Rwanda, 2023.
 
[2]
2023 | Conference Paper | IST-REx-ID: 14460 | OA
M. Nikdan, T. Pegolotti, E. B. Iofinova, E. Kurtic, and D.-A. Alistarh, “SparseProp: Efficient sparse backpropagation for faster training of neural networks at the edge,” in Proceedings of the 40th International Conference on Machine Learning, Honolulu, HI, United States, 2023, vol. 202, pp. 26215–26227.
 
[1]
2021 | Conference Paper | IST-REx-ID: 11463 | OA
E. Frantar, E. Kurtic, and D.-A. Alistarh, “M-FAC: Efficient matrix-free approximations of second-order information,” in 35th Conference on Neural Information Processing Systems, Virtual, Online, 2021, vol. 34, pp. 14873–14886.
 

