3 Publications

[3]
2024 | Published | Conference Paper | IST-REx-ID: 18976 | OA
AsGrad: A sharp unified analysis of asynchronous-SGD algorithms
R. Islamov, M. Safaryan, D.-A. Alistarh, in: Proceedings of the 27th International Conference on Artificial Intelligence and Statistics, ML Research Press, 2024, pp. 649–657.
 
[2]
2023 | Published | Journal Article | IST-REx-ID: 14815 | OA
On biased compression for distributed learning
A. Beznosikov, S. Horvath, P. Richtarik, M. Safaryan, Journal of Machine Learning Research 24 (2023) 1–50.
 
[1]
2023 | Published | Conference Paper | IST-REx-ID: 15363 | OA
Knowledge distillation performs partial variance reduction
M. Safaryan, A. Krumes, D.-A. Alistarh, in: 36th Conference on Neural Information Processing Systems, 2023.
 
