Mher Safaryan
Alistarh Group
5 Publications
2024 | Published | Conference Paper | IST-REx-ID: 18976 |

AsGrad: A sharp unified analysis of asynchronous-SGD algorithms
R. Islamov, M. Safaryan, D.-A. Alistarh, in:, Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, ML Research Press, 2024, pp. 649–657.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
R. Islamov, M. Safaryan, D.-A. Alistarh, in:, Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, ML Research Press, 2024, pp. 649–657.
2024 | Published | Conference Paper | IST-REx-ID: 19518 |

The iterative optimal brain surgeon: Faster sparse recovery by leveraging second-order information
D. Wu, I.-V. Modoranu, M. Safaryan, D. Kuznedelev, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
D. Wu, I.-V. Modoranu, M. Safaryan, D. Kuznedelev, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
2024 | Published | Conference Paper | IST-REx-ID: 19510 |

MICROADAM: Accurate adaptive optimization with low space overhead and provable convergence
I.-V. Modoranu, M. Safaryan, G. Malinovsky, E. Kurtic, T. Robert, P. Richtárik, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
[Preprint]
View
| Files available
| Download Preprint (ext.)
| arXiv
I.-V. Modoranu, M. Safaryan, G. Malinovsky, E. Kurtic, T. Robert, P. Richtárik, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
2023 | Published | Journal Article | IST-REx-ID: 14815 |

On biased compression for distributed learning
A. Beznosikov, S. Horvath, P. Richtarik, M. Safaryan, Journal of Machine Learning Research 24 (2023) 1–50.
[Published Version]
View
| Files available
| WoS
| arXiv
A. Beznosikov, S. Horvath, P. Richtarik, M. Safaryan, Journal of Machine Learning Research 24 (2023) 1–50.
2023 | Published | Conference Paper | IST-REx-ID: 15363 |

Knowledge distillation performs partial variance reduction
M. Safaryan, A. Krumes, D.-A. Alistarh, in:, 36th Conference on Neural Information Processing Systems, 2023.
[Published Version]
View
| Files available
| arXiv
M. Safaryan, A. Krumes, D.-A. Alistarh, in:, 36th Conference on Neural Information Processing Systems, 2023.
Grants
5 Publications
2024 | Published | Conference Paper | IST-REx-ID: 18976 |

AsGrad: A sharp unified analysis of asynchronous-SGD algorithms
R. Islamov, M. Safaryan, D.-A. Alistarh, in:, Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, ML Research Press, 2024, pp. 649–657.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
R. Islamov, M. Safaryan, D.-A. Alistarh, in:, Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, ML Research Press, 2024, pp. 649–657.
2024 | Published | Conference Paper | IST-REx-ID: 19518 |

The iterative optimal brain surgeon: Faster sparse recovery by leveraging second-order information
D. Wu, I.-V. Modoranu, M. Safaryan, D. Kuznedelev, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
D. Wu, I.-V. Modoranu, M. Safaryan, D. Kuznedelev, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
2024 | Published | Conference Paper | IST-REx-ID: 19510 |

MICROADAM: Accurate adaptive optimization with low space overhead and provable convergence
I.-V. Modoranu, M. Safaryan, G. Malinovsky, E. Kurtic, T. Robert, P. Richtárik, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
[Preprint]
View
| Files available
| Download Preprint (ext.)
| arXiv
I.-V. Modoranu, M. Safaryan, G. Malinovsky, E. Kurtic, T. Robert, P. Richtárik, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
2023 | Published | Journal Article | IST-REx-ID: 14815 |

On biased compression for distributed learning
A. Beznosikov, S. Horvath, P. Richtarik, M. Safaryan, Journal of Machine Learning Research 24 (2023) 1–50.
[Published Version]
View
| Files available
| WoS
| arXiv
A. Beznosikov, S. Horvath, P. Richtarik, M. Safaryan, Journal of Machine Learning Research 24 (2023) 1–50.
2023 | Published | Conference Paper | IST-REx-ID: 15363 |

Knowledge distillation performs partial variance reduction
M. Safaryan, A. Krumes, D.-A. Alistarh, in:, 36th Conference on Neural Information Processing Systems, 2023.
[Published Version]
View
| Files available
| arXiv
M. Safaryan, A. Krumes, D.-A. Alistarh, in:, 36th Conference on Neural Information Processing Systems, 2023.