6 Publications

Mark all

[6]
2025 | Published | Conference Paper | IST-REx-ID: 20034 | OA
Robert, Thomas, et al. “LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics.” 13th International Conference on Learning Representations, ICLR, 2025, pp. 101877–913.
[Published Version] View | Files available | arXiv
 
[5]
2024 | Published | Conference Paper | IST-REx-ID: 18976 | OA
Islamov, Rustem, et al. “AsGrad: A Sharp Unified Analysis of Asynchronous-SGD Algorithms.” Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, vol. 238, ML Research Press, 2024, pp. 649–57.
[Preprint] View | Download Preprint (ext.) | arXiv
 
[4]
2024 | Published | Conference Paper | IST-REx-ID: 19518 | OA
Wu, Diyuan, et al. “The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information.” 38th Conference on Neural Information Processing Systems, vol. 37, Neural Information Processing Systems Foundation, 2024.
[Preprint] View | Download Preprint (ext.) | arXiv
 
[3]
2024 | Published | Conference Paper | IST-REx-ID: 19510 | OA
Modoranu, Ionut-Vlad, et al. “MICROADAM: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence.” 38th Conference on Neural Information Processing Systems, vol. 37, Neural Information Processing Systems Foundation, 2024.
[Preprint] View | Files available | Download Preprint (ext.) | arXiv
 
[2]
2023 | Published | Journal Article | IST-REx-ID: 14815 | OA
Beznosikov, Aleksandr, et al. “On Biased Compression for Distributed Learning.” Journal of Machine Learning Research, vol. 24, Journal of Machine Learning Research, 2023, pp. 1–50.
[Published Version] View | Files available | WoS | arXiv
 
[1]
2023 | Published | Conference Paper | IST-REx-ID: 15363 | OA
Safaryan, Mher, et al. “Knowledge Distillation Performs Partial Variance Reduction.” 36th Conference on Neural Information Processing Systems, vol. 36, 2023.
[Published Version] View | Files available | arXiv
 

Search

Filter Publications

Display / Sort

Citation Style: MLA

Export / Embed

Grants


6 Publications

Mark all

[6]
2025 | Published | Conference Paper | IST-REx-ID: 20034 | OA
Robert, Thomas, et al. “LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics.” 13th International Conference on Learning Representations, ICLR, 2025, pp. 101877–913.
[Published Version] View | Files available | arXiv
 
[5]
2024 | Published | Conference Paper | IST-REx-ID: 18976 | OA
Islamov, Rustem, et al. “AsGrad: A Sharp Unified Analysis of Asynchronous-SGD Algorithms.” Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, vol. 238, ML Research Press, 2024, pp. 649–57.
[Preprint] View | Download Preprint (ext.) | arXiv
 
[4]
2024 | Published | Conference Paper | IST-REx-ID: 19518 | OA
Wu, Diyuan, et al. “The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information.” 38th Conference on Neural Information Processing Systems, vol. 37, Neural Information Processing Systems Foundation, 2024.
[Preprint] View | Download Preprint (ext.) | arXiv
 
[3]
2024 | Published | Conference Paper | IST-REx-ID: 19510 | OA
Modoranu, Ionut-Vlad, et al. “MICROADAM: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence.” 38th Conference on Neural Information Processing Systems, vol. 37, Neural Information Processing Systems Foundation, 2024.
[Preprint] View | Files available | Download Preprint (ext.) | arXiv
 
[2]
2023 | Published | Journal Article | IST-REx-ID: 14815 | OA
Beznosikov, Aleksandr, et al. “On Biased Compression for Distributed Learning.” Journal of Machine Learning Research, vol. 24, Journal of Machine Learning Research, 2023, pp. 1–50.
[Published Version] View | Files available | WoS | arXiv
 
[1]
2023 | Published | Conference Paper | IST-REx-ID: 15363 | OA
Safaryan, Mher, et al. “Knowledge Distillation Performs Partial Variance Reduction.” 36th Conference on Neural Information Processing Systems, vol. 36, 2023.
[Published Version] View | Files available | arXiv
 

Search

Filter Publications

Display / Sort

Citation Style: MLA

Export / Embed