Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).
We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.
164 Publications
2024 |
Research Data Reference |
IST-REx-ID: 19884 |
Frantar, Elias, et al. MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models. Zenodo, 2024, doi:10.5281/ZENODO.14213091.
[Published Version]
View
| Files available
| DOI
| Download Published Version (ext.)
2024 |
Published |
Conference Paper |
IST-REx-ID: 18975 |
Modoranu, Ionut-Vlad, et al. “Error Feedback Can Accurately Compress Preconditioners.” 41st International Conference on Machine Learning, vol. 235, ML Research Press, 2024, pp. 35910–33.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
2024 |
Published |
Conference Paper |
IST-REx-ID: 18976 |
Islamov, Rustem, et al. “AsGrad: A Sharp Unified Analysis of Asynchronous-SGD Algorithms.” Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, vol. 238, ML Research Press, 2024, pp. 649–57.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
2024 |
Published |
Conference Paper |
IST-REx-ID: 18977 |
Dettmers, Tim, et al. “SpQR: A Sparse-Quantized Representation for near-Lossless LLM Weight Compression.” 12th International Conference on Learning Representations, OpenReview, 2024.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
2024 |
Published |
Conference Paper |
IST-REx-ID: 19510 |
Modoranu, Ionut-Vlad, et al. “MICROADAM: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence.” 38th Conference on Neural Information Processing Systems, vol. 37, Neural Information Processing Systems Foundation, 2024.
[Preprint]
View
| Files available
| Download Preprint (ext.)
| arXiv
2024 |
Published |
Conference Paper |
IST-REx-ID: 19511 |
Ashkboos, Saleh, et al. “QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs.” 38th Conference on Neural Information Processing Systems, vol. 37, Neural Information Processing Systems Foundation, 2024.
[Preprint]
View
| Files available
| Download Preprint (ext.)
| arXiv
2024 |
Published |
Conference Paper |
IST-REx-ID: 19518 |
Wu, Diyuan, et al. “The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information.” 38th Conference on Neural Information Processing Systems, vol. 37, Neural Information Processing Systems Foundation, 2024.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
2024 |
Published |
Conference Paper |
IST-REx-ID: 19519 |
Malinovskii, Vladimir, et al. “PV-Tuning: Beyond Straight-through Estimation for Extreme LLM Compression.” 38th Conference on Neural Information Processing Systems, vol. 37, Neural Information Processing Systems Foundation, 2024.
[Published Version]
View
| Files available
| arXiv
2024 |
Published |
Conference Paper |
IST-REx-ID: 15011 |
Kurtic, Eldar, et al. “How to Prune Your Language Model: Recovering Accuracy on the ‘Sparsity May Cry’ Benchmark.” Proceedings of Machine Learning Research, vol. 234, ML Research Press, 2024, pp. 542–53.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
2024 |
Published |
Conference Paper |
IST-REx-ID: 17093 |
Zakerinia, Hossein, et al. “Communication-Efficient Federated Learning with Data and Client Heterogeneity.” Proceedings of the 27th International Conference on Artificial Intelligence and Statistics, vol. 238, ML Research Press, 2024, pp. 3448–56.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
2024 |
Published |
Conference Paper |
IST-REx-ID: 17329 |
Alistarh, Dan-Adrian, et al. “Game Dynamics and Equilibrium Computation in the Population Protocol Model.” Proceedings of the 43rd Annual ACM Symposium on Principles of Distributed Computing, Association for Computing Machinery, 2024, pp. 40–49, doi:10.1145/3662158.3662768.
[Published Version]
View
| Files available
| DOI
2024 |
Published |
Conference Paper |
IST-REx-ID: 17332 |
Kokorin, Ilya, et al. “Wait-Free Trees with Asymptotically-Efficient Range Queries.” 2024 IEEE International Parallel and Distributed Processing Symposium, IEEE, 2024, pp. 169–79, doi:10.1109/IPDPS57955.2024.00023.
[Preprint]
View
| DOI
| Download Preprint (ext.)
| WoS
| arXiv
2024 |
Published |
Conference Paper |
IST-REx-ID: 17456 |
Markov, Ilia, et al. “L-GreCo: Layerwise-Adaptive Gradient Compression for Efficient Data-Parallel Deep Learning.” Proceedings of Machine Learning and Systems , edited by P. Gibbons et al., vol. 6, Association for Computing Machinery, 2024.
[Published Version]
View
| Files available
| Download Published Version (ext.)
| arXiv
2024 |
Published |
Thesis | PhD |
IST-REx-ID: 17485 |
Frantar, Elias. Compressing Large Neural Networks : Algorithms, Systems and Scaling Laws. Institute of Science and Technology Austria, 2024, doi:10.15479/at:ista:17485.
[Published Version]
View
| Files available
| DOI
2024 |
Published |
Thesis | PhD |
IST-REx-ID: 17490 |
Markov, Ilia. Communication-Efficient Distributed Training of Deep Neural Networks : An Algorithms and Systems Perspective. Institute of Science and Technology Austria, 2024, doi:10.15479/at:ista:17490.
[Published Version]
View
| Files available
| DOI
2024 |
Published |
Thesis | PhD |
IST-REx-ID: 17465 |
Shevchenko, Alexander. High-Dimensional Limits in Artificial Neural Networks. Institute of Science and Technology Austria, 2024, doi:10.15479/at:ista:17465.
[Published Version]
View
| Files available
| DOI
2024 |
Published |
Conference Paper |
IST-REx-ID: 17469 |
Kögler, Kevin, et al. “Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth.” Proceedings of the 41st International Conference on Machine Learning, vol. 235, ML Research Press, 2024, pp. 24964–5015.
[Published Version]
View
| Files available
| Download Published Version (ext.)
| arXiv
2023 |
Published |
Conference Paper |
IST-REx-ID: 14260 |
Koval, Nikita, et al. “Lincheck: A Practical Framework for Testing Concurrent Data Structures on JVM.” 35th International Conference on Computer Aided Verification , vol. 13964, Springer Nature, 2023, pp. 156–69, doi:10.1007/978-3-031-37706-8_8.
[Published Version]
View
| Files available
| DOI
| WoS
2023 |
Published |
Journal Article |
IST-REx-ID: 14364 |
Alistarh, Dan-Adrian, et al. “Why Extension-Based Proofs Fail.” SIAM Journal on Computing, vol. 52, no. 4, Society for Industrial and Applied Mathematics, 2023, pp. 913–44, doi:10.1137/20M1375851.
[Preprint]
View
| Files available
| DOI
| Download Preprint (ext.)
| WoS
| arXiv
2023 |
Published |
Conference Paper |
IST-REx-ID: 14458 |
Frantar, Elias, and Dan-Adrian Alistarh. “SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot.” Proceedings of the 40th International Conference on Machine Learning, vol. 202, ML Research Press, 2023, pp. 10323–37.
[Preprint]
View
| Files available
| Download Preprint (ext.)
| arXiv