Please note that ISTA Research Explorer no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

25 Publications


2024 | Published | Conference Paper | IST-REx-ID: 18973 | OA
Bombari S, Mondelli M. Towards understanding the word sensitivity of attention layers: A study via random features. In: 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:4300-4328.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 18975 | OA
Modoranu I-V, Kalinov A, Kurtic E, Frantar E, Alistarh D-A. Error feedback can accurately compress preconditioners. In: 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:35910-35933.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 18972 | OA
Bombari S, Mondelli M. How spurious features are memorized: Precise analysis for random and NTK features. In: 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:4267-4299.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 18971 | OA
Arefin R, Zhang Y, Baratin A, et al. Unsupervised concept discovery mitigates spurious correlations. In: Proceedings of the 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:1672-1688.
[Preprint] View | Files available | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 18976 | OA
Islamov R, Safaryan M, Alistarh D-A. AsGrad: A sharp unified analysis of asynchronous-SGD algorithms. In: Proceedings of The 27th International Conference on Artificial Intelligence and Statistics. Vol 238. ML Research Press; 2024:649-657.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 17093 | OA
Zakerinia H, Talaei S, Nadiradze G, Alistarh D-A. Communication-efficient federated learning with data and client heterogeneity. In: Proceedings of the 27th International Conference on Artificial Intelligence and Statistics. Vol 238. ML Research Press; 2024:3448-3456.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 15011 | OA
Kurtic E, Hoefler T, Alistarh D-A. How to prune your language model: Recovering accuracy on the “Sparsity May Cry” benchmark. In: Proceedings of Machine Learning Research. Vol 234. ML Research Press; 2024:542-553.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 18113 | OA
Egiazarian V, Panferov A, Kuznedelev D, Frantar E, Babenko A, Alistarh D-A. Extreme compression of large language models via additive quantization. In: Proceedings of the 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:12284-12303.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 18114 | OA
Pervez AA, Locatello F, Gavves E. Mechanistic neural networks for scientific machine learning. In: Proceedings of the 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:40484-40501.
[Published Version] View | Files available | Download Published Version (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 18115 | OA
Axiotis K, Cohen-Addad V, Henzinger M, et al. Data-efficient learning via clustering-based sensitivity sampling: Foundation models and beyond. In: Proceedings of the 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:2086-2107.
[Published Version] View | Download Published Version (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 18116 | OA
La Tour MD, Henzinger M, Saulpic D. Making old things new: A unified algorithm for differentially private clustering. In: Proceedings of the 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:12046-12086.
[Published Version] View | Download Published Version (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 18117 | OA
Nikdan M, Tabesh S, Crncevic E, Alistarh D-A. RoSA: Accurate parameter-efficient fine-tuning via robust adaptation. In: Proceedings of the 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:38187-38206.
[Preprint] View | Files available | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 18118 | OA
Zakerinia H, Behjati A, Lampert C. More flexible PAC-Bayesian meta-learning by learning learning algorithms. In: Proceedings of the 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:58122-58139.
[Published Version] View | Download Published Version (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 18120 | OA
Scott JA, Cahill Á. Improved modelling of federated datasets using mixtures-of-Dirichlet-multinomials. In: Proceedings of the 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:44012-44037.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 18121 | OA
Moakhar AS, Iofinova EB, Frantar E, Alistarh D-A. SPADE: Sparsity-guided debugging for deep neural networks. In: Proceedings of the 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:45955-45987.
[Preprint] View | Files available | Download Preprint (ext.) | arXiv
 

2023 | Published | Conference Paper | IST-REx-ID: 14458 | OA
Frantar E, Alistarh D-A. SparseGPT: Massive language models can be accurately pruned in one-shot. In: Proceedings of the 40th International Conference on Machine Learning. Vol 202. ML Research Press; 2023:10323-10337.
[Preprint] View | Files available | Download Preprint (ext.) | arXiv
 

2023 | Published | Conference Paper | IST-REx-ID: 14460 | OA
Nikdan M, Pegolotti T, Iofinova EB, Kurtic E, Alistarh D-A. SparseProp: Efficient sparse backpropagation for faster training of neural networks at the edge. In: Proceedings of the 40th International Conference on Machine Learning. Vol 202. ML Research Press; 2023:26215-26227.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2023 | Published | Conference Paper | IST-REx-ID: 14461 | OA
Markov I, Vladu A, Guo Q, Alistarh D-A. Quantized distributed training of large models with convergence guarantees. In: Proceedings of the 40th International Conference on Machine Learning. Vol 202. ML Research Press; 2023:24020-24044.
[Preprint] View | Files available | Download Preprint (ext.) | arXiv
 

2023 | Published | Conference Paper | IST-REx-ID: 14462 | OA
Fichtenberger H, Henzinger M, Upadhyay J. Constant matters: Fine-grained error bound on differentially private continual observation. In: Proceedings of the 40th International Conference on Machine Learning. Vol 202. ML Research Press; 2023:10072-10092.
[Published Version] View | Download Published Version (ext.)
 

2023 | Published | Conference Paper | IST-REx-ID: 14459 | OA
Shevchenko A, Kögler K, Hassani H, Mondelli M. Fundamental limits of two-layer autoencoders, and achieving them with gradient methods. In: Proceedings of the 40th International Conference on Machine Learning. Vol 202. ML Research Press; 2023:31151-31209.
[Preprint] View | Files available | Download Preprint (ext.) | arXiv
 

Filters and Search Terms

eissn=2640-3498

Search

Filter Publications

Display / Sort

Citation Style: AMA

Export / Embed