25 Publications
2024 | Published | Conference Paper | IST-REx-ID: 18973 |

Towards understanding the word sensitivity of attention layers: A study via random features
S. Bombari, M. Mondelli, in: 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 4300–4328.
[Preprint]
2024 | Published | Conference Paper | IST-REx-ID: 18975 |

Error feedback can accurately compress preconditioners
I.-V. Modoranu, A. Kalinov, E. Kurtic, E. Frantar, D.-A. Alistarh, in: 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 35910–35933.
[Preprint]
2024 | Published | Conference Paper | IST-REx-ID: 18972 |

How spurious features are memorized: Precise analysis for random and NTK features
S. Bombari, M. Mondelli, in: 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 4267–4299.
[Preprint]
2024 | Published | Conference Paper | IST-REx-ID: 18971 |

Unsupervised concept discovery mitigates spurious correlations
R. Arefin, Y. Zhang, A. Baratin, F. Locatello, I. Rish, D. Liu, K. Kawaguchi, in: Proceedings of the 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 1672–1688.
[Preprint]
2024 | Published | Conference Paper | IST-REx-ID: 18976 |

AsGrad: A sharp unified analysis of asynchronous-SGD algorithms
R. Islamov, M. Safaryan, D.-A. Alistarh, in: Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, ML Research Press, 2024, pp. 649–657.
[Preprint]
2024 | Published | Conference Paper | IST-REx-ID: 17093 |

Communication-efficient federated learning with data and client heterogeneity
H. Zakerinia, S. Talaei, G. Nadiradze, D.-A. Alistarh, in: Proceedings of the 27th International Conference on Artificial Intelligence and Statistics, ML Research Press, 2024, pp. 3448–3456.
[Preprint]
2024 | Published | Conference Paper | IST-REx-ID: 15011 |

How to prune your language model: Recovering accuracy on the "Sparsity May Cry" benchmark
E. Kurtic, T. Hoefler, D.-A. Alistarh, in: Proceedings of Machine Learning Research, ML Research Press, 2024, pp. 542–553.
[Preprint]
2024 | Published | Conference Paper | IST-REx-ID: 18113 |

Extreme compression of large language models via additive quantization
V. Egiazarian, A. Panferov, D. Kuznedelev, E. Frantar, A. Babenko, D.-A. Alistarh, in: Proceedings of the 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 12284–12303.
[Preprint]
2024 | Published | Conference Paper | IST-REx-ID: 18114 |

Mechanistic neural networks for scientific machine learning
A.A. Pervez, F. Locatello, E. Gavves, in: Proceedings of the 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 40484–40501.
[Published Version]
2024 | Published | Conference Paper | IST-REx-ID: 18115 |

Data-efficient learning via clustering-based sensitivity sampling: Foundation models and beyond
K. Axiotis, V. Cohen-Addad, M. Henzinger, S. Jerome, V. Mirrokni, D. Saulpic, D.P. Woodruff, M. Wunder, in: Proceedings of the 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 2086–2107.
[Published Version]
2024 | Published | Conference Paper | IST-REx-ID: 18116 |

Making old things new: A unified algorithm for differentially private clustering
M.D. La Tour, M. Henzinger, D. Saulpic, in: Proceedings of the 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 12046–12086.
[Published Version]
2024 | Published | Conference Paper | IST-REx-ID: 18117 |

RoSA: Accurate parameter-efficient fine-tuning via robust adaptation
M. Nikdan, S. Tabesh, E. Crncevic, D.-A. Alistarh, in: Proceedings of the 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 38187–38206.
[Preprint]
2024 | Published | Conference Paper | IST-REx-ID: 18118 |

More flexible PAC-Bayesian meta-learning by learning learning algorithms
H. Zakerinia, A. Behjati, C. Lampert, in: Proceedings of the 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 58122–58139.
[Published Version]
2024 | Published | Conference Paper | IST-REx-ID: 18120 |

Improved modelling of federated datasets using mixtures-of-Dirichlet-multinomials
J.A. Scott, Á. Cahill, in: Proceedings of the 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 44012–44037.
[Preprint]
2024 | Published | Conference Paper | IST-REx-ID: 18121 |

SPADE: Sparsity-guided debugging for deep neural networks
A.S. Moakhar, E.B. Iofinova, E. Frantar, D.-A. Alistarh, in: Proceedings of the 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 45955–45987.
[Preprint]
2023 | Published | Conference Paper | IST-REx-ID: 14458 |

SparseGPT: Massive language models can be accurately pruned in one-shot
E. Frantar, D.-A. Alistarh, in: Proceedings of the 40th International Conference on Machine Learning, ML Research Press, 2023, pp. 10323–10337.
[Preprint]
2023 | Published | Conference Paper | IST-REx-ID: 14460 |

SparseProp: Efficient sparse backpropagation for faster training of neural networks at the edge
M. Nikdan, T. Pegolotti, E.B. Iofinova, E. Kurtic, D.-A. Alistarh, in: Proceedings of the 40th International Conference on Machine Learning, ML Research Press, 2023, pp. 26215–26227.
[Preprint]
2023 | Published | Conference Paper | IST-REx-ID: 14461 |

Quantized distributed training of large models with convergence guarantees
I. Markov, A. Vladu, Q. Guo, D.-A. Alistarh, in: Proceedings of the 40th International Conference on Machine Learning, ML Research Press, 2023, pp. 24020–24044.
[Preprint]
2023 | Published | Conference Paper | IST-REx-ID: 14462 |

Constant matters: Fine-grained error bound on differentially private continual observation
H. Fichtenberger, M. Henzinger, J. Upadhyay, in: Proceedings of the 40th International Conference on Machine Learning, ML Research Press, 2023, pp. 10072–10092.
[Published Version]
2023 | Published | Conference Paper | IST-REx-ID: 14459 |

Fundamental limits of two-layer autoencoders, and achieving them with gradient methods
A. Shevchenko, K. Kögler, H. Hassani, M. Mondelli, in: Proceedings of the 40th International Conference on Machine Learning, ML Research Press, 2023, pp. 31151–31209.
[Preprint]