ISTA Research Explorer

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

Andrei Panferov

Eldar Kurtic

Dan Alistarh

Alexander Fedorov

Vage Egiazarian

Erik Schultheis

Soroush Tabesh

Sergei Pankratov

Jiale Chen

Alexandra Volkova

Eugenia Iofinova

Mahdi Nikdan

Ionut-Vlad Modoranu

Mher Safaryan

Ilia Markov

Bapi Chatterjee

Alexandra Krumes

Peter Davies

Vitalii Aksenov

Giorgi Nadiradze

Sidak Singh

Elvir Crncevic

Sharareh Alipour

Elias Frantar

Joel Rybicki

Roberto Lopez Castro

Trevor Brown

Martin Töpfer

Janne Korhonen

Sokolova, Mariia

162 Publications

Search / Filter

2025 | Published | Journal Article | IST-REx-ID: 19713 |

Hybrid decentralized optimization: Leveraging both first- and zeroth-order optimizers for faster convergence
S. Talaei, M. Ansaripour, G. Nadiradze, D.-A. Alistarh, Proceedings of The39th AAAI Conference on Artificial Intelligence 39 (2025) 20778–20786.

[Preprint] View | Files available | DOI | Download Preprint (ext.) | arXiv

2025 | Published | Conference Paper | IST-REx-ID: 20038 |

The journey matters: Average parameter count over pre-training unifies sparse and dense scaling laws
T. Jin, A.I. Humayun, U. Evci, S. Subramanian, A. Yazdanbakhsh, D.-A. Alistarh, G.K. Dziugaite, in:, 13th International Conference on Learning Representations, ICLR, 2025, pp. 85165–85181.

[Published Version] View | Files available | arXiv

2025 | Published | Conference Paper | IST-REx-ID: 20037 |

Wasserstein distances, neuronal entanglement, and sparsity
S. Sawmya, L. Kong, I. Markov, D.-A. Alistarh, N. Shavit, in:, 13th International Conference on Learning Representations, ICLR, 2025, pp. 26244–26274.

[Published Version] View | Files available | arXiv

2025 | Published | Conference Paper | IST-REx-ID: 20032 |

Scalable mechanistic neural networks
J. Chen, D. Yao, A.A. Pervez, D.-A. Alistarh, F. Locatello, in:, 13th International Conference on Learning Representations, ICLR, 2025, pp. 63716–63737.

[Published Version] View | Files available | arXiv

2025 | Published | Conference Paper | IST-REx-ID: 20034 |

LDAdam: Adaptive optimization from low-dimensional gradient statistics
T. Robert, M. Safaryan, I.-V. Modoranu, D.-A. Alistarh, in:, 13th International Conference on Learning Representations, ICLR, 2025, pp. 101877–101913.

[Published Version] View | Files available | arXiv

2025 | Published | Conference Paper | IST-REx-ID: 19877 |

MARLIN: Mixed-precision auto-regressive parallel inference on Large Language Models
E. Frantar, R.L. Castro, J. Chen, T. Hoefler, D.-A. Alistarh, in:, Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, Association for Computing Machinery, 2025, pp. 239–251.

[Published Version] View | Files available | DOI | WoS | arXiv

2025 | Published | Conference Paper | IST-REx-ID: 20684 |

“Give me BF16 or give me death”? Accuracy-performance trade-offs in LLM quantization
E. Kurtic, A. Marques, S. Pandit, M. Kurtz, D.-A. Alistarh, in:, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, 2025, pp. 26872–26886.

[Published Version] View | Files available | arXiv

2025 | Published | Conference Paper | IST-REx-ID: 20224 |

In the search of optimal tree networks: Hardness and heuristics
P. Martynov, M. Buzdalov, S. Pankratov, V. Aksenov, S. Schmid, in:, Proceedings of the 2025 Genetic and Evolutionary Computation Conference, Association for Computing Machinery, 2025, pp. 249–257.

[Published Version] View | Files available | DOI | WoS

2025 | Published | Journal Article | IST-REx-ID: 20704

Scalable multitemperature free energy sampling of classical Ising spin states
P. Tuo, Z. Zeng, J. Chen, B. Cheng, Journal of Chemical Theory and Computation 21 (2025) 11427–11435.

2025 | Published | Conference Paper | IST-REx-ID: 20821 |

Layer-wise quantization for quantized optimistic dual averaging
A.D. Nguyen, I. Markov, F.Z. Wu, A. Ramezani-Kebrya, K. Antonakopoulos, D.-A. Alistarh, V. Cevher, in:, 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 46026–46072.

[Published Version] View | Files available | arXiv

2025 | Published | Conference Paper | IST-REx-ID: 20820 |

EvoPress: Accurate dynamic model compression via evolutionary search
O. Sieberling, D. Kuznedelev, E. Kurtic, D.-A. Alistarh, in:, 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 55556–55590.

[Published Version] View | Files available | arXiv

Near-optimal leader election in population protocols on graphs
D.-A. Alistarh, J. Rybicki, S. Voitovych, Distributed Computing 38 (2025) 207–245.

[Published Version] View | Files available | DOI | WoS | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 17093 |

Communication-efficient federated learning with data and client heterogeneity
H. Zakerinia, S. Talaei, G. Nadiradze, D.-A. Alistarh, in:, Proceedings of the 27th International Conference on Artificial Intelligence and Statistics, ML Research Press, 2024, pp. 3448–3456.

[Preprint] View | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 15011 |

How to prune your language model: Recovering accuracy on the "Sparsity May Cry" benchmark
E. Kurtic, T. Hoefler, D.-A. Alistarh, in:, Proceedings of Machine Learning Research, ML Research Press, 2024, pp. 542–553.

[Preprint] View | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 18113 |

Extreme compression of large language models via additive quantization
V. Egiazarian, A. Panferov, D. Kuznedelev, E. Frantar, A. Babenko, D.-A. Alistarh, in:, Proceedings of the 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 12284–12303.

[Preprint] View | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 18117 |

RoSA: Accurate parameter-efficient fine-tuning via robust adaptation
M. Nikdan, S. Tabesh, E. Crncevic, D.-A. Alistarh, in:, Proceedings of the 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 38187–38206.

[Preprint] View | Files available | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 18975 |

Error feedback can accurately compress preconditioners
I.-V. Modoranu, A. Kalinov, E. Kurtic, E. Frantar, D.-A. Alistarh, in:, 41st International Conference on Machine Learning, ML Research Press, 2024, pp. 35910–35933.

[Preprint] View | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 18977 |

SpQR: A sparse-quantized representation for near-lossless LLM weight compression
T. Dettmers, R.A. Svirschevski, V. Egiazarian, D. Kuznedelev, E. Frantar, S. Ashkboos, A. Borzunov, T. Hoefler, D.-A. Alistarh, in:, 12th International Conference on Learning Representations, OpenReview, 2024.

[Preprint] View | Download Preprint (ext.) | arXiv

2024 | Published | Thesis | IST-REx-ID: 17485 |

Compressing large neural networks : Algorithms, systems and scaling laws
E. Frantar, Compressing Large Neural Networks : Algorithms, Systems and Scaling Laws, Institute of Science and Technology Austria, 2024.

[Published Version] View | Files available | DOI

2024 | Published | Conference Paper | IST-REx-ID: 18061 |

QMoE: Sub-1-bit compression of trillion parameter models
E. Frantar, D.-A. Alistarh, in:, P. Gibbons, G. Pekhimenko, C. De Sa (Eds.), Proceedings of Machine Learning and Systems, 2024.

[Published Version] View | Files available | Download Published Version (ext.)

Alistarh Group

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

Alumni

162 Publications

Search

Filter Publications

Display / Sort

Export / Embed

Alistarh Group

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

Alumni

162 Publications

Search

Filter Publications

Display / Sort

Export / Embed

Export Options