ISTA Research Explorer

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

Alexander Fedorov

Andrei Panferov

Dan Alistarh

Sergei Pankratov

Vage Egiazarian

Erik Schultheis

Mahdi Nikdan

Soroush Tabesh

Jiale Chen

Alexandra Volkova

Eldar Kurtic

Ionut-Vlad Modoranu

Janne Korhonen

Vitalii Aksenov

Martin Töpfer

Giorgi Nadiradze

Bapi Chatterjee

Alexandra Krumes

Peter Davies

Sidak Singh

Trevor Brown

Elvir Crncevic

Sokolova, Mariia

Elias Frantar

Sharareh Alipour

Joel Rybicki

Ilia Markov

Roberto Lopez Castro

Mher Safaryan

164 Publications

Search / Filter

2024 | Research Data Reference | IST-REx-ID: 19884 |

Frantar E, Castro R, Chen J, Hoefler T, Alistarh D-A. MARLIN: Mixed-precision auto-regressive parallel inference on Large Language Models. 2024. doi:10.5281/ZENODO.14213091

[Published Version] View | Files available | DOI | Download Published Version (ext.)

2024 | Published | Conference Paper | IST-REx-ID: 18975 |

Modoranu I-V, Kalinov A, Kurtic E, Frantar E, Alistarh D-A. Error feedback can accurately compress preconditioners. In: 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:35910-35933.

[Preprint] View | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 18976 |

Islamov R, Safaryan M, Alistarh D-A. AsGrad: A sharp unified analysis of asynchronous-SGD algorithms. In: Proceedings of The 27th International Conference on Artificial Intelligence and Statistics. Vol 238. ML Research Press; 2024:649-657.

[Preprint] View | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 18977 |

Dettmers T, Svirschevski RA, Egiazarian V, et al. SpQR: A sparse-quantized representation for near-lossless LLM weight compression. In: 12th International Conference on Learning Representations. OpenReview; 2024.

[Preprint] View | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 19510 |

Modoranu I-V, Safaryan M, Malinovsky G, et al. MICROADAM: Accurate adaptive optimization with low space overhead and provable convergence. In: 38th Conference on Neural Information Processing Systems. Vol 37. Neural Information Processing Systems Foundation; 2024.

[Preprint] View | Files available | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 19511 |

Ashkboos S, Mohtashami A, Croci ML, et al. QuaRot: Outlier-free 4-bit inference in rotated LLMs. In: 38th Conference on Neural Information Processing Systems. Vol 37. Neural Information Processing Systems Foundation; 2024.

[Preprint] View | Files available | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 19518 |

Wu D, Modoranu I-V, Safaryan M, Kuznedelev D, Alistarh D-A. The iterative optimal brain surgeon: Faster sparse recovery by leveraging second-order information. In: 38th Conference on Neural Information Processing Systems. Vol 37. Neural Information Processing Systems Foundation; 2024.

[Preprint] View | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 19519 |

Malinovskii V, Mazur D, Ilin I, et al. PV-tuning: Beyond straight-through estimation for extreme LLM compression. In: 38th Conference on Neural Information Processing Systems. Vol 37. Neural Information Processing Systems Foundation; 2024.

[Published Version] View | Files available | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 15011 |

Kurtic E, Hoefler T, Alistarh D-A. How to prune your language model: Recovering accuracy on the “Sparsity May Cry” benchmark. In: Proceedings of Machine Learning Research. Vol 234. ML Research Press; 2024:542-553.

[Preprint] View | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 17093 |

Zakerinia H, Talaei S, Nadiradze G, Alistarh D-A. Communication-efficient federated learning with data and client heterogeneity. In: Proceedings of the 27th International Conference on Artificial Intelligence and Statistics. Vol 238. ML Research Press; 2024:3448-3456.

[Preprint] View | Download Preprint (ext.) | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 17329 |

Alistarh D-A, Chatterjee K, Karrabi M, Lazarsfeld JM. Game dynamics and equilibrium computation in the population protocol model. In: Proceedings of the 43rd Annual ACM Symposium on Principles of Distributed Computing. Association for Computing Machinery; 2024:40-49. doi:10.1145/3662158.3662768

[Published Version] View | Files available | DOI

2024 | Published | Conference Paper | IST-REx-ID: 17332 |

Kokorin I, Yudov V, Aksenov V, Alistarh D-A. Wait-free trees with asymptotically-efficient range queries. In: 2024 IEEE International Parallel and Distributed Processing Symposium. IEEE; 2024:169-179. doi:10.1109/IPDPS57955.2024.00023

[Preprint] View | DOI | Download Preprint (ext.) | WoS | arXiv

2024 | Published | Conference Paper | IST-REx-ID: 17456 |

Markov I, Alimohammadi K, Frantar E, Alistarh D-A. L-GreCo: Layerwise-adaptive gradient compression for efficient data-parallel deep learning. In: Gibbons P, Pekhimenko G, De Sa C, eds. Proceedings of Machine Learning and Systems . Vol 6. Association for Computing Machinery; 2024.

[Published Version] View | Files available | Download Published Version (ext.) | arXiv

2024 | Published | Thesis | PhD | IST-REx-ID: 17485 |

Frantar E. Compressing large neural networks : Algorithms, systems and scaling laws. 2024. doi:10.15479/at:ista:17485

[Published Version] View | Files available | DOI

2024 | Published | Thesis | PhD | IST-REx-ID: 17490 |

Markov I. Communication-efficient distributed training of deep neural networks : An algorithms and systems perspective. 2024. doi:10.15479/at:ista:17490

[Published Version] View | Files available | DOI

2024 | Published | Thesis | PhD | IST-REx-ID: 17465 |

Shevchenko A. High-dimensional limits in artificial neural networks. 2024. doi:10.15479/at:ista:17465

[Published Version] View | Files available | DOI

2024 | Published | Conference Paper | IST-REx-ID: 17469 |

Kögler K, Shevchenko A, Hassani H, Mondelli M. Compression of structured data with autoencoders: Provable benefit of nonlinearities and depth. In: Proceedings of the 41st International Conference on Machine Learning. Vol 235. ML Research Press; 2024:24964-25015.

[Published Version] View | Files available | Download Published Version (ext.) | arXiv

2023 | Published | Conference Paper | IST-REx-ID: 14260 |

Koval N, Fedorov A, Sokolova M, Tsitelov D, Alistarh D-A. Lincheck: A practical framework for testing concurrent data structures on JVM. In: 35th International Conference on Computer Aided Verification . Vol 13964. Springer Nature; 2023:156-169. doi:10.1007/978-3-031-37706-8_8

[Published Version] View | Files available | DOI | WoS

2023 | Published | Journal Article | IST-REx-ID: 14364 |

Alistarh D-A, Aspnes J, Ellen F, Gelashvili R, Zhu L. Why extension-based proofs fail. SIAM Journal on Computing. 2023;52(4):913-944. doi:10.1137/20M1375851

2023 | Published | Conference Paper | IST-REx-ID: 14458 |

Frantar E, Alistarh D-A. SparseGPT: Massive language models can be accurately pruned in one-shot. In: Proceedings of the 40th International Conference on Machine Learning. Vol 202. ML Research Press; 2023:10323-10337.

[Preprint] View | Files available | Download Preprint (ext.) | arXiv

Alistarh Group

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

Alumni

164 Publications

Search

Filter Publications

Display / Sort

Export / Embed

Alistarh Group

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

Alumni

164 Publications

Search

Filter Publications

Display / Sort

Export / Embed

Export Options