Please note that ISTA Research Explorer no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

22 Publications


2024 | Published | Conference Paper | IST-REx-ID: 19518 | OA
Wu, D., Modoranu, I.-V., Safaryan, M., Kuznedelev, D., & Alistarh, D.-A. (2024). The iterative optimal brain surgeon: Faster sparse recovery by leveraging second-order information. In 38th Conference on Neural Information Processing Systems (Vol. 37). Vancouver, Canada: Neural Information Processing Systems Foundation.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 19512 | OA
Andersson, J. D., Henzinger, M., Pagh, R., Steiner, T. A., & Upadhyay, J. (2024). Continual counting with gradual privacy expiration. In 38th Conference on Neural Information Processing Systems (Vol. 37). Vancouver, Canada: Neural Information Processing Systems Foundation.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 19515 | OA
Fumero, M., Pegoraro, M., Maiorca, V., Locatello, F., & Rodolà, E. (2024). Latent functional maps: A spectral framework for representation alignment. In 38th Conference on Neural Information Processing Systems (Vol. 37). Vancouver, Canada: Neural Information Processing Systems Foundation.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 19510 | OA
Modoranu, I.-V., Safaryan, M., Malinovsky, G., Kurtic, E., Robert, T., Richtárik, P., & Alistarh, D.-A. (2024). MICROADAM: Accurate adaptive optimization with low space overhead and provable convergence. In 38th Conference on Neural Information Processing Systems (Vol. 37). Neural Information Processing Systems Foundation.
[Preprint] View | Files available | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 19517 | OA
Crisostomi, D., Fumero, M., Baieri, D., Bernard, F., & Rodolà, E. (2024). C2M3: Cycle-consistent multi-model merging. In 38th Conference on Neural Information Processing Systems (Vol. 37). Vancouver, Canada: Neural Information Processing Systems Foundation.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 19511 | OA
Ashkboos, S., Mohtashami, A., Croci, M. L., Li, B., Cameron, P., Jaggi, M., … Hensman, J. (2024). QuaRot: Outlier-free 4-bit inference in rotated LLMs. In 38th Conference on Neural Information Processing Systems (Vol. 37). Vancouver, Canada: Neural Information Processing Systems Foundation.
[Preprint] View | Files available | Download Preprint (ext.) | arXiv
 

2024 | Published | Conference Paper | IST-REx-ID: 19519 | OA
Malinovskii, V., Mazur, D., Ilin, I., Kuznedelev, D., Burlachenko, K., Yi, K., … Richtarik, P. (2024). PV-tuning: Beyond straight-through estimation for extreme LLM compression. In 38th Conference on Neural Information Processing Systems (Vol. 37). Vancouver, Canada: Neural Information Processing Systems Foundation.
[Published Version] View | Files available | arXiv
 

2023 | Published | Conference Paper | IST-REx-ID: 15363 | OA
Safaryan, M., Krumes, A., & Alistarh, D.-A. (2023). Knowledge distillation performs partial variance reduction. In 36th Conference on Neural Information Processing Systems (Vol. 36). New Orleans, LA, United States.
[Published Version] View | Files available | arXiv
 

2023 | Published | Conference Paper | IST-REx-ID: 15364 | OA
Charikar, M., Hu, L., Henzinger, M., Vötsch, M., & Waingarten, E. (2023). Simple, scalable and effective clustering via one-dimensional projections. In 37th Conference on Neural Information Processing Systems (Vol. 36). New Orleans, LA, United States.
[Published Version] View | Files available | arXiv
 

2022 | Published | Conference Paper | IST-REx-ID: 18876 | OA
Kocsis, P., Súkeník, P., Brasó, G., Niessner, M., Leal-Taixé, L., & Elezi, I. (2022). The unreasonable effectiveness of fully-connected layers for low-data regimes. In 36th Conference on Neural Information Processing Systems (Vol. 35, pp. 1896–1908). New Orleans, LA, United States: Neural Information Processing Systems Foundation.
[Published Version] View | Files available | arXiv
 

2021 | Published | Conference Paper | IST-REx-ID: 11453 | OA
Braun, L., & Vogels, T. P. (2021). Online learning of neural computations from sparse temporal feedback. In Advances in Neural Information Processing Systems - 35th Conference on Neural Information Processing Systems (Vol. 20, pp. 16437–16450). Virtual, Online: Neural Information Processing Systems Foundation.
[Published Version] View | Download Published Version (ext.)
 

2021 | Published | Conference Paper | IST-REx-ID: 11452 | OA
Alimisis, F., Davies, P., Vandereycken, B., & Alistarh, D.-A. (2021). Distributed principal component analysis with limited communication. In Advances in Neural Information Processing Systems - 35th Conference on Neural Information Processing Systems (Vol. 4, pp. 2823–2834). Virtual, Online: Neural Information Processing Systems Foundation.
[Published Version] View | Download Published Version (ext.) | arXiv
 

2021 | Published | Conference Paper | IST-REx-ID: 10593 | OA
Mondelli, M., & Venkataramanan, R. (2021). PCA initialization for approximate message passing in rotationally invariant models. In 35th Conference on Neural Information Processing Systems (Vol. 35, pp. 29616–29629). Virtual: Neural Information Processing Systems Foundation.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2021 | Published | Conference Paper | IST-REx-ID: 10594 | OA
Nguyen, Q., Bréchet, P., & Mondelli, M. (2021). When are solutions connected in deep networks? In 35th Conference on Neural Information Processing Systems (Vol. 35). Virtual: Neural Information Processing Systems Foundation.
[Preprint] View | Download Preprint (ext.) | arXiv
 

2021 | Published | Conference Paper | IST-REx-ID: 11458 | OA
Krumes, A., Iofinova, E. B., Vladu, A., & Alistarh, D.-A. (2021). AC/DC: Alternating Compressed/DeCompressed training of deep neural networks. In 35th Conference on Neural Information Processing Systems (Vol. 34, pp. 8557–8570). Virtual, Online: Neural Information Processing Systems Foundation.
[Published Version] View | Files available | Download Published Version (ext.) | arXiv
 

2021 | Published | Conference Paper | IST-REx-ID: 11463 | OA
Frantar, E., Kurtic, E., & Alistarh, D.-A. (2021). M-FAC: Efficient matrix-free approximations of second-order information. In 35th Conference on Neural Information Processing Systems (Vol. 34, pp. 14873–14886). Virtual, Online: Neural Information Processing Systems Foundation.
[Published Version] View | Download Published Version (ext.) | arXiv
 

2021 | Published | Conference Paper | IST-REx-ID: 11464 | OA
Alistarh, D.-A., & Korhonen, J. (2021). Towards tight communication lower bounds for distributed optimisation. In 35th Conference on Neural Information Processing Systems (Vol. 34, pp. 7254–7266). Virtual, Online: Neural Information Processing Systems Foundation.
[Published Version] View | Download Published Version (ext.) | arXiv
 

2020 | Published | Conference Paper | IST-REx-ID: 9632 | OA
Singh, S. P., & Alistarh, D.-A. (2020). WoodFisher: Efficient second-order approximation for neural network compression (Vol. 33, pp. 18098–18109). Presented at the NeurIPS: Conference on Neural Information Processing Systems, Vancouver, Canada: Neural Information Processing Systems Foundation.
[Published Version] View | Download Published Version (ext.) | arXiv
 

2020 | Published | Conference Paper | IST-REx-ID: 9631 | OA
Aksenov, V., Alistarh, D.-A., & Korhonen, J. (2020). Scalable belief propagation via relaxed scheduling (Vol. 33, pp. 22361–22372). Presented at the NeurIPS: Conference on Neural Information Processing Systems, Vancouver, Canada: Neural Information Processing Systems Foundation.
[Published Version] View | Download Published Version (ext.) | arXiv
 

2020 | Published | Conference Paper | IST-REx-ID: 9633 | OA
Confavreux, B. J., Zenke, F., Agnes, E. J., Lillicrap, T., & Vogels, T. P. (2020). A meta-learning approach to (re)discover plasticity rules that carve a desired function into a neural network. In Advances in Neural Information Processing Systems (Vol. 33, pp. 16398–16408). Vancouver, Canada.
[Published Version] View | Files available | Download Published Version (ext.)
 

Filters and Search Terms

issn=1049-5258

Search

Filter Publications

Display / Sort

Citation Style: APA

Export / Embed