Please note that ISTA Research Explorer no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

1 Publication


2025 | Published | Conference Paper | IST-REx-ID: 19877 | OA
E. Frantar, R. L. Castro, J. Chen, T. Hoefler, and D.-A. Alistarh, “MARLIN: Mixed-precision auto-regressive parallel inference on Large Language Models,” in Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, Las Vegas, NV, United States, 2025, pp. 239–251.
[Published Version] View | Files available | DOI | arXiv
 

Filters and Search Terms

isbn=9798400714436

Search

Filter Publications

  • Display / Sort

    Citation Style: IEEE

    Export / Embed