3 Publications

[3]
2025 | Published | Conference Paper | IST-REx-ID: 19877 | OA
Frantar, Elias, Roberto L. Castro, Jiale Chen, Torsten Hoefler, and Dan-Adrian Alistarh. “MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models.” In Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 239–51. Association for Computing Machinery, 2025. https://doi.org/10.1145/3710848.3710871.
 
[2]
2025 | Published | Conference Paper | IST-REx-ID: 20032 | OA
Chen, Jiale, Dingling Yao, Adeel A. Pervez, Dan-Adrian Alistarh, and Francesco Locatello. “Scalable Mechanistic Neural Networks.” In 13th International Conference on Learning Representations, 63716–37. OpenReview, 2025.
 
[1]
2024 | Research Data Reference | IST-REx-ID: 19884 | OA
Frantar, Elias, Roberto Castro, Jiale Chen, Torsten Hoefler, and Dan-Adrian Alistarh. “MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models.” Zenodo, 2024. https://doi.org/10.5281/ZENODO.14213091.
 
