Jiale Chen
Graduate School
Alistarh Group
3 Publications
2025 | Published | Conference Paper | IST-REx-ID: 19877 |

Frantar, Elias, et al. “MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models.” Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, Association for Computing Machinery, 2025, pp. 239–51, doi:10.1145/3710848.3710871.
[Published Version]
View
| Files available
| DOI
| arXiv
2025 | Published | Conference Paper | IST-REx-ID: 20032 |

Chen, Jiale, et al. “Scalable Mechanistic Neural Networks.” 13th International Conference on Learning Representations, OpenReview, 2025, pp. 63716–37.
[Published Version]
View
| Files available
| arXiv
2024 | Research Data Reference | IST-REx-ID: 19884 |

Frantar, Elias, et al. MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models. Zenodo, 2024, doi:10.5281/ZENODO.14213091.
[Published Version]
View
| Files available
| DOI
| Download Published Version (ext.)
Grants
3 Publications
2025 | Published | Conference Paper | IST-REx-ID: 19877 |

Frantar, Elias, et al. “MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models.” Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, Association for Computing Machinery, 2025, pp. 239–51, doi:10.1145/3710848.3710871.
[Published Version]
View
| Files available
| DOI
| arXiv
2025 | Published | Conference Paper | IST-REx-ID: 20032 |

Chen, Jiale, et al. “Scalable Mechanistic Neural Networks.” 13th International Conference on Learning Representations, OpenReview, 2025, pp. 63716–37.
[Published Version]
View
| Files available
| arXiv
2024 | Research Data Reference | IST-REx-ID: 19884 |

Frantar, Elias, et al. MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models. Zenodo, 2024, doi:10.5281/ZENODO.14213091.
[Published Version]
View
| Files available
| DOI
| Download Published Version (ext.)