Jiale Chen
Graduate School
Alistarh Group
3 Publications
2025 | Published | Conference Paper | IST-REx-ID: 19877 |

MARLIN: Mixed-precision auto-regressive parallel inference on Large Language Models
E. Frantar, R.L. Castro, J. Chen, T. Hoefler, D.-A. Alistarh, in:, Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, Association for Computing Machinery, 2025, pp. 239–251.
[Published Version]
View
| Files available
| DOI
| arXiv
E. Frantar, R.L. Castro, J. Chen, T. Hoefler, D.-A. Alistarh, in:, Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, Association for Computing Machinery, 2025, pp. 239–251.
2025 | Published | Conference Paper | IST-REx-ID: 20032 |

Scalable mechanistic neural networks
J. Chen, D. Yao, A.A. Pervez, D.-A. Alistarh, F. Locatello, in:, 13th International Conference on Learning Representations, OpenReview, 2025, pp. 63716–63737.
[Published Version]
View
| Files available
| arXiv
J. Chen, D. Yao, A.A. Pervez, D.-A. Alistarh, F. Locatello, in:, 13th International Conference on Learning Representations, OpenReview, 2025, pp. 63716–63737.
2024 | Research Data Reference | IST-REx-ID: 19884 |

MARLIN: Mixed-precision auto-regressive parallel inference on Large Language Models
E. Frantar, R. Castro, J. Chen, T. Hoefler, D.-A. Alistarh, (2024).
[Published Version]
View
| Files available
| DOI
| Download Published Version (ext.)
E. Frantar, R. Castro, J. Chen, T. Hoefler, D.-A. Alistarh, (2024).
Grants
3 Publications
2025 | Published | Conference Paper | IST-REx-ID: 19877 |

MARLIN: Mixed-precision auto-regressive parallel inference on Large Language Models
E. Frantar, R.L. Castro, J. Chen, T. Hoefler, D.-A. Alistarh, in:, Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, Association for Computing Machinery, 2025, pp. 239–251.
[Published Version]
View
| Files available
| DOI
| arXiv
E. Frantar, R.L. Castro, J. Chen, T. Hoefler, D.-A. Alistarh, in:, Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, Association for Computing Machinery, 2025, pp. 239–251.
2025 | Published | Conference Paper | IST-REx-ID: 20032 |

Scalable mechanistic neural networks
J. Chen, D. Yao, A.A. Pervez, D.-A. Alistarh, F. Locatello, in:, 13th International Conference on Learning Representations, OpenReview, 2025, pp. 63716–63737.
[Published Version]
View
| Files available
| arXiv
J. Chen, D. Yao, A.A. Pervez, D.-A. Alistarh, F. Locatello, in:, 13th International Conference on Learning Representations, OpenReview, 2025, pp. 63716–63737.
2024 | Research Data Reference | IST-REx-ID: 19884 |

MARLIN: Mixed-precision auto-regressive parallel inference on Large Language Models
E. Frantar, R. Castro, J. Chen, T. Hoefler, D.-A. Alistarh, (2024).
[Published Version]
View
| Files available
| DOI
| Download Published Version (ext.)
E. Frantar, R. Castro, J. Chen, T. Hoefler, D.-A. Alistarh, (2024).