Diyuan Wu
3 Publications
2025 |
Published |
Conference Paper |
IST-REx-ID: 21326 |
Neural collapse beyond the unconstrained features model: Landscape, dynamics, and generalization in the mean-field regime
D. Wu, M. Mondelli, in:, Proceedings of the 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 67499–67536.
[Published Version]
View
| Files available
| arXiv
D. Wu, M. Mondelli, in:, Proceedings of the 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 67499–67536.
2024 |
Published |
Conference Paper |
IST-REx-ID: 19518 |
The iterative optimal brain surgeon: Faster sparse recovery by leveraging second-order information
D. Wu, I.-V. Modoranu, M. Safaryan, D. Kuznedelev, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
D. Wu, I.-V. Modoranu, M. Safaryan, D. Kuznedelev, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
2023 |
Published |
Conference Paper |
IST-REx-ID: 14924 |
Mean-field analysis for heavy ball methods: Dropout-stability, connectivity, and global convergence
D. Wu, V. Kungurtsev, M. Mondelli, in:, Transactions on Machine Learning Research, ML Research Press, 2023.
[Published Version]
View
| Download Published Version (ext.)
| arXiv
D. Wu, V. Kungurtsev, M. Mondelli, in:, Transactions on Machine Learning Research, ML Research Press, 2023.
Grants
3 Publications
2025 |
Published |
Conference Paper |
IST-REx-ID: 21326 |
Neural collapse beyond the unconstrained features model: Landscape, dynamics, and generalization in the mean-field regime
D. Wu, M. Mondelli, in:, Proceedings of the 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 67499–67536.
[Published Version]
View
| Files available
| arXiv
D. Wu, M. Mondelli, in:, Proceedings of the 42nd International Conference on Machine Learning, ML Research Press, 2025, pp. 67499–67536.
2024 |
Published |
Conference Paper |
IST-REx-ID: 19518 |
The iterative optimal brain surgeon: Faster sparse recovery by leveraging second-order information
D. Wu, I.-V. Modoranu, M. Safaryan, D. Kuznedelev, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
[Preprint]
View
| Download Preprint (ext.)
| arXiv
D. Wu, I.-V. Modoranu, M. Safaryan, D. Kuznedelev, D.-A. Alistarh, in:, 38th Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, 2024.
2023 |
Published |
Conference Paper |
IST-REx-ID: 14924 |
Mean-field analysis for heavy ball methods: Dropout-stability, connectivity, and global convergence
D. Wu, V. Kungurtsev, M. Mondelli, in:, Transactions on Machine Learning Research, ML Research Press, 2023.
[Published Version]
View
| Download Published Version (ext.)
| arXiv
D. Wu, V. Kungurtsev, M. Mondelli, in:, Transactions on Machine Learning Research, ML Research Press, 2023.