Efficient identification of wide shallow neural networks with biases

Fornasier, Massimo; Klock, Timo; Mondelli, Marco; Rauchensteiner, Michael

Efficient identification of wide shallow neural networks with biases

Fornasier M, Klock T, Mondelli M, Rauchensteiner M. 2025. Efficient identification of wide shallow neural networks with biases. Applied and Computational Harmonic Analysis. 77, 101749.

Download

2025_ApplCompAnalysis_Fornasier.pdf 2.22 MB [Published Version]

DOI

10.1016/j.acha.2025.101749

Journal Article | Published | English

Scopus indexed

Author

Fornasier, Massimo; Klock, Timo; Mondelli, Marco^ISTA ; Rauchensteiner, Michael

Corresponding author has ISTA affiliation

Department

Mondelli Group

Abstract

The identification of the parameters of a neural network from finite samples of input-output pairs is often referred to as the teacher-student model, and this model has represented a popular framework for understanding training and generalization. Even if the problem is NP-complete in the worst case, a rapidly growing literature – after adding suitable distributional assumptions – has established finite sample identification of two-layer networks with a number of neurons (math. formula), D being the input dimension. For the range (math. formula) the problem becomes harder, and truly little is known for networks parametrized by biases as well. This paper fills the gap by providing efficient algorithms and rigorous theoretical guarantees of finite sample identification for such wider shallow networks with biases. Our approach is based on a two-step pipeline: first, we recover the direction of the weights, by exploiting second order information; next, we identify the signs by suitable algebraic evaluations, and we recover the biases by empirical risk minimization via gradient descent. Numerical results demonstrate the effectiveness of our approach.

Publishing Year

2025

Date Published

2025-06-01

Journal Title

Applied and Computational Harmonic Analysis

Publisher

Elsevier

Volume

Article Number

101749

ISSN

1063-5203

eISSN

1096-603X

IST-REx-ID

19065

Cite this

Fornasier M, Klock T, Mondelli M, Rauchensteiner M. Efficient identification of wide shallow neural networks with biases. Applied and Computational Harmonic Analysis. 2025;77. doi:10.1016/j.acha.2025.101749

Fornasier, M., Klock, T., Mondelli, M., & Rauchensteiner, M. (2025). Efficient identification of wide shallow neural networks with biases. Applied and Computational Harmonic Analysis. Elsevier. https://doi.org/10.1016/j.acha.2025.101749

Fornasier, Massimo, Timo Klock, Marco Mondelli, and Michael Rauchensteiner. “Efficient Identification of Wide Shallow Neural Networks with Biases.” Applied and Computational Harmonic Analysis. Elsevier, 2025. https://doi.org/10.1016/j.acha.2025.101749.

M. Fornasier, T. Klock, M. Mondelli, and M. Rauchensteiner, “Efficient identification of wide shallow neural networks with biases,” Applied and Computational Harmonic Analysis, vol. 77. Elsevier, 2025.

Fornasier M, Klock T, Mondelli M, Rauchensteiner M. 2025. Efficient identification of wide shallow neural networks with biases. Applied and Computational Harmonic Analysis. 77, 101749.

Fornasier, Massimo, et al. “Efficient Identification of Wide Shallow Neural Networks with Biases.” Applied and Computational Harmonic Analysis, vol. 77, 101749, Elsevier, 2025, doi:10.1016/j.acha.2025.101749.

All files available under the following license(s):

Creative Commons Attribution 4.0 International Public License (CC-BY 4.0):