The inductive bias of ReLU networks on orthogonally separable data

Bui Thi Mai, Phuong; Lampert, Christoph

The inductive bias of ReLU networks on orthogonally separable data

Phuong M, Lampert C. 2021. The inductive bias of ReLU networks on orthogonally separable data. 9th International Conference on Learning Representations. ICLR: International Conference on Learning Representations.

Download

iclr2021_conference.pdf 502.36 KB [Published Version]

Download (ext.)

https://openreview.net/pdf?id=krz7T0xU9Z_ [Published Version]

Conference Paper | Published | English

Scopus indexed

Author

Phuong, Mary^ISTA; Lampert , Christoph^ISTA

Corresponding author has ISTA affiliation

Department

Graduate School
Lampert Group

Abstract

We study the inductive bias of two-layer ReLU networks trained by gradient flow. We identify a class of easy-to-learn (`orthogonally separable') datasets, and characterise the solution that ReLU networks trained on such datasets converge to. Irrespective of network width, the solution turns out to be a combination of two max-margin classifiers: one corresponding to the positive data subset and one corresponding to the negative data subset. The proof is based on the recently introduced concept of extremal sectors, for which we prove a number of properties in the context of orthogonal separability. In particular, we prove stationarity of activation patterns from some time onwards, which enables a reduction of the ReLU network to an ensemble of linear subnetworks.

Publishing Year

2021

Date Published

2021-05-01

Proceedings Title

9th International Conference on Learning Representations

Conference

ICLR: International Conference on Learning Representations

Conference Location

Virtual

Conference Date

2021-05-03 – 2021-05-07

IST-REx-ID

9416

Cite this

Phuong M, Lampert C. The inductive bias of ReLU networks on orthogonally separable data. In: 9th International Conference on Learning Representations. ; 2021.

Phuong, M., & Lampert, C. (2021). The inductive bias of ReLU networks on orthogonally separable data. In 9th International Conference on Learning Representations. Virtual.

Phuong, Mary, and Christoph Lampert. “The Inductive Bias of ReLU Networks on Orthogonally Separable Data.” In 9th International Conference on Learning Representations, 2021.

M. Phuong and C. Lampert, “The inductive bias of ReLU networks on orthogonally separable data,” in 9th International Conference on Learning Representations, Virtual, 2021.

Phuong, Mary, and Christoph Lampert. “The Inductive Bias of ReLU Networks on Orthogonally Separable Data.” 9th International Conference on Learning Representations, 2021.

All files available under the following license(s):

Copyright Statement: