Finite-size effects in on-line learning of multilayer neural networks

David Barber; David Saad; Peter Sollich

Finite-size effects in on-line learning of multilayer neural networks

David Barber, David Saad, Peter Sollich

Research output: Contribution to journal › Article › peer-review

Abstract

We complement recent advances in thermodynamic limit analyses of mean on-line gradient descent learning dynamics in multi-layer networks by calculating fluctuations possessed by finite dimensional systems. Fluctuations from the mean dynamics are largest at the onset of specialisation as student hidden unit weight vectors begin to imitate specific teacher vectors, increasing with the degree of symmetry of the initial conditions. In light of this, we include a term to stimulate asymmetry in the learning process, which typically also leads to a significant decrease in training time.

Original language	English
Pages (from-to)	151-156
Number of pages	6
Journal	Europhysics Letters
Volume	34
Issue number	2
Publication status	Published - Apr 1996

Bibliographical note

Copyright of EDP Sciences

Keywords

probability theory
stochastic processes
and statistics

Access to Document

Finite-size effects in on-line learning of multilayer neural networks
Copyright of EDP Sciences
Final published version, 606 KB

Cite this

@article{52d97381056e4951b9143688617835e9,

title = "Finite-size effects in on-line learning of multilayer neural networks",

abstract = "We complement recent advances in thermodynamic limit analyses of mean on-line gradient descent learning dynamics in multi-layer networks by calculating fluctuations possessed by finite dimensional systems. Fluctuations from the mean dynamics are largest at the onset of specialisation as student hidden unit weight vectors begin to imitate specific teacher vectors, increasing with the degree of symmetry of the initial conditions. In light of this, we include a term to stimulate asymmetry in the learning process, which typically also leads to a significant decrease in training time.",

keywords = "probability theory, stochastic processes, and statistics",

author = "David Barber and David Saad and Peter Sollich",

note = "Copyright of EDP Sciences",

year = "1996",

month = apr,

language = "English",

volume = "34",

pages = "151--156",

journal = "Europhysics Letters",

issn = "0295-5075",

publisher = "IOP Publishing Ltd.",

number = "2",

}

TY - JOUR

T1 - Finite-size effects in on-line learning of multilayer neural networks

AU - Barber, David

AU - Saad, David

AU - Sollich, Peter

N1 - Copyright of EDP Sciences

PY - 1996/4

Y1 - 1996/4

N2 - We complement recent advances in thermodynamic limit analyses of mean on-line gradient descent learning dynamics in multi-layer networks by calculating fluctuations possessed by finite dimensional systems. Fluctuations from the mean dynamics are largest at the onset of specialisation as student hidden unit weight vectors begin to imitate specific teacher vectors, increasing with the degree of symmetry of the initial conditions. In light of this, we include a term to stimulate asymmetry in the learning process, which typically also leads to a significant decrease in training time.

AB - We complement recent advances in thermodynamic limit analyses of mean on-line gradient descent learning dynamics in multi-layer networks by calculating fluctuations possessed by finite dimensional systems. Fluctuations from the mean dynamics are largest at the onset of specialisation as student hidden unit weight vectors begin to imitate specific teacher vectors, increasing with the degree of symmetry of the initial conditions. In light of this, we include a term to stimulate asymmetry in the learning process, which typically also leads to a significant decrease in training time.

KW - probability theory

KW - stochastic processes

KW - and statistics

UR - http://iopscience.iop.org/0295-5075/34/2/151/fulltext?ejredirect=.iopscience

M3 - Article

SN - 0295-5075

VL - 34

SP - 151

EP - 156

JO - Europhysics Letters

JF - Europhysics Letters

IS - 2

ER -

Finite-size effects in on-line learning of multilayer neural networks

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this