The role of biases in on-line learning of two-layer networks

Ansgar H.L. West; David Saad

doi:10.1103/PhysRevE.57.3265

The role of biases in on-line learning of two-layer networks

Ansgar H.L. West, David Saad

Research output: Contribution to journal › Article › peer-review

Abstract

The influence of biases on the learning dynamics of a two-layer neural network, a normalized soft-committee machine, is studied for on-line gradient descent learning. Within a statistical mechanics framework, numerical studies show that the inclusion of adjustable biases dramatically alters the learning dynamics found previously. The symmetric phase which has often been predominant in the original model all but disappears for a non-degenerate bias task. The extended model furthermore exhibits a much richer dynamical behavior, e.g. attractive suboptimal symmetric phases even for realizable cases and noiseless data.

Original language	English
Pages (from-to)	3265-3291
Number of pages	27
Journal	Physical Review E
Volume	57
Issue number	3
DOIs	https://doi.org/10.1103/PhysRevE.57.3265
Publication status	Published - Mar 1998

Bibliographical note

Copyright of the American Physical Society

Keywords

learning dynamics
two-layer neural network
soft-committee machine
on-line gradient descent learning

Access to Document

10.1103/PhysRevE.57.3265

The role of biases in on-line learning of two-layer networks
Copyright of the American Physical Society
Accepted author manuscript, 915 KB

Cite this

@article{bcc8dd3d233a4c999f9c26297187f51b,

title = "The role of biases in on-line learning of two-layer networks",

abstract = "The influence of biases on the learning dynamics of a two-layer neural network, a normalized soft-committee machine, is studied for on-line gradient descent learning. Within a statistical mechanics framework, numerical studies show that the inclusion of adjustable biases dramatically alters the learning dynamics found previously. The symmetric phase which has often been predominant in the original model all but disappears for a non-degenerate bias task. The extended model furthermore exhibits a much richer dynamical behavior, e.g. attractive suboptimal symmetric phases even for realizable cases and noiseless data.",

keywords = "learning dynamics, two-layer neural network, soft-committee machine, on-line gradient descent learning",

author = "West, {Ansgar H.L.} and David Saad",

note = "Copyright of the American Physical Society",

year = "1998",

month = mar,

doi = "10.1103/PhysRevE.57.3265",

language = "English",

volume = "57",

pages = "3265--3291",

journal = "Physical Review E",

issn = "1539-3755",

publisher = "American Physical Society",

number = "3",

}

TY - JOUR

T1 - The role of biases in on-line learning of two-layer networks

AU - West, Ansgar H.L.

AU - Saad, David

N1 - Copyright of the American Physical Society

PY - 1998/3

Y1 - 1998/3

N2 - The influence of biases on the learning dynamics of a two-layer neural network, a normalized soft-committee machine, is studied for on-line gradient descent learning. Within a statistical mechanics framework, numerical studies show that the inclusion of adjustable biases dramatically alters the learning dynamics found previously. The symmetric phase which has often been predominant in the original model all but disappears for a non-degenerate bias task. The extended model furthermore exhibits a much richer dynamical behavior, e.g. attractive suboptimal symmetric phases even for realizable cases and noiseless data.

AB - The influence of biases on the learning dynamics of a two-layer neural network, a normalized soft-committee machine, is studied for on-line gradient descent learning. Within a statistical mechanics framework, numerical studies show that the inclusion of adjustable biases dramatically alters the learning dynamics found previously. The symmetric phase which has often been predominant in the original model all but disappears for a non-degenerate bias task. The extended model furthermore exhibits a much richer dynamical behavior, e.g. attractive suboptimal symmetric phases even for realizable cases and noiseless data.

KW - learning dynamics

KW - two-layer neural network

KW - soft-committee machine

KW - on-line gradient descent learning

UR - http://prola.aps.org/pdf/PRE/v57/i3/p3265_1

U2 - 10.1103/PhysRevE.57.3265

DO - 10.1103/PhysRevE.57.3265

M3 - Article

SN - 1539-3755

VL - 57

SP - 3265

EP - 3291

JO - Physical Review E

JF - Physical Review E

IS - 3

ER -

The role of biases in on-line learning of two-layer networks

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this