Phys. Rev. E 72, 026117 (2005) [11 pages]Efficient training of multilayer perceptrons using principal component analysisReceived 15 March 2005; published 16 August 2005 A training algorithm for multilayer perceptrons is discussed and studied in detail, which relates to the technique of principal component analysis. The latter is performed with respect to a correlation matrix computed from the example inputs and their target outputs. Typical properties of the training procedure are investigated by means of a statistical physics analysis in models of learning regression and classification tasks. We demonstrate that the procedure requires by far fewer examples for good generalization than traditional online training. For networks with a large number of hidden units we derive the training prescription which achieves, within our model, the optimal generalization behavior. © 2005 The American Physical Society URL:
http://link.aps.org/doi/10.1103/PhysRevE.72.026117
DOI:
10.1103/PhysRevE.72.026117
PACS:
84.35.+i, 89.90.+n, 64.60.−i, 87.19.La
|
