Training multilayer neural networks using fast global learning algorithm - least-squares and penalized optimization methods
Cho SY, Chow TWS
NEUROCOMPUTING 25 (1-3): 115-131 APR 1999
One of the major limitations of conventional learning algorithm is the slow convergence speed caused by the problem of local minima. This paper presents a novel heuristic approach for feedforward neural networks learning. The algorithm is based upon a hybrid type of the Least Squares (LS) and penalised optimisation methods. The LS method is used to provide a fast convergence speed, and a penalty approach is introduced to solve the problem of local minima. The penalty term, defined as discontinuous functions, such as Gaussian, Rayleigh and Laplace, superimposed into the specified error surface, provides a way of escape from the local minima when the convergence stalls. As a result, the learning performance are dramatically improved. The proper selection and adaptation for the penalty factor are also demonstrate the effect of the penalty terms and to ensure the convergence of the algorithm. In all the tested problems, the proposed algorithm yields excellent results in terms of convergence speed, avoidance of local minima and quality of solution.
Some of our obtained results are shown as below, but details can be referred to our paper published in Neurocomputing Journal.

Figure 1. Comparison results for the ability of converging to the global minimum to obtain the best solution for the different algorithms on the modified XOR problem.

a) error surface of a modified Ex-or problem (b) Contour plot and search trajectory by Gaussian approach
¡@


(c) Contour plot and search trajectory byRayleigh
approach (d) Contour plot and search trajectory by Laplace approach
Figure 2. 3-D error plot and 2-D contour plots with searching trajectories of the proposed algorithm for the modified XOR problem. In these figures shown, the convergence is stuck in the local minimum and then the trajectories can be pulled out from the local minimum by our penalty approaches to converge to the global minimum.
¡@
|
Algorithm
|
CPU time taken (mins) at RMSE=0.1 or 5000 iterations
|
Final RMSE
|
Classification accuracy
|
|
EBP
|
200.4
|
0.4271
|
below 60%
|
|
LM
|
117.7
|
0.0997
|
80.9%
|
|
SIMANN
|
171
|
0.3360
|
about 70%
|
|
GA
|
95
|
0.284
|
about 70%
|
|
proposed (Gaussian)
|
49.2
|
0.0994
|
83.1%
|
|
proposed (Rayleigh)
|
46.3
|
0.0998
|
84.4 %
|
|
proposed (Laplace)
|
132.7
|
0.1332
|
73.7%
|
Table 2. Simulation results for the two-spiral classification problem.
¡@
![]() |
![]() |
|
Figure 3a
|
Figure 3b
|
![]() |
![]() |
|
Figure 3c
|
Figure 3d
|
Figure 3. The output classification contour images for two-spiral problem created by (a) the SIMANN algorithm; (b) the Gaussian penalty approach; (c) the Rayleigh penalty approach; (d) the Laplace penalty approach.
¡@
Source Code : GNNv2-02. zip