Volume 12, Issue 11, pp. 1009–1028
Research Article

Gradient-based parameter optimization for systems containing discrete-valued functions

Edward Wilson (Corresponding Author)

Intellization, 454 Barkentine Lane, Redwood Shores, CA 94065, U.S.A.

This work was performed as part of E. Wilson's doctoral research at the Stanford Aerospace Robotics Laboratory, Department of Aeronautics and Astronautics, Stanford University, Stanford, CA 94305, U.S.A.
Stephen M. Rock

Aerospace Robotics Laboratory, Department of Aeronautics and Astronautics, Stanford University, Stanford, CA 94305, U.S.A.
First published: 03 September 2002

Abstract

Gradient-based parameter optimization is commonly used for training neural networks and optimizing the performance of other complex systems that contain only continuously differentiable functions. However, there is a large class of important parameter optimization problems involving systems containing discrete-valued functions that do not permit the direct use of gradient-based methods. Examples include optimization of control systems containing discrete-level actuators such as on/off devices, systems with discrete-valued inputs and outputs, discrete-decision-making systems (accept/reject), and neural networks built with signums (also known as hard-limiters or Heaviside step functions) rather than sigmoids. Even if most of the system is continuously differentiable, the presence of one or more discrete-valued functions prevents gradient-based optimization from being used directly. A new algorithm, ‘noisy backpropagation,’ is developed here as an extension of backpropagation that solves this problem, extending gradient-based parameter optimization to systems containing discrete-valued functions. Moreover, the modification to backpropagation is small, requiring only (1) replacement of the discrete-valued functions with continuously differentiable approximations, and (2) injection of noise into the smooth approximating functions on the forward sweep during training. Noise injection is the key to reducing the round-off error created when the discrete-valued functions are restored after training. This generic approach is applicable whenever gradient-based parameter optimization is used with systems containing discrete-valued functions; it is not limited to training neural networks. The examples in this paper demonstrate the use of noisy backpropagation in training two different multi-layer signum networks and in training a neural network for a control problem involving on/off actuators. This final example includes implementation on a laboratory model of a ‘free-flying space robot’ to validate the realizability and practical utility of the method. Copyright © 2002 John Wiley & Sons, Ltd.
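The two-step modification described in the abstract lends itself to a compact illustration. The following is a minimal sketch, not the paper's implementation: it assumes a single-layer network with a squared-error loss, uses tanh as the continuously differentiable approximation of the signum, and injects zero-mean Gaussian noise into the pre-activation on the forward sweep. The names (smooth_signum, train_step) and the noise level noise_std are illustrative choices only.

```python
import numpy as np

rng = np.random.default_rng(0)

def signum(x):
    # Discrete-valued activation restored after training (hard-limiter).
    return np.where(x >= 0.0, 1.0, -1.0)

def smooth_signum(x):
    # Step 1: continuously differentiable approximation of the signum.
    return np.tanh(x)

def smooth_signum_grad(x):
    # Derivative of the smooth approximation, used on the backward sweep.
    return 1.0 - np.tanh(x) ** 2

def train_step(W, x, target, lr=0.1, noise_std=0.5):
    # Step 2: inject noise on the forward sweep only; the backward sweep
    # is ordinary backpropagation through the smooth approximation.
    a = W @ x + noise_std * rng.standard_normal(W.shape[0])
    y = smooth_signum(a)
    # Gradient of the squared-error loss 0.5 * ||y - target||^2.
    err = y - target
    grad_W = np.outer(err * smooth_signum_grad(a), x)
    return W - lr * grad_W

# After training, smooth_signum is swapped back for signum at run time;
# the noise injected during training is what keeps that swap from
# introducing large round-off error.
W = rng.standard_normal((2, 3))
x, target = np.array([1.0, -0.5, 0.2]), np.array([1.0, -1.0])
for _ in range(100):
    W = train_step(W, x, target)
print(signum(W @ x))
```

Training with the injected noise tends to drive pre-activations away from the region where the smooth approximation and the discrete function disagree, which is why the final replacement of tanh by the signum costs little accuracy.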
