Volume 22, Issue 14 pp. 90-100
Article
Full Access

Phoneme recognition with elliptic discrimination neural units

Noboru Kanedera

Noboru Kanedera

Ishikawa College of Technology, Ishikawa, Japan 929-03

Noboru Kanedera received a B.E. in Electronic Engineering from the University of Electro-communications, Tokyo, Japan, in 1985 and an M.E. in ElectronicEngineering from the University of Tokyo, Tokyo, in 1987. He is currently an Instructor at Ishikawa National College of Technology. His research interests include continuous speech recognitionand neural networks.

Search for more papers by this author
Tetsuo Funada

Tetsuo Funada

Faculty of Technology, Kanazawa University, Kanazawa, Japan 920

Tetsuo Funada received a B.E. in Electronic Engineering from Kanazawa University, Kanazawa, Japan, in 1966, and an M.E. and a Dr. of Eng. degree in Electrical Engineering from Nagoya University, Nagoya, Japan, in 1968 and 1974, respectively. He is currently an Associate Professor on the Faculty of Technology at Kanazawa University. His research interests include speech pitch extraction, continuous speech recognition, and speech coding.

Search for more papers by this author

Abstract

Many researchers achieved high phoneme recognition rates by multilayered neural networks with linear discrimination neural (LDN) units. However, it is difficult to analyze which components of the input are important to each unit in those LDN networks.

This paper proposed a multilayer neural network with elliptic discrimination neural (EDN) units so that the functions of each unit in the network may be interpreted more definitely. The center of the elliptic discrimination boundary of a neural unit corresponds to a typical point in an input space. The radii of the ellipse express the extent of the corresponding components in the input space, hence it becomes clear which components of the input space are important to each unit in the EDN network.

To compare the performance of EDN and LDN networks, recognition experiments of phonemes /b, d, g/ in 5240 tokens of a Japanese speech database were carried out. In the experiments, recognition rates were obtained by EDN networks as high as the rate by an LDN network. Also, it was confirmed which components of the input space are important to each unit in the EDN network.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.