March 1, 1989

Phoneme recognition using time-delay neural networks

Abstract

The authors present a time-delay neural network (TDNN) approach to phoneme recognition which is characterized by two important properties: (1) using a three-layer arrangement of simple computing units, a hierarchy can be constructed that allows for the formation of arbitrary nonlinear decision surfaces, which the TDNN learns automatically using error backpropagation; and (2) the time-delay arrangement enables the network to discover acoustic-phonetic features and the temporal relationships between them independently of position in time and therefore not blurred by temporal shifts in the input. As a recognition task, the speaker-dependent recognition of the phonemes B, D, and G in varying phonetic contexts was chosen. For comparison, several discrete hidden Markov models (HMM) were trained to perform the same task. Performance evaluation over 1946 testing tokens from three speakers showed that the TDNN achieves a recognition rate of 98.5% correct while the rate obtained by the best of the HMMs was only 93.7%.
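
The time-delay idea in property (2) can be illustrated with a small sketch. This is not the authors' network: it is a single hypothetical unit in plain Python, assuming that each unit sees a short window of consecutive frames and that the same weights are reused at every time step. All names (time_delay_layer, shift_in_time), the weights, and the toy input are invented for illustration, and summing the activations over time is used here only as a simple stand-in for a time-integrating output stage.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def time_delay_layer(frames, weights, bias):
    """Apply one time-delay unit to a sequence of feature frames.

    frames:  list of T frames, each a list of F feature values
    weights: list of D rows (one per delay), each a list of F weights
    The same weights are reused at every time step (weight sharing over time),
    so the result has T - D + 1 activations.
    """
    n_delays = len(weights)
    outputs = []
    for t in range(len(frames) - n_delays + 1):
        total = bias
        for d in range(n_delays):
            for w, x in zip(weights[d], frames[t + d]):
                total += w * x
        outputs.append(sigmoid(total))
    return outputs

def shift_in_time(frames, k):
    """Delay the sequence by k frames, padding the start with zeros (silence)."""
    n_features = len(frames[0])
    silence = [[0.0] * n_features for _ in range(k)]
    return silence + frames[:len(frames) - k]

# Toy input (an assumption, not data from the paper): 8 frames of 2 features,
# with a short "event" in frames 2 and 3 and silence elsewhere.
frames = [
    [0.0, 0.0], [0.0, 0.0], [1.0, 0.2], [0.3, 0.9],
    [0.0, 0.0], [0.0, 0.0], [0.0, 0.0], [0.0, 0.0],
]
weights = [[0.5, -0.4], [0.8, 0.1]]  # 2 delays x 2 features, arbitrary values
bias = -0.2

original = time_delay_layer(frames, weights, bias)
shifted = time_delay_layer(shift_in_time(frames, 3), weights, bias)

# The activations for the shifted input are the original ones moved 3 steps later,
# so a score integrated over time is unchanged by the shift.
print([round(a, 3) for a in original])
print([round(a, 3) for a in shifted])
print(round(sum(original), 6), round(sum(shifted), 6))
```

Because the weights do not depend on the time index, moving the event later in the input only moves the unit's response later by the same amount. Any decision that pools the responses over time therefore does not depend on where the event occurred, provided the event stays fully inside the analysis window.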

Authors

Alexander Waibel

Toshiyuki Hanazawa

Geoffrey E. Hinton

Journals

IEEE Transactions on Acoustics, Speech, and Signal Processing

Institutions

University of Toronto

Carnegie Mellon University

Canadian Institute for Advanced Research
