Speech Recognition
Lalit Bahl, Raimo Bakis, Subrata Das, Michael Picheny
Thomas J. Watson Research Center, Yorktown Heights, NY
First published: 27 December 1999
Abstract
The sections in this article are
- 1 Fundamentals
- 2 Detailed Theory
- 3 Exploratory Work
- 4 Testing Speech Recognition Systems
- 5 Application Programming Interfaces
Bibliography
- 1 V. Zue, Conversational interfaces: Advances and challenges, in Proc. Eurospeech '97, September 1997, pp. 9–18.
- 2 L. R. Bahl, F. Jelinek, R. L. Mercer, A maximum likelihood approach to continuous speech recognition, IEEE Trans. Pattern Anal. Mach. Intell., 5: 179–190, 1983.
- 3 L. R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition, Proc. IEEE, 77: 257–286, 1989.
- 4 J. Picone, Continuous speech recognition using hidden Markov models, IEEE ASSP Magazine, 26–41, July 1990.
- 5 L. R. Rabiner, B.-H. Juang, Fundamentals of Speech Recognition, Englewood Cliffs, NJ: Prentice-Hall, 1993.
- 6 J. R. Deller, Jr., J. G. Proakis, J. H. L. Hansen, Discrete-time Processing of Speech Signals, New York: Macmillan Publishing Co., 1993.
- 7 C.-H. Lee, F. K. Soong, K. K. Paliwal (eds.), Automatic Speech and Speaker Recognition: Advanced Topics, Boston: Kluwer Academic Publishers, 1996. doi:10.1007/978-1-4613-1367-0
- 8 R. Ramachandran, R. Mammone (eds.), Modern Methods of Speech Processing, Boston: Kluwer Academic Publishers, 1995. doi:10.1007/978-1-4615-2281-2
- 9 A. Averbuch et al., An IBM PC based large-vocabulary isolated-utterance speech recognizer, Proc. IEEE Int. Conf. Acoust., Speech Signal Process., 1: 53–56, April 1986.
- 10 P. Price, Evaluation of spoken language systems: The ATIS domain, Proc. DARPA Speech Nat. Language Workshop, Morgan Kaufmann Publishers, pp. 91–95, June 1990.
- 11 Y. Tohkura, K. Aikawa, Cepstral analysis of speech, in J. G. Webster (ed.), Encyclopedia of Electrical and Electronics Engineering, New York: Wiley, 1999.
- 12 S. Furui, Speaker-independent isolated word recognition using dynamic features of speech spectrum, IEEE Trans. Acoust., Speech Signal Process., 34: 52–59, 1986.
- 13 L. E. Baum, An inequality and associated maximization technique in statistical estimation of probabilistic functions of Markov processes, Inequalities, 3 (1): 1–8, 1972.
- 14 F. Jelinek, A fast sequential decoding algorithm using a stack, IBM J. Res. Development, 13: 675–685, November 1969. doi:10.1147/rd.136.0675
- 15 D. P. Morgan, C. L. Scofield, Neural Networks and Speech Processing, Boston: Kluwer Academic Publishers, 1991. doi:10.1007/978-1-4615-3950-6
- 16 S. R. Hyde, Automatic speech recognition: A critical survey and discussion of the literature, in N. R. Dixon and T. B. Martin (eds.), Automatic Speech and Speaker Recognition, New York: IEEE Press, 1979, pp. 16–55.
- 17 L. Bahl, F. Jelinek, Decoding for channels with insertions, deletions and substitutions with applications to speech recognition, IEEE Trans. Inf. Theory, 21: 404–411, July 1975.
- 17a G. Cook et al., Transcription of broadcast television and radio news: The 1996 ABBOT system, in Proc. Speech Recognition Workshop, February 1997, pp. 79–84.
- 18 J. Baker, The Dragon system—an overview, IEEE Trans. Acoust. Speech Signal Process., 23: 24–29, February 1975.
- 19 K.-F. Lee, Automatic Speech Recognition—The Development of the Sphinx System, Boston: Kluwer Academic Publishers, 1989. doi:10.1007/978-1-4615-3650-5
- 20 A. Viterbi, Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, IEEE Trans. Inf. Theory, 13: 260–269, 1967.
- 21 G. D. Forney, Jr., The Viterbi algorithm, Proc. IEEE, 61: 268–278, 1973.
- 22 N. Morgan, H. A. Bourlard, Neural networks for statistical recognition of continuous speech, Proc. IEEE, 83: 742–770, 1995.
- 23 M. Ostendorf, V. Digalakis, O. Kimball, From HMM's to segment models: A unified view of statistical modeling for speech recognition, IEEE Trans. Speech Audio Process., 4: 360–378, 1996.
- 24 A. Kannan, M. Ostendorf, A comparison of trajectory and mixture modeling in segment-based word recognition, in Proc. IEEE ICASSP, vol. 2, April 1993, pp. 327–330.
- 25 M. Ostendorf et al., Continuous word recognition based on the stochastic segment model, in Proc. DARPA Workshop CSR, 1992.
- 26 O. Ghitza, M. Sondhi, Hidden Markov models with templates as nonstationary states: An application to speech recognition, Comput. Speech Language, 2: 101–119, 1993. doi:10.1006/csla.1993.1005
- 27 G. Zavaliagkos et al., A hybrid segmental neural net/hidden Markov model system for continuous speech recognition, IEEE Trans. Speech Audio Process., 2: 151–160, 1994.
- 28 A. Acero, Acoustical and Environmental Robustness in Automatic Speech Recognition, Norwell, MA: Kluwer Academic, 1993. doi:10.1007/978-1-4615-3122-7
- 29 M. J. F. Gales, S. J. Young, Cepstral parameter compensation for HMM recognition in noise, Speech Commun., 12: 231–239, July 1993.
- 30 J. Garofolo, J. Fiscus, W. Fisher, Design and preparation of the 1996 Hub-4 broadcast news benchmark test corpora, in Proc. Speech Recognition Workshop, February 1997, pp. 15–21.
- 31 http://www.amudsen.com/mstdg/book/mstdgbook.htm.
- 32 http://www.srapi.com.
- 33 http://www.software.ibm.com/voicetype/dev_home.html.
Wiley Encyclopedia of Electrical and Electronics Engineering