Speaker Recognition
Joseph P. Campbell Jr.,
Joseph P. Campbell Jr.
The Johns Hopkins University, Ft. Meade, MD
Search for more papers by this authorJoseph P. Campbell Jr.,
Joseph P. Campbell Jr.
The Johns Hopkins University, Ft. Meade, MD
Search for more papers by this authorAbstract
The sections in this article are
- 1 Motivation
- 2 Problem Formulation
- 3 Overview
- 4 Previous Work
- 5 Speech Processing
- 6 Feature Selection and Measures
- 7 Pattern Matching
- 8 Classification and Decision Theory
- 9 A New Speaker Recognition System
- 10 Performance
- 11 Summary
Bibliography
- 1 B. S. Atal Automatic recognition of speakers from their voices, Proc. IEEE, 64: 460–475, 1976.
- 2 G. R. Doddington Speaker recognition—Identifying people by their voices, Proc. IEEE, 73: 1651–1664, 1985.
- 3 S. Furui Speaker-dependent-feature extraction, recognition and processing techniques, Speech Commun., 10: 505–520, 1991.
- 4 D. O'Shaughnessy Speech Communication, Human and Machine: Digital Signal Processing, Reading, MA: Addison-Wesley, 1987.
- 5 A. Rosenberg Automatic speaker verification: A review, Proc. IEEE, 64: 475–487, 1976.
- 6 A. E. Rosenberg F. K. Soong Recent research in automatic speaker recognition, in S. Furui and M. M. Sondhi (eds.), Advances in Speech Signal Processing, New York: Dekker, 1992, pp. 701–738.
- 7 A. Sutherland M. Jack Speaker verification, in M. Jack and J. Laver (eds.), Aspects of Speech Technology, Edinburgh, UK: Edinburgh Univ. Press, 1988, pp. 185–215.
- 8 R. Mammone X. Zhang R. Ramachandran Robust speaker recognition—A feature-based approach, IEEE Signal Process. Mag., 13 (5): 58–71, 1996.
- 9 D. Reynolds R. Rose Robust text-independent speaker identification using Gaussian mixture speaker models, IEEE Trans. Speech Audio Process., 3: 72–83, 1995.
- 10
A. Higgins
L. Bahler
J. Porter
Speaker verification using randomized phrase prompting,
Digital Signal Process.,
1 (2):
89–106,
1991.
10.1016/1051-2004(91)90098-6 Google Scholar
- 11 A. Martin M. Przybocki 1997 speaker recognition evaluation, in A. Martin (ed.), Speaker Recognition Workshop, Linthicum Heights, MD: Maritime Inst. of Technol., 1997, Sect. 2. Available ftp://jaguar.ncsl.nist.gov/speaker/ and http://www.nist.gov/itl/div894/894.01/
- 12 J. Campbell Testing with the YOHO CD-ROM voice verification corpus, Int. Conf. Acoust., Speech, Signal Process., Detroit, MI, 1995, pp. 341–344. Available http://www.biometrics.org/
- 13 B. S. Atal Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification, J. Acoust. Soc. Amer., 55: 1304–1312, 1974.
- 14 J. D. Markel S. B. Davis Text-independent speaker recognition from a large linguistically unconstrained time-spaced data base, IEEE Trans. Acoust. Speech Signal Process., ASSP-27: 74–82, 1979.
- 15 S. Furui Cepstral analysis technique for automatic speaker verification, IEEE Trans. Acoust. Speech Signal Process., ASSP-29: 254–272, 1981.
- 16 R. Schwartz S. Roucos M. Berouti The application of probability density estimation to text independent speaker identification, Int. Conf. Acoust., Speech, Signal Process., Paris, 1982, pp. 1649–1652.
- 17 K. P. Li E. H. Wrench, Jr. Text-independent speaker recognition with short utterances, Int. Conf. Acoust., Speech, Signal Process., Boston, 1983, pp. 555–558.
- 18 F. Soong et al. A vector quantization approach to speaker recognition, IEEE, Int. Conf. Acoust., Speech, Signal Process., Tampa, Florida, 1985, pp. 387–390.
- 19 A. L. Higgins R. E. Wohlford A new method of text-independent speaker recognition, Int. Conf. Acoust., Speech, Signal Process., Tokyo, 1986, pp. 869–872.
- 20 J. Attili M. Savic J. Campbell A TMS32020-based real time, text-independent, automatic speaker verification system, Int. Conf. Acoust., Speech, Signal Process., New York, 1988, pp. 599–602.
- 21 N. Z. Tishby On the application of mixture AR hidden Markov models to text independent speaker recognition, IEEE Trans. Acoust., Speech, Signal Process., 39: 563–570, 1991.
- 22 D. Reynolds Speaker identification and verification using Gaussian mixture speaker models, Speech Commun., 17 (1–2): 91–108, 1995.
- 23 D. Reynolds B. Carlson Text-dependent speaker verification using decoupled and integrated speaker and speech recognizers, EUROSPEECH ESCA, Madrid, 1995, pp. 647–650.
- 24 C. Che Q. Lin Speaker recognition using HMM with experiments on the YOHO database, EUROSPEECH, ESCA, Madrid, 1995, pp. 625–628.
- 25 J. Colombi et al. Cohort selection and word grammar effects for speaker recognition, IEEE, Int. Conf. Acoust., Speech, Signal Process., Atlanta, GA, 1996, pp. 85–88.
- 26 D. Reynolds M.I.T. Lincoln Laboratory site presentation, in A. Martin (ed.), Speaker Recognition Workshop, Linthicum Heights, MD: Maritime Inst. of Technol., 1996, Sect. 5. Available http://www.jaguar.ncsl.nist.gov/speaker/ and http://www.nist.gov/itl/div894/894.01/
- 27 A. E. Rosenberg et al. The use of cohort normalized scores for speaker verification, Int. Conf. Spoken Lang. Process., Banff, Univ. of Alberta, 1992, pp. 599–602.
- 28 H. Gish M. Schmidt Text-independent speaker identification, IEEE Signal Process. Mag., 11 (4): 18–32, 1994.
- 29 G. Papcun Commensurability among biometric systems: How to know when three apples probably equals seven oranges, Proc. Biometric Consortium, 9th Meet., Crystal City, VA, 1997. Available http://www.biometrics.org/
- 30 A. Higgins YOHO speaker verification, Speech Res. Symp., Baltimore, MD, 1990.
- 31
J. Flanagan
Speech Analysis Synthesis and Perception,
2nd ed., Berlin:
Springer-Verlag,
1972.
10.1007/978-3-662-01562-9 Google Scholar
- 32 T. Parsons Voice and speech processing, in S. Director (ed.), Communications and Signal Processing, New York: McGraw-Hill, 1987.
- 33 A. Pentz Speech Science (SPATH 4313) Class Notes, Stillwater: Oklahoma State Univ., 1990.
- 34 D. Plumpe Modeling of the glottal flow derivative waveform with application to speaker identification, M.S. thesis, Massachusetts Inst. of Technol., Cambridge, MA, 1997.
- 35 J. Makhoul Linear prediction: A tutorial review, Proc. IEEE, 63: 561–580, 1975.
- 36 F. J. Harris On the use of windows for harmonic analysis with the DFT, Proc. IEEE, 66: 51–83, 1978.
- 37 F. Itakura Line spectrum representation of linear predictive coefficients, Trans. Comm. on Speech Res., Acoust. Soc. Jpn., S75: 34, 1975.
- 38 S. Saito K. Nakata Fundamentals of Speech Signal Processing, Tokyo: Academic Press, 1985.
- 39 G. Kang L. Fransen Low Bit Speech Encoder Based on Line-Spectrum-Frequency, NRL Rep. 8857, Washington, DC: NRL, 1985.
- 40 L. Rabiner R. Schafer Digital processing of speech signals, in A. Oppenheim (ed.), Signal Processing, Englewood Cliffs, NJ: Prentice-Hall, 1978.
- 41
J. P. Campbell, Jr.
T. E. Tremain
V. C. Welch
The Federal Standard 1016 4800 bps CELP voice coder,
Digital Signal Process.,
1 (3):
145–155,
1991.
10.1016/1051-2004(91)90106-U Google Scholar
- 42 L. Rabiner B.-H. Juang Fundamentals of speech recognition, in A. Oppenheim (ed.), Signal Processing, Englewood Cliffs, NJ Prentice-Hall, 1993.
- 43
R. Gnanadesikan
J. R. Kettenring
Discriminant analysis and clustering,
Stat. Sci.,
4 (1):
34–69,
1989.
10.1214/ss/1177012666 Google Scholar
- 44 R. Duda P. Hart Pattern Classification and Scene Analysis, New York: Wiley, 1973.
- 45 J. Tou R. Gonzalez Pattern recognition principles, in R. Kalaba (ed.), Applied Mathematics and Computation, Reading, MA: Addison-Wesley, 1974.
- 46 K. Fukunaga Introduction to statistical pattern recognition, in W. Rheinboldt and D. Siewiorek (eds.), Computer Science and Scientific Computing, 2nd ed., San Diego, CA: Academic Press, 1990.
- 47 S. Kullback Information Theory and Statistics, New York: Dover, 1968.
- 48 R. E. Blahut Principles and Practice of Information Theory: Electrical and Computer Engineering, Reading, MA: Addison-Wesley, 1987.
- 49 S. Kullback R. Leibler On information and sufficiency, Ann. Math. Stat., 22: 79–86, 1951.
- 50 J. Tou P. Heydorn Some approaches to optimum feature extraction, in J. Tou (ed.), Computer and Information Sciences-II, New York: Academic Press, 1967, pp. 57–89.
- 51 M. Basseville Distance measures for signal processing and pattern recognition, Signal Process., 18: 349–369, 1989.
- 52 P. A. Devijver On a new class of bounds on Bayes risk in multihypothesis pattern recognition, IEEE Trans. Comput., C-23: 70–80, 1974.
- 53 T. Kailath The divergence and Bhattacharyya distance measures in signal selection, IEEE Trans. Commun. Technol., 15: 52–60, 1967.
- 54 Y.-T. Lee Information—Theoretic distortion measures for speech recognition, IEEE Trans. Acoust. Speech Signal Process., 39: 330–335, 1991.
- 55 F. K. Soong et al. A vector quantization approach to speaker recognition, AT&T Tech. J., 66 (2): 14–26, 1987.
- 56 H. Sakoe S. Chiba Dynamic programming algorithm optimization for spoken word recognition, IEEE Trans. Acoust. Speech Signal Process., ASSP-26: 43–49, 1978.
- 57 A. Higgins L. Bhaler J. Porter Voice identification using nearest neighbor distance measure, Int. Conf. Acoust., Speech, Signal Process., Minneapolis, MN, 1993, pp. 375–378.
- 58 L. Rabiner B.-H. Juang An introduction to hidden Markov models, IEEE Acoust. Speech Signal Process. Mag., 3(1): 4–16, 1986.
- 59 L. R. Rabiner A tutorial on hidden Markov models and selected applications in speech recognition, Proc. IEEE, 77: 257–286, 1989.
- 60 A. Wald Sequential Analysis, New York: Wiley, 1947.
Citing Literature
Wiley Encyclopedia of Electrical and Electronics Engineering
Browse other articles of this reference work: