Chapter 7

Speaker Recognition and Diarization

Gerald Friedland

Gerald Friedland

ICSI, University of California, Berkeley, California, USA

Search for more papers by this author
David van Leeuwen

David van Leeuwen

ICSI, University of California, Berkeley, California, USA

Search for more papers by this author
First published: 19 April 2010
Citations: 1

Summary

This chapter presents a continuously growing field that promises a wealth of applications far beyond the field of speech processing: the automatic identification of persons from their uttered speech. Research is currently focusing mainly on two tasks: The task of speaker detection is to verify the identity of a new speaker against a set of pretrained speaker models. The task of speaker diarization is to find speech segments of the same speaker without any a priori knowledge. The chapter introduces the general ideas in the two fields then it continues to explain the task of speaker diarization by providing an overview of current work before providing a more detailed description of a concrete example of a diarization system. Then, variants and current research topics are discussed. It presents speaker recognition in a similar way. Finally it concludes the chapter pointing to open problems.

Controlled Vocabulary Terms

speaker recognition

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.