Detection of Target Speakers in Audio Databases

Magrin-Chagnolleau, Ivan; Rosenberg, Aaron; Parthasarathy, S.

Detection of Target Speakers in Audio Databases

dc.citation.bibtexName	inproceedings	en_US
dc.citation.conferenceName	IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)	en_US
dc.contributor.author	Magrin-Chagnolleau, Ivan	en_US
dc.contributor.author	Rosenberg, Aaron	en_US
dc.contributor.author	Parthasarathy, S.	en_US
dc.contributor.org	Digital Signal Processing (http://dsp.rice.edu/)	en_US
dc.date.accessioned	2007-10-31T00:52:29Z	en_US
dc.date.available	2007-10-31T00:52:29Z	en_US
dc.date.issued	1999-01-15	en_US
dc.date.modified	2004-11-05	en_US
dc.date.note	2004-01-14	en_US
dc.date.submitted	1999-01-15	en_US
dc.description	Conference Paper	en_US
dc.description.abstract	The problem of speaker detection in audio databases is addressed in this paper. Gaussian mixture modeling is used to build target speaker and background models. A detection algorithm based on a likelihood ratio calculation is applied to estimate target speaker segments. Evaluation procedures are defined in detail for this task. Results are given for different subsets of the HUB4 broadcast news database. For one target speaker, with the data restricted to high quality speech segments, the segment miss rate is approximately 7%. For unrestricted data, the segment miss rate is approximately 27%. In both cases the segment false alarm rate is 4 or 5 per hour. For two target speakers with unrestricted data, the segment miss rate is approximately 63% with about 27 segment false alarms per hour. The decrease in performance for two target speakers is largely associated with short speech segments in the two target speaker test data which are undetectable in the current configuration of the detection algorithm.	en_US
dc.identifier.citation	I. Magrin-Chagnolleau, A. Rosenberg and S. Parthasarathy, "Detection of Target Speakers in Audio Databases," 1999.	en_US
dc.identifier.doi	http://dx.doi.org/10.1109/ICASSP.1999.759797	en_US
dc.identifier.uri	https://hdl.handle.net/1911/20077	en_US
dc.language.iso	eng	en_US
dc.subject	Temporary	en_US
dc.subject.keyword	Temporary	en_US
dc.subject.other	Signal Processing Applications	en_US
dc.title	Detection of Target Speakers in Audio Databases	en_US
dc.type	Conference paper	en_US
dc.type.dcmi	Text	en_US

Files

Original bundle

Now showing 1 - 3 of 3

Name:: Mag1999Non5Detection.PDF
Size:: 57.04 KB
Format:: Adobe Portable Document Format

Download

Name:: Mag1999Non5Detection.PPT
Size:: 98.5 KB
Format:: Microsoft Powerpoint

Download

Name:: Mag1999Non5Detection.PS
Size:: 86.38 KB
Format:: Postscript Files

Download

Collections

ECE Publications
DSP Publications