Speaker Detection in Broadcast Speech Databases

Rosenberg, Aaron; Magrin-Chagnolleau, Ivan; Parthasarathy, S.

dc.citation.bibtexName	inproceedings	en_US
dc.citation.conferenceName	Proceedings of International Conference on Spoken Language Processing	en_US
dc.contributor.author	Rosenberg, Aaron	en_US
dc.contributor.author	Magrin-Chagnolleau, Ivan	en_US
dc.contributor.author	Parthasarathy, S.	en_US
dc.contributor.org	Digital Signal Processing (http://dsp.rice.edu/)	en_US
dc.date.accessioned	2007-10-31T01:03:05Z	en_US
dc.date.available	2007-10-31T01:03:05Z	en_US
dc.date.issued	1998-01-15	en_US
dc.date.modified	2004-11-04	en_US
dc.date.note	2004-01-14	en_US
dc.date.submitted	1998-01-15	en_US
dc.description	Conference Paper	en_US
dc.description.abstract	Experiments have been carried out to assess the feasibility of detecting target speaker segments in multi-speaker broadcast databases. The experimental database consists of NBC Nightly News broadcasts. The target speaker is the news anchor, Tom Brokaw. Gaussian mixture models are constructed from labelled training data for the target speaker, along with background models for other speakers, commercials, and music. Four labelled 30-minute broadcasts are used for testing. Mel-frequency cepstral features, augmented by delta cepstral features, are calculated over 20 ms windows shifted every 10 ms through a broadcast. Likelihood ratio scores are calculated for each test frame and averaged over blocks of frames of a specified duration. The block scores are input to a detection routine, which returns estimates of target segment boundaries. The best results obtained over the test broadcasts range from 82% to 100% detection of target segments, with segment frame accuracy ranging from 86% to 95%. Between 0 and 2 false-alarm segments are detected in each 30-minute broadcast.	en_US
dc.identifier.citation	A. Rosenberg, I. Magrin-Chagnolleau and S. Parthasarathy, "Speaker Detection in Broadcast Speech Databases," 1998.	en_US
dc.identifier.uri	https://hdl.handle.net/1911/20304	en_US
dc.language.iso	eng	en_US
dc.subject	Temporary	en_US
dc.subject.keyword	Temporary	en_US
dc.subject.other	Signal Processing Applications	en_US
dc.title	Speaker Detection in Broadcast Speech Databases	en_US
dc.type	Conference paper	en_US
dc.type.dcmi	Text	en_US
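
The block scoring and detection steps summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that per-frame log-likelihoods under the target model and under a single combined background model are already available, that blocks do not overlap, and that a block counts as target speech when its average score exceeds a fixed threshold. The function names, the threshold value, and the non-overlapping block scheme are all hypothetical choices made for illustration.

```python
from typing import List, Tuple


def block_scores(target_ll: List[float],
                 background_ll: List[float],
                 block_len: int) -> List[float]:
    """Average per-frame log-likelihood ratios over non-overlapping blocks.

    Assumption: one background log-likelihood per frame (the abstract
    mentions several background models; how they are combined is not
    stated there).
    """
    llr = [t - b for t, b in zip(target_ll, background_ll)]
    scores = []
    for start in range(0, len(llr), block_len):
        block = llr[start:start + block_len]
        scores.append(sum(block) / len(block))
    return scores


def detect_segments(scores: List[float],
                    block_len: int,
                    n_frames: int,
                    threshold: float = 0.0) -> List[Tuple[int, int]]:
    """Merge consecutive above-threshold blocks into frame-index segments.

    Returns (start_frame, end_frame) pairs with end_frame exclusive.
    The threshold of 0.0 is an illustrative assumption.
    """
    segments = []
    seg_start = None
    for i, score in enumerate(scores):
        if score > threshold and seg_start is None:
            seg_start = i  # a target segment opens at this block
        elif score <= threshold and seg_start is not None:
            segments.append((seg_start * block_len, i * block_len))
            seg_start = None
    if seg_start is not None:
        # Segment runs to the end; clamp in case the last block is partial.
        segments.append((seg_start * block_len, n_frames))
    return segments


# Toy example: 8 frames, blocks of 2 frames.
target = [-5.0, -5.0, -1.0, -1.0, -1.0, -1.0, -5.0, -5.0]
background = [-2.0] * 8
scores = block_scores(target, background, block_len=2)
segments = detect_segments(scores, block_len=2, n_frames=len(target))
```

With the 10 ms frame shift stated in the abstract, a frame index converts to seconds by multiplying by 0.010.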

Collections

ECE Publications
DSP Publications
