Browsing by Author "Pereira, Fernando"
An Overview of the AT&T Spoken Document Retrieval System (1998-01-15)
Choi, John; Hindle, Don; Hirschberg, Julia; Magrin-Chagnolleau, Ivan; Nakatani, Christine; Pereira, Fernando; Singhal, Amit; Whittaker, Steve; Digital Signal Processing (http://dsp.rice.edu/)

We present an overview of a spoken document retrieval system developed at AT&T Labs-Research for the HUB4 Broadcast News corpus. This overview includes a description of the intonational phrase boundary detection, classification, speech recognition, information retrieval and user interface components of the system, along with updated system assessments based on the 49-query task defined for the TREC-6 SDR track. Results from a comparative ranking study, based on queries taken from AP Newswire headlines from the same time period that the Broadcast News corpus was recorded, are presented. For the AP task, retrieval accuracy is assessed by comparing the documents retrieved from ASR generated transcriptions with those from human generated transcriptions.

SCAN - Speech Content Based Audio Navigator: A Systems Overview (1998-01-15)
Choi, John; Hindle, Don; Hirschberg, Julia; Magrin-Chagnolleau, Ivan; Nakatani, Christine; Pereira, Fernando; Singhal, Amit; Whittaker, Steve; Digital Signal Processing (http://dsp.rice.edu/)

SCAN (Speech Content based Audio Navigator) is a spoken document retrieval system integrating speaker-independent, large-vocabulary speech recognition with information-retrieval to support query-based retrieval of information from speech archives. Initial development focused on the application of SCAN to the broadcast news domain. This paper provides an overview of this system, including a description of its graphical user interface which incorporates machine-generated speech transcripts to provide local contextual navigation and random access for browsing large speech databases.