A Prony speech processing technique

Scanio, Thomas Joseph

A Prony speech processing technique

dc.contributor.advisor	Parks, Thomas	en_US
dc.creator	Scanio, Thomas Joseph	en_US
dc.date.accessioned	2016-04-21T12:02:08Z	en_US
dc.date.available	2016-04-21T12:02:08Z	en_US
dc.date.issued	1972	en_US
dc.description.abstract	A method for speech processing is presented. The method does not require voiced/unvoiced or pitch determination. It models the sampled speech wave as a concatenation of initial segments of unit pulse responses of linear, time-invariant, recursive discrete time systems. The poles of the systems are calculated by Prony's method applied to blocks of speech samples. The zeroes are chosen to zero the error between the speech wave and the first output samples of each system. The analysis phase proceeds as follows. After an initial block of unit pulse response, the system output samples are compared with the speech samples and the system continues to function until the error between the two grows too large. At this time the next block of samples is used to calculate a new system and the process continues. The parameters describing the speech are thus the system parameters (poles and zeroes, for example) and the number of output samples taken from each system. This information is quantized to produce a bit rate for the process of 20 kilobits/second. The approximate speech is synthesized by implementing each system sequentially, applying a pulse to the input and concatenating the required number of output samples to the samples from previous systems. The speech obtained is very noisy, but it is intelligible and speakers can be recognized. A demonstration tape is available from Dr. T. W. Parks of the Electrical Engineering Department. The entire analysis and synthesis procedure for 8 kHz sampling runs in 145 times real time on a Burroughs B-5500 computer with an ALGOL program. It is estimated that this is fast enough to be done in real time by a special purpose processor.	en_US
dc.format.digitalOrigin	reformatted digital	en_US
dc.format.extent	67 pp	en_US
dc.identifier.callno	Thesis E.E. 1972 SCANIO	en_US
dc.identifier.citation	Scanio, Thomas Joseph. "A Prony speech processing technique." (1972) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/89313">https://hdl.handle.net/1911/89313</a>.	en_US
dc.identifier.digital	RICE0351	en_US
dc.identifier.uri	https://hdl.handle.net/1911/89313	en_US
dc.language.iso	eng	en_US
dc.rights	Copyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.	en_US
dc.title	A Prony speech processing technique	en_US
dc.type	Thesis	en_US
dc.type.material	Text	en_US
thesis.degree.department	Electrical Engineering	en_US
thesis.degree.discipline	Engineering	en_US
thesis.degree.grantor	Rice University	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	Master of Science	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: RICE0351.pdf
Size:: 1.27 MB
Format:: Adobe Portable Document Format

Download

Collections

Rice University Theses and Dissertations