A Prony speech processing technique

dc.contributor.advisorParks, Thomasen_US
dc.creatorScanio, Thomas Josephen_US
dc.date.accessioned2016-04-21T12:02:08Zen_US
dc.date.available2016-04-21T12:02:08Zen_US
dc.date.issued1972en_US
dc.description.abstractA method for speech processing is presented. The method does not require voiced/unvoiced or pitch determination. It models the sampled speech wave as a concatenation of initial segments of unit pulse responses of linear, time-invariant, recursive discrete time systems. The poles of the systems are calculated by Prony's method applied to blocks of speech samples. The zeroes are chosen to zero the error between the speech wave and the first output samples of each system. The analysis phase proceeds as follows. After an initial block of unit pulse response, the system output samples are compared with the speech samples and the system continues to function until the error between the two grows too large. At this time the next block of samples is used to calculate a new system and the process continues. The parameters describing the speech are thus the system parameters (poles and zeroes, for example) and the number of output samples taken from each system. This information is quantized to produce a bit rate for the process of 20 kilobits/second. The approximate speech is synthesized by implementing each system sequentially, applying a pulse to the input and concatenating the required number of output samples to the samples from previous systems. The speech obtained is very noisy, but it is intelligible and speakers can be recognized. A demonstration tape is available from Dr. T. W. Parks of the Electrical Engineering Department. The entire analysis and synthesis procedure for 8 kHz sampling runs in 145 times real time on a Burroughs B-5500 computer with an ALGOL program. It is estimated that this is fast enough to be done in real time by a special purpose processor.en_US
dc.format.digitalOriginreformatted digitalen_US
dc.format.extent67 ppen_US
dc.identifier.callnoThesis E.E. 1972 SCANIOen_US
dc.identifier.citationScanio, Thomas Joseph. "A Prony speech processing technique." (1972) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/89313">https://hdl.handle.net/1911/89313</a>.en_US
dc.identifier.digitalRICE0351en_US
dc.identifier.urihttps://hdl.handle.net/1911/89313en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.titleA Prony speech processing techniqueen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentElectrical Engineeringen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Scienceen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
RICE0351.pdf
Size:
1.27 MB
Format:
Adobe Portable Document Format