Show simple item record

dc.contributor.authorBi, Ning.
dc.creatorBi, Ning.en_US
dc.date.accessioned2011-10-31T18:35:29Z
dc.date.available2011-10-31T18:35:29Z
dc.date.issued1995en_US
dc.identifier.urihttp://hdl.handle.net/10150/187290
dc.description.abstractIn this investigation, a vector quantization (VQ)-based speech conversion algorithm and a linear multivariate regression (LMR)-based speech conversion algorithm were modified, and the modified algorithms were applied to the enhancement of alaryngeal speech. The modifications were aimed at reducing the spectral distortion (bandwidth increase) in the VQ-based system and the spectral discontinuity in the LMR-based system. The spectral distortion in the VQ-based algorithm was compensated by formant enhancement using chirp z-transform and cepstral weighting. The spectral discontinuity in the LMR-based system was minimized by the use of overlapped subsets during the constructing of conversion mapping function. These modified algorithms were evaluated using simulated data and speech samples. Results of the evaluations indicated that the modified algorithms reduced conversion distortions. These modified algorithms were also used for the enhancement of alaryngeal speech. Results of perceptual evaluation indicated that listeners generally preferred to listen to the enhanced speech samples.
dc.language.isoenen_US
dc.publisherThe University of Arizona.en_US
dc.rightsCopyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author.en_US
dc.titleSpeech conversion and its application to alaryngeal speech enhancement.en_US
dc.typetexten_US
dc.typeDissertation-Reproduction (electronic)en_US
dc.contributor.chairQi, Yingyongen_US
thesis.degree.grantorUniversity of Arizonaen_US
thesis.degree.leveldoctoralen_US
dc.contributor.committeememberGlattke, Theodore J.en_US
dc.contributor.committeememberShipp, Thomasen_US
dc.identifier.proquest9604516en_US
thesis.degree.disciplineSpeech and Hearing Sciencesen_US
thesis.degree.disciplineGraduate Collegeen_US
thesis.degree.namePh.D.en_US
dc.description.noteThis item was digitized from a paper original and/or a microfilm copy. If you need higher-resolution images for any content in this item, please contact us at repository@u.library.arizona.edu.
dc.description.admin-noteOriginal file replaced with corrected file October 2023.
refterms.dateFOA2018-06-18T13:33:15Z
html.description.abstractIn this investigation, a vector quantization (VQ)-based speech conversion algorithm and a linear multivariate regression (LMR)-based speech conversion algorithm were modified, and the modified algorithms were applied to the enhancement of alaryngeal speech. The modifications were aimed at reducing the spectral distortion (bandwidth increase) in the VQ-based system and the spectral discontinuity in the LMR-based system. The spectral distortion in the VQ-based algorithm was compensated by formant enhancement using chirp z-transform and cepstral weighting. The spectral discontinuity in the LMR-based system was minimized by the use of overlapped subsets during the constructing of conversion mapping function. These modified algorithms were evaluated using simulated data and speech samples. Results of the evaluations indicated that the modified algorithms reduced conversion distortions. These modified algorithms were also used for the enhancement of alaryngeal speech. Results of perceptual evaluation indicated that listeners generally preferred to listen to the enhanced speech samples.


Files in this item

Thumbnail
Name:
azu_td_9604516_sip1_c.pdf
Size:
3.288Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record