JBiSE Vol.3 No.1, January 2010
Cepstral and linear prediction techniques for improving intelligibility and audibility of impaired speech
ABSTRACT
Human speech can become impaired, i.e., unintelligible, for a variety of reasons that may be either neurological or anatomical. The objective of this research was to improve the intelligibility and audibility of impaired speech resulting from a disabled human speech mechanism with impairment in the acoustic system, that is, the supra-laryngeal vocal tract. Three methods are presented in this paper for this purpose. Method 1 develops an inverse model of the speech degradation using the cepstral technique. Method 2 replaces the degraded vocal tract response with a normal vocal tract response using the cepstral technique. Method 3 replaces the degraded vocal tract response with a normal vocal tract response using the linear prediction technique.
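The abstract does not give the details of Method 3, so the following is only a minimal sketch of the general idea, assuming the standard autocorrelation method of linear prediction solved with the Levinson-Durbin recursion. An impaired frame is inverse-filtered with its own predictor to obtain an excitation residual, which is then passed through an all-pole filter estimated from a normal frame. All function names (`autocorrelation`, `levinson_durbin`, `inverse_filter`, `synthesis_filter`, `replace_vocal_tract`) are hypothetical, and windowing, gain matching, and frame overlap are deliberately omitted.

```python
def autocorrelation(frame, max_lag):
    """Short-time autocorrelation r[0..max_lag] of one frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LP normal equations.

    Returns (a, err) where a[k-1] is the predictor coefficient a_k in
    s[n] ~ sum_k a_k * s[n-k], and err is the final prediction error energy.
    """
    a = [0.0] * (order + 1)  # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err  # reflection coefficient for this stage
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:], err


def inverse_filter(signal, coeffs):
    """Prediction residual e[n] = s[n] - sum_k a_k * s[n-k]."""
    out = []
    for n in range(len(signal)):
        pred = sum(c * signal[n - k]
                   for k, c in enumerate(coeffs, start=1) if n - k >= 0)
        out.append(signal[n] - pred)
    return out


def synthesis_filter(excitation, coeffs):
    """All-pole synthesis y[n] = e[n] + sum_k a_k * y[n-k]."""
    out = []
    for n in range(len(excitation)):
        pred = sum(c * out[n - k]
                   for k, c in enumerate(coeffs, start=1) if n - k >= 0)
        out.append(excitation[n] + pred)
    return out


def replace_vocal_tract(impaired_frame, normal_frame, order):
    """Keep the impaired frame's excitation, swap in the normal all-pole model."""
    a_imp, _ = levinson_durbin(autocorrelation(impaired_frame, order), order)
    a_norm, _ = levinson_durbin(autocorrelation(normal_frame, order), order)
    residual = inverse_filter(impaired_frame, a_imp)
    return synthesis_filter(residual, a_norm)
```

In this sketch the predictor order and the choice of the normal reference frame are left to the caller. A practical system would also need to align impaired and normal frames of the same speech sound, which the sketch does not attempt.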

Cite this paper
Ravindran, G., Shenbagadevi, S. and Selvam, V. (2010) Cepstral and linear prediction techniques for improving intelligibility and audibility of impaired speech. Journal of Biomedical Science and Engineering, 3, 85-94. doi: 10.4236/jbise.2010.31013.

 
 