Back
 ENG  Vol.9 No.9 , September 2017
Adaptive Pitch Transposition: Smart Auditory Spectral Shifts in Cochlear Implants
Abstract: Patients with severe hearing loss have the option to get a cochlear implant device to regain their hearing. Yet, the implantation process is not always optimal, which in some cases results in a shallow insertion depth or an accidental insertion into the wrong cochlear duct. As a consequence, the patients' pitch discrimination ability is suboptimal, leading to an even more decreased vowel identification, which is vital for speech recognition. This paper presents a technical approach to solve this problem: the adaptive pitch transposition module modifies the frequency content in a fashion so that the pitch is fixed to an optimal value. To determine this value, a patient-individual best pitch is determined experimentally by evaluating speech recognition at different pitches. This best pitch is subsequently called the comfort pitch. As a result of the considerations a technical implementation is presented in principle. A system comprised of pitch detection, pitch transposition and an arbitrary chosen comfort pitch is described in depth. It has been implemented prototypically in Matlab/Octave and tested with an example audio file. The system itself is designed as a preprocessing stage preceding cochlear implant processing.
Cite this paper: Struwe, K. and Salomon, R. (2017) Adaptive Pitch Transposition: Smart Auditory Spectral Shifts in Cochlear Implants. Engineering, 9, 739-754. doi: 10.4236/eng.2017.99045.
References

[1]   Struwe, K. (2017) APT: Enhanced Speech Comprehension Through Adaptive Pitch Transposition in Cochlear Implants. In: Giokas, K., Bokor, L.-Z. and Hopfgartner, F., Eds., eHealth 360?: International Summit on eHealth, Budapest, 14-16 June, 2016, Revised Selected Papers. Springer International Publishing, 224-228.

[2]   Shannon, R.V., et al. (2004) Speech Perception with Cochlear Implants. In: Cochlear Implants: Auditory Prostheses and Electric Hearing, Springer, 334-376.

[3]   Zeng, F.-G., Tang, Q. and Lu, T. (2014) Abnormal Pitch Perception Produced by Cochlear Implant Stimulation. PloS One, 9, e88662.

[4]   Laneau, J., Wouters, J. and Moonen, M. (2006) Improved Music Perception with Explicit Pitch Coding in Cochlear Implants. Audiology and Neurotology, 11, 38-52.

[5]   Francart, T., Osses, A. and Wouters, J. (2015) Speech Perception with F0mod, a Cochlear Implant Pitch Coding Strategy. International Journal of Audiology, 54, 424-432.

[6]   De Cheveigne, A. (2005) Pitch Perception Models. In: Pitch, Springer, 169-233.

[7]   Patterson, R.D., Gaudrain, E. and Walters. T.C. (2010) The Perception of Family and Register in Musical Tones. In: Music Perception, Springer, 13-50.

[8]   Mak, M.-W. and Yu, H.-B. (2014) A Study of Voice Activity Detection Techniques for NIST Speaker Recognition Evaluations. Computer Speech & Language, 28, 295-313.

[9]   Charpentier, F.J. and Stella, M.G. (1986) Diphone Synthesis Using an Overlap-Add Technique for Speech Waveforms Concatenation. Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'86, 11, 2015-2018.

[10]   Flanagan, J.L. and Golden, R.M. (1966) Phase Vocoder. Bell System Technical Journal, 45, 1493-1509.

[11]   Ellis, D.P.W. (2002) A Phase Vocoder in Matlab.
http://www.ee.columbia.edu/ln/rosa/matlab/pvoc/

[12]   Laroche, J. and Dolson. M. (1999) Improved Phase Vocoder Time-Scale Modification of Audio. IEEE Trans-actions on Speech and Audio Processing, 7, 323-332.

 
 
Top