Speech to Electrode Encoding Using Virtual Channels for Cochlear Implants

Google Meet Link

meet.google.com/ard-dqki-gjx

GitHub Repository

https://github.com/RushilJ2603/cochlear_Implant_IEEE

Aim

To design and implement a software-based cochlear implant signal processing framework capable of converting speech into electrode stimulation patterns using ERB scaling, ACE coding and virtual channel current steering techniques.

Introduction

Cochlear implants are neural prosthetic devices designed to restore hearing in individuals with severe sensorineural hearing loss. Unlike hearing aids that amplify sound, cochlear implants bypass damaged hair cells and directly stimulate the auditory nerve using intracochlear electrodes. Modern cochlear implant systems rely on sophisticated speech coding strategies such as CIS and ACE to encode speech information efficiently. However, the number of physical electrodes is limited, reducing spectral resolution. Virtual channel current steering improves this limitation by creating intermediate pitch percepts between adjacent electrodes, effectively increasing the number of distinguishable channels.

Literature Survey and Technologies Used

The project is based on research related to ACE (Advanced Combination Encoder), CIS (Continuous Interleaved Sampling), and virtual channel current steering techniques. Previous studies demonstrated that current steering can significantly improve spectral discrimination and speech perception in cochlear implant systems.

The implementation uses:

Python for modular development
NumPy and SciPy for DSP operations
Librosa for audio preprocessing and feature extraction
Matplotlib for visualization and electrodograms
Butterworth filterbanks for spectral decomposition

Methodology

1. Audio Preprocessing

Audio loading and normalization
Pre-emphasis filtering
Framing and windowing

2. Filterbank Analysis

ERB-spaced Butterworth filters separate the signal into frequency bands corresponding to cochlear electrodes.

3. Envelope Extraction

Rectification and low-pass filtering are used to obtain temporal envelopes.
ACE n-of-m maxima selection chooses dominant channels.

4. Virtual Channel Current Steering

Adjacent electrodes stimulated simultaneously using steering coefficient
Intermediate pitch channels generated between electrodes.

5. Dynamic Range Compression

Loudness growth functions compress acoustic amplitudes into electrical stimulation ranges.

6. Pulse Generation and Vocoder

Biphasic CIS pulse trains generated.
Vocoder reconstruction produces speech-like perceptual output.

7. Visualization Modules

Electrodograms : visualisation of electrode activation over time
Filterbank Responses : frequency responses of ERB-spaced channels
Envelope Trajectories : temporal envelope evolution in each channel
Pitch Tracking & Stimulation Plots : estimated pitch and electrode current activity

Results

Conclusion and Future Scope

The project successfully demonstrates a modular cochlear implant speech processing simulator with ACE coding and virtual channel current steering. The system improves spectral representation beyond the limitation of physical electrodes and provides a strong foundation for advanced cochlear implant research.

Future improvements may include:

Real-time processing pipelines
FPGA or embedded DSP implementation
Adaptive AI-based channel optimization
Patient-specific fitting strategies
Deep learning-assisted speech enhancement

References

Wilson, B. S., et al. (1991). Better speech recognition with cochlear implants. Nature, 352, 236–238.
Loizou, P. C. (1998). Mimicking the human ear. IEEE Signal Processing Magazine, 15(5), 101–130.
Moore, B. C. J., & Glasberg, B. R. (1983). Auditory-filter bandwidths and excitation patterns. J. Acoust. Soc. Am., 74(3), 750–753.
Donaldson, G. S., et al. (2005). Improving speech understanding with current steering. Ear and Hearing, 26(4S), 109S–119S.
Koch, D. B., et al. (2007). Combining current steering and current focusing. Cochlear Implants International, 8(4), 202–213.
Shannon, R. V., et al. (2004). Speech recognition with primarily temporal cues. Science, 303, 1780–1782.
Vandali, A. E., et al. (2000). Speech perception as a function of electrical stimulation rate. Ear and Hearing, 21(6), 608–624.
Zeng, F.-G., et al. (2002). Cochlear implant speech recognition with civil-level stimulation. J. Acoust. Soc. Am., 112(5), 1829–1842.

Mentors:
Rushil Jain, Shobitha Arun Kumar

Mentees:
K Charandheep, Chinmay Raikar, Harain Muralietharan, S Kayalselvi, Shaurya Bhatkaria, Shreya Harikumar

Virtual Expo 2026

Abstract

Abstract

Report Information

Team Members

Team Members

Report Details

Report Details

Explore More Projects