Researchers Propose A High-Density sEMG Technique for Automatic Speech Recognition

Date:25-02-2020   |   【Print】 【close

The communication of words and speaking is a tremendously important way to engage in social interaction. The normal speaking process requires coordinated contractions of a mass of articulatory muscles on the face and neck. 

Surface electromyography (sEMG) signals containing electrophysiology information associated with speaking activities are usually considered as an alternative input for automatic speech recognition.  

A research team led by Prof. CHEN Shixiong from Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences proposed a high-density (HD) sEMG technique, using dense arrays of individual electrodes to acquire muscle activities over a relatively large area with a rich set of information for adequate motion classification.  

In the sEMG-based speech recognition system, the locations of electrodes used to recording the sEMG signals are the main factor that would affect the classification performances on automatic speech recognition.However, in the previous studies, the placement of the electrodes was dependent on the knowledge of the individual researchers without prior quantitative analysis or benchmark standard. 

According to the team’s publishment in the JOURNAL OF INTEGRATION TECHNOLOGY, the study analyzed the contribution of sEMG signals between the left and right sides of the facial and neck muscles when classifying the daily words in speaking task with English and Chinese respectively. 

In this study, the HD sEMG signals was recorded by the surface electrodes which have 120 channels from eight subjects’ facial and neck muscles, the experimental process needed them to speak five daily words in English and Chinese separately. 

Recording from the electrode arrays in the left and right sides of the facial and neck muscles, classification accuracies were obtained when recognizing the speaking tasks, compared with the signals when using the HD sEMG. 

The results showed that there were similar classification accuracies obtained between using the HD sEMG recording from the left region of the neck and right side of the neck. On the contrary, a significant difference in classification accuracies between using the signals from the left and right facial muscles. 

"The HD sEMG signals from symmetrical positions in the neck are consistent in their contribution to speech recognition, whereas facial signals are not. " said Professor CHEN. 

He added: "The proposed HD sEMG technique would be a useful tool to recognize the speaking activities and determine the appropriate placement of the electrodes using for automatic speech recognition, which might provide a potential tool for reducing the electrode number and selecting the optimal location of channels for speech recognition. " 

Distribution of the high density sEMG electrodes on the left and right sides of the face/neck: (Left) Four main parts of the face; (Right) Symmetrical arrangemrnt. (Image by Prof. CHEN)

 

CONTACT:

ZHANG Xiaomin

Email: xm.zhang@siat.ac.cn

Tel: 86-755-86585299