多任务师生模型的语音情感识别实验设计

Experimental Design of Speech Emotion Recognition with Multi-task Teacher-student Model

  • 摘要: 针对人机智能交互中语音情感识别的研究热点,将基于多任务约束师生模型的含噪语音情感识别设计为研究型教学实验,观察教师模型的指导作用、学生模型的学习过程和多级增强损失的约束力。设计基于Wav2vec 2.0的师生模型和多级增强损失机制,并将语音增强辅助任务引入学生模型,使学生模型能够通过学习获取教师模型的特征表示能力。在测试阶段学生模型直接从含噪语音中提取关键情感特征,用于情感分类。最后通过大量实验分析情感识别系统的性能和鲁棒性。该师生模型实验设计有助于提升学生思考能力、科研创新和探索意识。

     

    Abstract: Aiming at the research hotspots of speech emotion recognition in human-computer intelligent interaction, the noisy speech emotion recognition with the multi-task constrained teacher-student model is designed as a research-oriented teaching experiment, and the guiding role of the teacher model, the learning process of the student model and the binding force of multi-level reinforcement loss are observed. The teacher-student model based on Wav2vec 2.0 and the multi-level reinforcement loss mechanism are designed, and then the speech enhancement auxiliary task is introduced into the student model, so that the student model can obtain the feature representation ability of the teacher model through learning. In the testing phase, the student model directly extracts key emotional features from noisy speech for emotion classification. Finally, a large number of experiments are conducted to analyze the performance and robustness of the emotion recognition system. The experimental design based on the teacher-student model helps to improve students' thinking ability, scientific research innovation and exploration awareness.

     

/

返回文章
返回