We use cookies to improve your experience with our site.

计算机辅助发音训练中基于统计分析和计算语音感知的发音错误严重程度的分级

Grading the Severity of Mispronunciations in CAPT Based on Statistical Analysis and Computational Speech Perception

  • 摘要: 计算机辅助发音训练CAPT技术使得能够通过使用自动语音识别来发现第二语言(L2)学者的错误发音。为了进一步方便人们对L2的学习,我们提出了一个基于原则方法,用于对发音错误严重程度进行分级。本文提出的分级方法是受听觉感知的启发。我们还开发了一种两个口语音素之间的感知距离(PD)的方法。此方法用于计算母语(L1)的听觉混淆。我们发现,对于中国英语学习者而言(母语为汉语或粤语,二语为美式英语),计算机辅助发音训练系统中检测的错误发音与PD高度相关。实验结果表明在第二语言学习中,听觉混淆通常导致发音困扰,PD可以帮助我们对发音错误的严重程度进行分级(越不相似的音素混淆,错误越严重),并对所生成的给学习者的修正反馈进行优先级排序。

     

    Abstract: Computer-aided pronunciation training (CAPT) technologies enable the use of automatic speech recognition to detect mispronunciations in second language (L2) learners' speech. In order to further facilitate learning, we aim to develop a principle-based method for generating a gradation of the severity of mispronunciations. This paper presents an approach towards gradation that is motivated by auditory perception. We have developed a computational method for generating a perceptual distance (PD) between two spoken phonemes. This is used to compute the auditory confusion of native language(L1). PD is found to correlate well with the mispronunciations detected in CAPT system for Chinese learners of English, i.e., L1 being Chinese (Mandarin and Cantonese) and L2 being US English. The results show that auditory confusion is indicative of pronunciation confusions in L2 learning. PD can also be used to help us grade the severity of errors (i.e., mispronunciations that confuse more distant phonemes are more severe) and accordingly prioritize the order of corrective feedback generated for the learners.

     

/

返回文章
返回