检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]南京大学声学所近代声学国家重点实验室,南京210093
出 处:《声学学报》2005年第1期69-75,共7页Acta Acustica
基 金:国家自然科学基金资助项目(60272037)
摘 要:耳语音声韵分割是耳语音识别和转换的前期工作。由于耳语发音不同于正常音,一般用于正常音的声韵分割法对耳语音不再适用。通过分析耳语音的发音及声学特点,利用宽带语谱图的声韵变化规律,提出了适用于耳语音的信息熵端点检测法,以及相对熵、音长和谱重心相结合的声韵分割法。并对两组信噪比为2-10 dB的380个汉语单音节耳语音进行声韵分割,女声的正确率为87.9%,男声的正确率为90.3%,高于频域法、聚类法和谱平坦度声韵分割法。实验表明,相对熵法可做为耳语音识别和转换的预处理,它改善了汉语耳语音转换为正常音的音质。The Initial/Final(IF) segmentation of whispered speech is the pre-processing in the whispered speech recognition and the reconstruction of normal speech from whisper. However, because the whispered initials and finals are all unvoiced, it is difficult to segment them by the methods used in the normal speech. With tile characteristics analysis of Chinese whispered speech, a new segmentation method is proposed. The speech endpoint is detected by the entropy function, and the initial/final boundary is obtained by the decision of the initial duration, the symmetric relative entropy and the normalized spectral center of gravity. The correct segmentation rates are 87.9% for the female data and 90.3% for the male data in the test with 380 Chinese whispered syllables at 2-10 dB SNR. It is more accuracy than the frequency domain method, the clustering method and the spectral flatness method. As shown in the experiments, this algorithm can be used as pre-processing in the whispered speech recognition and the conversion. It gives the reconstructed speech a more natural quality.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.15