检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]中国科学技术大学自动化系,安徽合肥230027
出 处:《声学技术》2008年第1期79-86,共8页Technical Acoustics
基 金:中国科技大学研究生创新基金(KD2005044);模式识别国家重点实验室开放基金;中国科学技术大学/中国科学院自动化研究所智能科学与技术联合实验室开放基金(JL0602)
摘 要:基音频率是语音信号处理中的一个重要参数。倍频、半频错误以及清浊音判决的可靠性等问题一直是基频估计中的难点问题。在对语音信号的倒谱进行适当修正的基础上,提出了一种高精度的基频估计算法。该算法根据倒谱、短时能量和短时过零率在清音段和浊音段的不同表现,构造了一个清浊音判决函数,大大提高了清浊音判决精度;然后利用动态规划技术进行基频跟踪。在构造代价函数时,充分考虑了基频连续性的影响,从而使该算法既能有效地避免倍频和半频错误,又能体现出基频的自然加倍和减半。通过与现有的几种效果较好的方法进行对比实验,结果表明该算法具有准确率高、基频轨迹平滑的优点,利用该算法得到的基频轨迹基本不需要进行后期平滑处理。Fundamental frequency (F0) is a key parameter in speech signals processing. Pitch doubling, pitch halving and the reliability of voicing decision are the most difficult problems in the estimation of fundamental frequency. An algorithm based on the modified cepstrum is proposed for the estimation of the fundamental frequency (F0) of speech signals Voicing decisions are made by using a decision function composed of cepstral peak, zero-crossing rate, and energy of short-time segments of speech signals. An accurate voiced/unvoiced classification is obtained based on this decision function. Then a dynamic programming method is used to realize pitch tracking. The consecution of F0 is considered sufficiently in the cost function. The proposed algorithm can avoid the problem concerning with pitch doubling and pitch halving effectively, as well as preserve the natural doubling and halving of F0. The comparing experiments with several other well-known methods show that the algorithm in this paper has some desirable advantages such as high accuracy and smooth F0 contour, which needs no postsmoother.
分 类 号:TN912.3[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.15