基于声学分段模型的无监督语音样例检测被引量：2

Unsupervised Query-by-Example Spoken Term Detection Based on Acoustic Segment Models

出　　处：《数据采集与处理》2016年第2期407-414,共8页Journal of Data Acquisition and Processing

基　　金：国家自然科学基金(61175017)资助项目

摘　　要：提出一种基于声学分段模型的无监督语音样例检测方法。该方法首先利用高斯混合模型(Gaussian mixture model,GMM)将训练数据频谱参数转换为后验概率特征向量,采用层次聚类算法确定后验概率的边界信息,得到声学分段;然后通过k-means算法将片段聚类并添加标签,构建基于后验概率的声学分段模型。检索时以模型对查询样例与检索文档的解码序列代替测量矩阵以降低检索时间,通过基于最小编辑距离的动态匹配检索查询项,最小编辑距离的代价函数由模型相似度距离矩阵修正。实验结果表明,相比GMM及传统声学分段模型,本文提出的方法性能更好,检索速度得到显著提升。A study of acoustic segment models （ASM s） for unsupervised query‐by‐example spoken term detec‐tion is presented .Firsty ,a Gaussian mixture model（GMM） is trained without any transcription information to label speech frames with Gaussian posteriorgram .Hierarchical agglomerative clustering is used to decompose the posterior features into acoustically exhibiting segments .A label is assigned to each result segment by k‐means clustering ,then posteriorgram is faciltitated to train ASMs .In query matching phase ,Viterbi decode is proposed to represent query and test posteriorgrams as ASM sequences .Dynamic match lattice spotting based on minimum edit distance is used to locate possible occurrences of the query term .Experimental results show that the proposed method outperforms traditional GMM and ASMs tokenizers .

关键词：声学分段模型语音样例检测后验概率特征无监督

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于声学分段模型的无监督语音样例检测被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于声学分段模型的无监督语音样例检测 被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于声学分段模型的无监督语音样例检测被引量：2