基于流媒体语音关键词识别系统的研究  被引量:1

Study of keyword recognition system for speech based on streaming media

在线阅读下载全文

作  者:吕淑琴[1] 孙成立[2] 

机构地区:[1]北京信息工程学院通信与信息工程系,北京100101 [2]石家庄经济学院信息工程学院

出  处:《北京机械工业学院学报》2006年第4期47-50,共4页Journal of Beijing Institute of Machinery

摘  要:近年来,关键词检测技术在口语语音和电话语音领域取得了显著的发展,但针对流媒体语音关键词检测的有关文献却很少见,基于这个目的,提出一套针对流媒体关键词检测的系统方案。系统利用WMFSDK从流媒体中提取出解码的语音数据。为了区分集外词和关键词,利用了在线垃圾模型拒绝集外词并且得到多个关键词候选。在关键词确认阶段,把解码过程中得到的基于MAP的词置信度和N-best特征作为特征向量,设计了支持向量机(SVM)分类器。通过实验对SVM方法和传统的Fisher方法进行了比较,研究表明前者的应用效果整体优于后者。During recent years, Significant progress has been made in keyword spotting (KWS) for spo- ken speech or telephone speech, but little reference is found concerning word spotting in audio data embedded in streaming media. A keyword recognition system scheme is proposed for streaming media based on audio document retrieval. In the system, the decoded audio data was retrieved from Streaming media via Microsoft Windows Media Format Soft Development Kit (WMFSDK). In order to distinguish between out-of-vocabulary (OOV) and vocabulary words, on-line garbage (OLG) model is proposed aiming to reject OOV and obtain keyword candidates. In utterance verification stage, a Support Vector Machine (SVM) classifier is designed whose input feature vectors consisting of the parameters based on the NBest results and the MAP-based word confident measures. Compared with the traditional Fisher method, results show that the former is more effective than the latter.

关 键 词:流媒体 在线垃圾模型 置信度 支持向量机 

分 类 号:TP391.42[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象