Research and Development of the Speech Recognition Development Kit SRDK  Cited by: 1

Authors: Chen Yining (陈一宁)[1], Zhu Xuan (朱璇)[1], Shan Yixiang (单翼翔)[1], Liu Jia (刘加)[1]

Affiliation: [1] Department of Electronic Engineering, Tsinghua University, Beijing 100084

Source: Computer Engineering and Applications (《计算机工程与应用》), 2003, No. 1, pp. 5-8 (4 pages)

Funding: National Natural Science Foundation of China (No. 69975007); National "863" High-Tech Research and Development Program (No. 863-306ZD13-04-6)

Abstract: This paper describes in detail SRDK (Speech Recognition Development Kit), a toolkit for speech recognition development. The kit makes it easy to carry out a wide range of speech recognition tasks and can also be used for research on speech recognition technology. Its main features are: it is written in ANSI C, which makes it easy to port to embedded systems; it is well modularized, so its parts can be separated and recombined freely; and it has several advanced techniques built in, including state tying, pruning during training, duration-based post-processing, and use of the SSE (Streaming Single-Instruction Multiple-Data Extensions) instruction set. A practical speech recognition system has already been developed with SRDK.

Today, building speech recognizers on general-purpose platforms is becoming more and more popular. This paper presents a compact speech recognition development kit (SRDK). SRDK is a set of software modules based on a modified Hidden Markov Model (HMM). With these modules, building various practical speech recognizers, as well as carrying out related research, becomes easier. SRDK is written in ANSI C, so it runs not only on the Windows operating system but also in UNIX environments. Furthermore, it can be ported to embedded systems. Thanks to its modular design, any part of SRDK can be used independently. In addition, four built-in techniques deserve emphasis. First, parameter tying can be implemented at both the semi-syllable level and the state level. Second, SRDK applies pruning in the training stage to increase training speed. Third, to account for the effect of different speech rates, a duration model is introduced into post-processing to improve recognition performance. Finally, the SRDK source code is optimized with the Streaming SIMD Extensions (SSE) instructions published by Intel and supported by AMD. Moreover, a directed-graph framework is used in the search network instead of a multi-subtree structure, which improves overall recognition performance. The authors have already built a private automatic branch exchange system based on the SRDK introduced in this paper.
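As an illustration of the kind of pruning the abstract mentions, the following is a minimal ANSI C sketch of beam pruning over log-likelihood scores. It is not taken from SRDK: the function name `prune_beam`, its parameters, and the fixed-width beam criterion are all assumptions made for this example, and the abstract does not say which pruning rule SRDK actually uses during training.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Hypothetical helper (not part of SRDK): beam pruning.
 * Deactivates every active state whose log score lies more than
 * `beam` below the best active score. Returns the number of
 * states that remain active.
 */
size_t prune_beam(const double *log_score, int *active, size_t n, double beam)
{
    size_t i, kept = 0;
    int have_best = 0;
    double best = 0.0;

    assert(log_score != NULL && active != NULL);
    assert(beam >= 0.0);

    /* Pass 1: find the best score among the currently active states. */
    for (i = 0; i < n; i++) {
        if (active[i] && (!have_best || log_score[i] > best)) {
            best = log_score[i];
            have_best = 1;
        }
    }
    if (!have_best) {
        return 0; /* nothing was active to begin with */
    }

    /* Pass 2: drop states that fall outside the beam. */
    for (i = 0; i < n; i++) {
        if (active[i]) {
            if (log_score[i] < best - beam) {
                active[i] = 0;
            } else {
                kept++;
            }
        }
    }
    return kept;
}
```

A wider `beam` keeps more hypotheses alive and costs more computation; a narrower one is faster but risks discarding the state sequence that would eventually have scored best.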

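Keywords: speech recognition development kit; SRDK; special-purpose software; speech recognition; duration model; SSE instruction set; hidden Markov model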

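Classification: TP319 [Automation and Computer Technology: Computer Software and Theory]; TN912.34 [Automation and Computer Technology: Computer Science and Technology]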

 
