一个语音驱动的人脸合成系统

A FACE SYNTHESIS SYSTEM DRIVEN BY VOICE

出　　处：《模式识别与人工智能》2003年第4期459-464,共6页Pattern Recognition and Artificial Intelligence

摘　　要：本文采用多层前馈神经网络,对汉语中的声母和韵母的发音唇型参数及其相应的语音参数之间的映射进行了研究.通过实验,对于在进行语音参数和唇型参数的映射研究中,选择哪种语音参数更为合适进行了分析.最后,把该网络应用于一个人脸合成系统。该系统能够实时地合成和语音同步而且较为自然的唇型.In recent years, visual speech has received a lot of attention and played an active role in computer-human interactive technology. A large effort has been directed to mapping speech to lip movement and much attention has been paid to lip synchronization in face synthesis research. In this paper, a multi-layer feed-forward neural networks is used to map the speech of initials and finals in mandarin to corresponding lip movement. By experiments, which kind of speech parameters is more suitable for the mapping is analysed. The trained network is then applied to a visual speech system. The system can synthesize natural lip movement synchronized with audio speech.

关键词：人脸合成系统语音驱动前馈神经网络唇型参数语音参数隐马尔可夫模型人脸建模

分类号：TP391.4[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一个语音驱动的人脸合成系统

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一个语音驱动的人脸合成系统

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索