A Review of Audio-Visual Speech Synthesis Technology (cited by: 1)

Audio visual speech synthesis: an overview

Author: 张金光 (Zhang Jinguang)

Source: 《电声技术》 (Audio Engineering), 2017, No. 7, pp. 103-107 (5 pages)

Abstract: Auditory speech synthesis has passed through three major stages: formant synthesis, pitch-synchronous overlap-add (PSOLA) concatenative synthesis, and unit-selection concatenative synthesis. It is now moving toward synthesis based on statistical models. The underlying principle has come full circle: from parametric synthesis driven by preset rules, to concatenative synthesis that works directly on the original signal waveform, and back to parametric synthesis, this time based on statistical rules. Visual speech synthesis has progressed from physiological (anatomy-based) models, to geometric-feature models, to data-driven concatenative synthesis built on large corpora, which is the dominant practice today; it too is trending toward statistical parametric synthesis. What is the essence of each of these techniques, and what are their strengths and weaknesses? This article attempts a brief overview of the development of audio-visual speech synthesis technology.

Keywords: speech synthesis; rule-based synthesis; concatenative synthesis; data-driven

Classification: TN912.33 [Electronics and Telecommunications: Communication and Information Systems]
