情感语音合成综述被引量：1

A survey of emotional speech synthesis

作　　者：施昊翔张旭龙王健宗程宁肖京 SHI Haoxiang;ZHANG Xulong;WANG Jianzong;CHENG Ning;XIAO Jing(Ping An Technology(Shenzhen)Co.,Ltd.,Shenzhen 518063,China;University of Science and Technology of China,Hefei 230026,China)

机构地区：[1]平安科技(深圳)有限公司,广东深圳518063 [2]中国科学技术大学,安徽合肥230026

出　　处：《大数据》2024年第5期56-73,共18页Big Data Research

基　　金：广东省重点领域研发计划“新一代人工智能”重大专项(No.2021B0101400003)。

摘　　要：作为语音领域一个重要的研究方向,语音合成致力于将文本转化为语音。随着深度学习技术的快速发展,语音合成的目的早已不仅仅是合成一段“能听懂”的音频这么简单,情感的加入往往能使语音变得更加具有表现力。基于此,情感语音合成在语音中加入不同的情感并对情感进行调控,以生成灵活且准确的情感语音。从情感语音合成中的几个关键科学问题出发,分别对近几年来基于情感迁移、情感强度控制和情绪混合的发展进行了总结分析,并介绍了情感语音合成的相关数据集和评价指标,最后对情感语音合成进行了展望。As a significant research area in the field of speech technology,speech synthesis is dedicated to converting text into speech.With the rapid development of deep learning technology,the objective of speech synthesis has evolved beyond merely producing"understandable"audio.The incorporation of emotion often enhances the expressiveness of synthesized speech.Consequently,emotional speech synthesis aims to combine speech with different emotions and regulate these emotions to generate flexible and precise emotional speech.Starting from several key issues in emotional speech synthesis,this paper summarizes and analyzes the development based on emotion transfer,emotion intensity control and emotion mixing in recent years,and introduces the relevant data sets and evaluation indicators of emotion speech synthesis.Finally,the emotional speech synthesis is prospected.

关键词：情感语音合成情感迁移情感强度深度学习

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

情感语音合成综述被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

情感语音合成综述 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

情感语音合成综述被引量：1