检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:刘晓旺 郝韵 张劲松[1,2] LIU Xiaowang;HAO Yun;ZHANG Jinsong(School of Information Science,Beijing Language and Culture University,Beijing 100083,China;Institute of Intelligent Language Learning,Beijing Language and Culture University,Beijing 100083,China;Center for Language and Cognition,University of Groningen,Groningen 9700 AB,Netherlands)
机构地区:[1]北京语言大学信息科学学院,北京100083 [2]北京语言大学语言智能学习研究所,北京100083 [3]格罗宁根大学语言与认知中心,格罗宁根9700 AB,荷兰
出 处:《清华大学学报(自然科学版)》2024年第11期1911-1918,共8页Journal of Tsinghua University(Science and Technology)
基 金:中央高校基本科研业务专项资金(23YBT18);教育部人文社科研究规划项目(23YJA740012);国际中文教育研究重点研究项目(22YH49B)。
摘 要:语音信息论主要研究语言信息冗余和声学特征之间的关系。目前的研究结论主要来自印欧语系的语言,而在汉语普通话方面的研究较少,尤其是在语言冗余与韵律结构(边界和重音)的对应关系问题上,还缺乏研究。该文运用信息理论中的惊异度(surprisal)概念表示语言冗余程度,通过计算汉语朗读语篇语料库(ASCCD)中汉字层级的一元惊异度(unigram surprisal)和二元惊异度(bigram surprisal),考察普通话中语言冗余、韵律结构和音节时长三者之间的关系。结果表明:给定前字的二元惊异度与重音程度有关,给定后字的二元惊异度与边界层级存在对应关系;在控制韵律结构因素后,语言冗余能够独立解释音节时长的变化,从而支持弱化版的平稳信号冗余假设。[Objective] Information theory in phonetics primarily investigates the relationship between language redundancy and acoustic features.Language redundancy refers to the predictability of linguistic information,which arises from lexical,syntactic,and semantic contextual factors.The more predictable the information,the higher its redundancy.Numerous studies suggest that when spoken,linguistic units with higher redundancy tend to be shorter in duration.The smooth signal redundancy hypothesis posits that the influence of language redundancy on duration is modulated by prosodic structures.These structures adjust acoustic features by assigning stress and boundaries to elements with lower language redundancy,thus achieving an inverse relationship between language redundancy and duration.However,these conclusions are predominantly based on Indo-European languages,leaving a research gap for Mandarin Chinese.Moreover,there is a lack of research on the correspondence between linguistic redundancy and prosodic structure.Thus,this study aims to investigate the relationships among language redundancy,prosodic structure,and syllable duration,specifically within the context of Mandarin Chinese.[Methods] This study quantifies language redundancy using the concept of surprisal,a principle derived from information theory.A large-scale textual corpus was used to train a 2-gram Chinese character-level language model,which was used to estimate unigram and bigram surprisal.Additionally,The corpus employed in this study comprises Annotated Speech Corpus of Chinese Discourse(ASCCD).The Chinese Tone and Break Index(C-ToBI) annotation system is employed to represent prosodic structures in terms of boundaries and stress.Concurrently,the duration of each syllable and its corresponding stress and boundary levels were recorded.A linear mixed-effect model was employed to explore the effects of language redundancy factors and prosodic structure on syllable duration.To verify whether language redundancy directly explains changes in syllable durati
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.49