检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:王培刚[1] WANG Peigang(Hubei Communications Technical College,Wuhan,Hubei Province,430202 China)
出 处:《科技资讯》2024年第15期35-37,共3页Science & Technology Information
摘 要:自动音频标注的目的是从音频输入生成能够描述此音频的一段文字。目前,音频标注模型的效果欠佳,并且在改善音频标注效果的过程中很少有应用预加载模型。自动音频标注的目标为音频片段产生合适的描述语句,拥有处理音频模态和文本模态数据的能力。为此,对音频模态与文本模态的预加载模型进行研究,并提出基于音频模态的自动标注系统和基于文本模态的自动标注系统,解决传统标注方法中训练和测试阶段目标不一致的问题。The purpose of automatic audio tagging is to generate a paragraph of texts that can describe the audio from the audio input.Currently,the effectiveness of audio tagging models is not good,and there are few applica⁃tions of preloading models in improving the audio tagging effect.The goal of automatic audio tagging is to generate appropriate descriptive statements for audio segments,and to have the ability to process audio and text modal data.Therefore,research is conducted on the preloading models of audio and text modalities,and automatic tagging based on audio modality and text modality are proposed to solve the problem of inconsistent goals in the training and testing stages of traditional tagging methods.
分 类 号:TN912.3[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.49