检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:焦佳辉 马思远[1,2] 宋玉[2] 宋伟[1] JIAO Jia-hui;MA Si-yuan;SONG Yu;SONG Wei(Henan Academy of Big Data,Zhengzhou University,Zhengzhou 450052;School of Computer and Artificial Intelligence,Zhengzhou University,Zhengzhou 450001,China)
机构地区:[1]郑州大学河南省大数据研究院,河南郑州450052 [2]郑州大学计算机与人工智能学院,河南郑州450001
出 处:《计算机工程与科学》2023年第12期2226-2236,共11页Computer Engineering & Science
摘 要:在音乐信息检索(MIR)领域,根据音乐流派进行分类是一项具有挑战性的任务。传统的音频特征工程方法需要手动地选择并提取音乐信号特征进行处理,导致特征提取过程复杂,模型性能不稳定,泛化性差。深度学习与频谱图相结合的方法也有着部分数据不适合模型和全局特征提取困难等问题。提出了一种基于卷积注意力机制的音乐流派分类模型MGTN。MGTN融合了输入频谱图与提取音频信号特征构建音频时序数据2种音乐流派分类方法,使得模型提取特征的能力与泛化性大大提升,提供了音乐流派分类的新思路。在GTZAN与Ballroom数据集上的实验结果表明,MGTN模型能够有效地融合2种不同模态的输入数据。在与数十种基准模型进行的对比中,MGTN模型具备较强的优势。In the field of music information retrieval(MIR),classification according to music genres is a challenging task.Traditional audio feature engineering methods requires manually selecting and extracting music signal features for processing,resulting in complex feature extraction process,unstable model performance and poor generalization.The method combining deep learning with spectrogram also has some problems such as unsuitable model for some data and difficulty in global feature extraction.This paper proposes a music genre classification model based on convolutional attention mechanism,called MGTN.MGTN combines two music genre classification methods:input spectrogram and audio signal feature extraction,to construct audio time series data,which greatly improves the model's ability to extract features and generalization,and provides a new idea for music genre classification.Experimental results on GTZAN and Ballroom datasets show that the MGTN model can effectively fuse input data from two different modalities.Compared with dozens of benchmark models,the MGTN model has strong advantages.
关 键 词:音乐流派分类 Transformer模型 频谱图 音频特征工程 注意力机制
分 类 号:TP301[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.30