基于FP_Growth关联规则挖掘的多轨道数字音频文件分类研究  被引量:3

Research on multi-track digital audio file classification based on FP_Growth association rule mining

在线阅读下载全文

作  者:谢抢来[1] 杨威[1] XIE Qianglai;YANG Wei(Jiangxi University of Technology,Nanchang 330004,China)

机构地区:[1]江西科技学院

出  处:《现代电子技术》2020年第1期179-182,186,共5页Modern Electronics Technique

基  金:江西省教育厅科学技术研究项目(GJJ180978);江西省科技厅科学技术研究项目(20171BBE50060);江西科技学院科学技术研究项目(JY1718);江西科技学院科学技术研究项目(16ZRYB08)

摘  要:为从海量音频信息中快速准确地获取所需文件,研究基于FP_Growth关联规则挖掘的多轨道数字音频文件分类方法。以数据立方体下具有多维属性的多轨道数字音频文件为研究对象,采用FP_Growth关联规则挖掘算法排序多轨道数字音频文件数据集的频繁1-相集,采用FP tree挖掘频繁项集获取多维关联规则集。识别并提取多轨道数字音频文件中某类型音频文件中的音频内容,使用删除停用词结合TF/IDF的计算方式获取清洁的音频文件,应用音频文件数据集的多维关联规则集,综合考虑匹配规则数和置信度,搜索规则集,获取该类型音频文件最相配类别,完成多轨道数字音频文件分类。实验结果表明,所提方法能够直观有效地分类多轨道数字音频文件,且分类结果准确率和召回率平均值分别达到96.60%和96.54%,显著高于对比方法。A multi⁃track digital audio file classification method based on FP_Growth association rule mining is studied to get the required files quickly and accurately from the massive audio information.By taking multi⁃track digital audio files with multidimensional attributes under the data cube as the research object,FP_Growth association rules mining algorithm is adopted to sort the frequent 1⁃phase sets of multi⁃track digital audio file data sets,and FP tree mining frequent item sets are adopted to obtain multidimensional association rules.The audio content in a certain type of audio file in multi⁃track digital audio files is identified and extracted,the calculation method of deleting stop words combined with TF/IDF is adopted to obtain clean audio files.The multidimensional association rules set of audio file data set is applied to take into account comprehensively the matching rule number and degree of confidence,search the rule sets,obtain the most suitable type of audio file categories and complete the classification of multi⁃track digital audio files.The experimental results show that the proposed method can intuitively and effectively classify multi⁃track digital audio files,and the averages of accuracy and recall rate of classification result reach 96.60%and 96.54%respectively,significantly higher than the comparison method.

关 键 词:关联规则 数据挖掘 多轨道 数字音频 文件分类 数据立方体 

分 类 号:TN911.72-34[电子电信—通信与信息系统] TP301.6[电子电信—信息与通信工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象