检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:刘亚多[1] 李伟[1] 李晓强[2] 汪竹蓉[1] 冯瑞[1]
机构地区:[1]复旦大学计算机科学技术学院,上海200433 [2]上海大学计算机工程与科学学院,上海200072
出 处:《电子学报》2010年第5期1172-1176,共5页Acta Electronica Sinica
基 金:国家自然科学基金(No.60873255);上海市科技攻关计划(No.09511501404);上海市科委重点科技攻关项目(No.08511501303)
摘 要:对互联网海量MP3格式音乐数据进行基于内容的有效检索是当前一个重要而又很少涉及的研究方向.本文提出一种基于MDCT频谱熵的压缩域音频指纹算法,对各种常规频域和时间域的音频信号处理失真具有较强的鲁棒性.模拟实验在包含100首不同中文流行歌曲的音乐数据库上进行.对经受各种严重信号处理失真的粒度为5s左右的查询片段,能够取得超过90%的首位正确识别率.With the proliferation of MP3 music, compressed-domain music information retrieval from the Intemet has come into being an import,ant and urgent research field. In this paper, we propose a novel compressed-domain audio fingerprinting algo- rithm based on MDCT spectral entropy. The input MP3 music file is fast partially decompressed to obtain MDCT coefficients as intermediate results, whereby we calculate the MDCT spectral entropy through consecutive long windows and come to the final fingerprint sequence by magnitude relationship modeling. Such fingerprint exhibits strong robustness against various frequency- and timedomain audio distortions due to its statistically stable nature. Experimental results show that in our test datahase which is composed of 100 distinct Chinese pop songs, a 5s music clip is sufficient to identify its original recording in real time, with more than 90% top one precision rate even under various severe audio signal distortions.
关 键 词:音频指纹 压缩域 鲁棒性 MDCT频谱熵 音乐检索
分 类 号:TN919[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222