多反复结构模型的精确音乐分离方法  被引量:11

Music/voice separation based on the multi-repeating structure of Mel-frequency cepstrum coefficients

在线阅读下载全文

作  者:张天骐[1] 徐昕[1] 吴旺军[1] 刘瑜[1] 

机构地区:[1]重庆邮电大学信号与信息处理重庆市重点实验室,重庆400065

出  处:《声学学报》2016年第1期135-142,共8页Acta Acustica

基  金:国家自然科学基金(61371164;61275099;61102131);信号与信息处理重庆市市级重点实验室建设项目(CSTC2009CA2003);重庆市杰出青年基金(CSTC2011jjjq40002);重庆市自然科学基金(CSTC2012JJA40008);重庆市教育委员会科研项目(KJ120525;KJ130524);重庆市研究生科研创新项目(CYS14140)资助

摘  要:针对基本反复模型音乐分离方法自适应性差的问题,提出一种基于美标度倒谱系数(MFCC)的多反复结构模型的音乐分离方法。首先,提取出音乐信号的MFCC系数矩阵(39维的数据构成);然后利用余弦特性得到其相似矩阵,进而将相似度一致的片段划分到一起,建立不同的反复结构模型;之后结合理想二元掩蔽(]BM)分离出背景音乐及歌声的频谱,相应的时域信号则由傅里叶逆变换获得;最后,在不同类型、长度的音乐文件上测试了算法性能,将提出的算法与Rafii的反复算法和Ozerov的灵活窗非负矩阵分解方法进行对比。实验结果表明,改进方法在分离性能上最高提高3 dB左右,并且对于曲调变换大的音乐提高效果更为明显,从而证实了改进方法是一种有效的音乐分离方法,并且更具稳定性。For the poor adaptability of the original repeating pattern, an improved music separation method of multirepeating structure of Mel-Frequency Cepstrum Coefficient (MFCC) was proposed. Firstly, the MFCC coefficient matrix (39-dimensional data) of the music signal was extracted; then the cosine characteristic was applied to the count of similarity matrix of MFCC, and putted the fragments with consistent similarity together, next built different repeating patterns for groups with different, thereby the spectrums of the background music and vocal were separated combined with ideal binary masking (IBM), the corresponding time domain signals were obtained by inverse Fourier transform; finally, the improved method was tested on the music database of different types and length, and the separation results were compared with repeating method of Rafii and the non-negative matrix factorization based on flexible framework method of Ozerov. The experimental results showed that the separation performance of improved method was improved about 3 dB, the performance of music with melody changed larger was significantly improved, thus verifying that that the improved method was an effective music separation algorithm and more

关 键 词:音乐信号 分离方法 结构模型 算法性能 自适应性 倒谱系数 系数矩阵 MFCC 

分 类 号:J605[艺术—音乐] TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象