A Preliminary Study on the Acoustic Landmarks of Voice Quality in Voiceprint Identification: A Study Based on Random Forest and Decision Tree Models


Authors: GENG Puyang; SHI Shaopei; GUO Hong; BIAN Xinwei; LU Qimeng; ZENG Jinhua (Shanghai Forensic Service Platform, Key Laboratory of Forensic Science, Ministry of Justice, Academy of Forensic Science, Shanghai 200063, China)

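Affiliation: [1] Academy of Forensic Science, Shanghai Forensic Service Platform, Key Laboratory of Forensic Science, Ministry of Justice, Shanghai 200063, China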

Source: Chinese Journal of Forensic Sciences (《中国司法鉴定》), 2022, No. 4, pp. 54-59 (6 pages)

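Funding: National Social Science Fund of China, Youth Project (21CYY011); Central-Level Research Institutes Public Welfare Project (GY2021G-9, GY2019G-2, GY2018G-4); Shanghai Forensic Service Platform funded project (19DZ2292700).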

Abstract: Objective: Voice quality is an important reference feature in voiceprint identification (forensic voice comparison), but in current casework the judgment of voice quality type still lacks support from objective data. Methods: Using random forest and decision tree models, this study explores the acoustic landmarks of four voice quality types (normal, creaky, breathy, and falsetto) with 18 acoustic parameters. Results: The random forest classified voice quality type with 90.7% accuracy; fundamental frequency (F0), whole-syllable duration, harmonics-to-noise ratio (HNR), jitter and shimmer (F0 and amplitude perturbation), and the amplitude difference between the first harmonic and the third formant (H1-A3) contributed most to the classification. The decision tree model showed that the four types can be separated at three decision nodes (HNR, F0 mean, and H1-A3), with classification accuracy above 75%. Conclusion: Parameters such as F0, HNR, and harmonic amplitude differences support good voice quality classification, and the acoustic landmarks between voice quality types are a useful and feasible reference for judging voice quality type in voiceprint identification.
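The three decision nodes named in the abstract (HNR, F0 mean, and H1-A3) can be pictured as a short cascade of threshold tests. The sketch below is only an illustration of that idea: the threshold values, the order of the tests, and the voice type assigned at each node are hypothetical placeholders, since the abstract does not report them. The names `VoiceSample` and `classify_voice` are likewise invented for this example.

```python
from dataclasses import dataclass


@dataclass
class VoiceSample:
    hnr_db: float      # harmonics-to-noise ratio, in dB
    f0_mean_hz: float  # mean fundamental frequency, in Hz
    h1_a3_db: float    # first-harmonic minus third-formant amplitude, in dB


# Hypothetical placeholder thresholds; NOT values reported in the paper.
HNR_THRESHOLD_DB = 10.0
F0_THRESHOLD_HZ = 300.0
H1_A3_THRESHOLD_DB = 20.0


def classify_voice(sample: VoiceSample) -> str:
    """Assign one of four voice quality labels with three threshold tests."""
    # Node 1 (HNR): low harmonicity is assumed here to indicate creaky voice.
    if sample.hnr_db < HNR_THRESHOLD_DB:
        return "creaky"
    # Node 2 (F0 mean): a very high mean pitch is assumed to indicate falsetto.
    if sample.f0_mean_hz > F0_THRESHOLD_HZ:
        return "falsetto"
    # Node 3 (H1-A3): a steep spectral tilt is assumed to indicate breathy voice.
    if sample.h1_a3_db > H1_A3_THRESHOLD_DB:
        return "breathy"
    # Samples that pass all three tests are labelled normal.
    return "normal"


if __name__ == "__main__":
    print(classify_voice(VoiceSample(hnr_db=18.0, f0_mean_hz=120.0, h1_a3_db=12.0)))
```

In practice the split points would be learned from labelled recordings by a decision tree fitted to the 18 acoustic parameters, not set by hand as they are here.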

Keywords: voice quality; acoustic landmark; random forest; decision tree model

Classification code: D918.9 [Politics and Law: Law]

 
