A Novel Re-weighted CTC Loss for Data Imbalance in Speech Keyword Spotting  被引量:1

在线阅读下载全文

作  者:LAN Xiaotian HE Qianhua YAN Haikang LI Yanxiong 

机构地区:[1]School of Electronic and Information Engineering,South China University of Technology,Guangzhou 510641,China

出  处:《Chinese Journal of Electronics》2023年第3期465-473,共9页电子学报(英文版)

基  金:supported by the National Natural Science Foundation of China(61571192);Guangdong Basic and Applied Basic Research Foundation(2021A1515011454).

摘  要:Speech keyword spotting system is a critical component of human-computer interfaces.And connectionist temporal classifier(CTC)has been proven to be an effective tool for that task.However,the standard training process of speech keyword spotting faces a data imbalance issue where positive samples are usually far less than negative samples.Numerous easy-training negative examples overwhelm the training,resulting in a degenerated model.To deal with it,this paper tries to reshape the standard CTC loss and proposes a novel reweighted CTC loss.It evaluates the sample importance by its number of detection errors during training and automatically down-weights the contribution of easy examples,the majorities of which are negatives,making the training focus on samples deserving more training.The proposed method can alleviate the imbalance naturally and make use of all available data efficiently.Evaluation on several sets of keywords selected from AISHELL-1 and AISHELL-2 achieves 16%–38%relative reductions in false rejection rates over standard CTC loss at 0.5 false alarms per keyword per hour in experiments.

关 键 词:Speech keyword spotting Connectionist temporal classifier Data imbalance Sample importance re-weighting 

分 类 号:TN912.34[电子电信—通信与信息系统] TP181[电子电信—信息与通信工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象