检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:LAN Xiaotian HE Qianhua YAN Haikang LI Yanxiong
出 处:《Chinese Journal of Electronics》2023年第3期465-473,共9页电子学报(英文版)
基 金:supported by the National Natural Science Foundation of China(61571192);Guangdong Basic and Applied Basic Research Foundation(2021A1515011454).
摘 要:Speech keyword spotting system is a critical component of human-computer interfaces.And connectionist temporal classifier(CTC)has been proven to be an effective tool for that task.However,the standard training process of speech keyword spotting faces a data imbalance issue where positive samples are usually far less than negative samples.Numerous easy-training negative examples overwhelm the training,resulting in a degenerated model.To deal with it,this paper tries to reshape the standard CTC loss and proposes a novel reweighted CTC loss.It evaluates the sample importance by its number of detection errors during training and automatically down-weights the contribution of easy examples,the majorities of which are negatives,making the training focus on samples deserving more training.The proposed method can alleviate the imbalance naturally and make use of all available data efficiently.Evaluation on several sets of keywords selected from AISHELL-1 and AISHELL-2 achieves 16%–38%relative reductions in false rejection rates over standard CTC loss at 0.5 false alarms per keyword per hour in experiments.
关 键 词:Speech keyword spotting Connectionist temporal classifier Data imbalance Sample importance re-weighting
分 类 号:TN912.34[电子电信—通信与信息系统] TP181[电子电信—信息与通信工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.7