检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]信息工程大学,河南郑州450001
出 处:《信息工程大学学报》2015年第6期711-717,共7页Journal of Information Engineering University
基 金:国家自然科学基金资助项目(61175017)
摘 要:根据测试集中词发生次数调整候选关键词置信度得分,提出一种新的基于ATWV(actual term-weighted value)优化的词相关置信度规整算法。针对ATWV优化计算中存在的置信度偏差问题,分别进行偏差线性补偿和区分性补偿,其中线性补偿通过添加加权和平移系数,以线性方式调整置信度得分;区分性补偿则通过区分性模型训练,将置信度转化为满足ATWV计算要求的正确分类概率,降低置信度偏差带来的影响。基于英文WSJ语料库的关键词识别实验表明,新的置信度规整方法可显著提高系统识别性能。This paper propose a novel term-dependent confidence normalization method based on ATWV( Actual Term-Weighted Value) optimization,where the words' confidence score is adjusted according to their frequency in the test. For the confidence bias in the ATWV optimization,we propose a linear compensation and a discriminative compensation. The linear compensation adjusts confidence in a linear way by adding weighted and translation factors,while the discriminative compensation converts confidence score to classification posterior probability,which meets the requirements of ATWV optimization,by discriminative model training. Experimental results based on WSJ Speech Corpora show that the novel confidence normalization measures can greatly improve the performance of system.
分 类 号:TN391[电子电信—物理电子学]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.117