检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]大连民族学院计算机科学与工程学院,辽宁大连116600
出 处:《计算机应用》2009年第12期3360-3362,3365,共4页journal of Computer Applications
基 金:国家自然科学基金资助项目(60803096);国家民委项目(07DL07)
摘 要:为了能够对文档中的少数民族文字种类进行正确地识别分类,提出一种基于小波分析与改进的二次分类函数(MQDF)的少数民族文字种类识别方法。该方法采用多辨识小波分解,从而获得小波能量和小波能量比例分布的特征描述,利用MQDF分类器对少数民族文种进行识别。构建藏文、西双版纳傣文、纳西象形文、维吾尔文、德宏傣文和彝文6种常用的少数民族文字及汉字、英语共8种文字的样本库,采用该方法对少数民族的样本库进行了进行训练和测试。实验结果显示,该方法在多层小波分解的情况下,对于少数民族文种识别的精度好于传统的贝叶斯和KNN。In order to classify the type of the Chinese minority scripts, the method of identifying the kinds of Chinese minority scripts based on wavelet analysis and Modified Quadratic Discriminant Function (MQDF) was presented. Using wavelet energy and wavelet energy distribution proportion as features by wavelet multi-resolution transform, muhivariate classifier in MQDF was constructed. A sample data set was built which contained six common Chinese minority scripts: Tibetan, Tai Lue, Naxi Pictographs, Uighur, Tai Le, Yi and Chinese and English in total, some samples were used for training, others were for testing, and the proportions of the training samples in dataset were variant. Obviously, the experimental result shows that, in muhi-level decomposition, the method is better than the traditional Bayes and K-Nearest Neighbor (KNN) classification in recognition rate.
关 键 词:中国少数民族文字 文种识别 小波分析 改进的二次分类函数
分 类 号:TP391.43[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.80