Authors: Naila Habib Khan, Awais Adnan, Abdul Waheed, Mahdi Zareei, Abdallah Aldosary, Ehab Mahmoud Mohamed
Affiliations: [1] Department of Computer Science, Institute of Management Sciences, Peshawar, 25000, Pakistan [2] Department of Information Technology, Hazara University, Mansehra, 21120, Pakistan [3] School of Electrical and Computer Engineering, Seoul National University, Seoul, 08826, South Korea [4] Tecnologico de Monterrey, School of Engineering and Sciences, Zapopan, 45201, Mexico [5] Department of Computer Science, Prince Sattam Bin Abdulaziz University, As Sulayyil, 11991, Saudi Arabia [6] Electrical Engineering Department, College of Engineering, Prince Sattam Bin Abdulaziz University, Wadi Addwasir, 11991, Saudi Arabia [7] Electrical Engineering Department, Faculty of Engineering, Aswan University, Aswan, 81542, Egypt
Source: Computers, Materials & Continua, 2021, Issue 2, pp. 1347-1367 (21 pages; in English)
Abstract: Cursive text recognition of Arabic script-based languages like Urdu is extremely complicated due to the script's diverse and complex characteristics. Evolutionary approaches like genetic algorithms have been used in the past for various optimization and pattern recognition tasks, reporting exceptional results. The proposed Urdu ligature recognition system uses a genetic algorithm for optimization and recognition. Overall, the proposed recognition system comprises the stages of pre-processing, segmentation, feature extraction, hierarchical clustering, classification rules, and genetic algorithm optimization and recognition. The pre-processing stage removes noise from the sentence images, whereas in segmentation the sentences are segmented into ligature components. Fifteen features are extracted from each of the segmented ligature images. Intra-feature hierarchical clustering is then applied, producing clustered data. Next, classification rules are used to represent the clustered data. The genetic algorithm performs optimization using multi-level sorting of the clustered data to improve the classification rules used for recognition of Urdu ligatures. Experiments conducted on the benchmark UPTI dataset for the proposed Urdu ligature recognition system yield promising results, achieving a recognition rate of 96.72%.
Keywords: classification rules; genetic algorithm; intra-feature hierarchical clustering; ligature recognition; Urdu script
Classification number: TP3 [Automation and Computer Technology: Computer Science and Technology]