检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]湖南文理学院计算机科学与技术学院,湖南常德415000 [2]湖南大学电气与信息工程学院,长沙410082
出 处:《计算机应用》2010年第2期449-452,共4页journal of Computer Applications
摘 要:针对现有的文本提取算法不能适应复杂背景变化和文字本身的形状变化问题,提出一种基于敏感点颜色两级聚类和文本行聚类筛选的方法。新方法利用人眼视觉对颜色大幅度变化更敏感的特点,以敏感点的主要颜色作为聚类分析的依据,克服了现有阈值方法和聚类方法受背景颜色变化影响较大的问题。在此基础上,以文本行的空间排列特征为依据进进行文本行筛选,以克服一般方法容易受文字形状和尺寸变化影响的缺点。实验表明,新方法对于背景的复杂变化和文字的形状尺寸变化都具有很好的适应性。Since the existing text extraction methods can not adapt to the variation of complex background and shape, a new method was brought forward. It was founded on two-level color clustering of sensible points and text-line clustering. Because human vision perception is more sensitive to great change of colors, the new method only selected the main colors at sensible points to cluster. The strategy could solve the problems of the existing methods based on threshold and clustering which were greatly influenced by the variation in colors of complex background. And then, the text-lines were selected according to the fact that texts always align with each other in a. same text-line. That course can eliminate the influence of variation in shape and size of characters. Experimental results indicate that, the new method has good adaptability to complex change of background, and texts with different size and shape.
分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.148