Research on Statistical Algorithms for the Coverage Rate of Commonly Used Chinese Characters


Authors: ABUDUKELIMU Yusufu [1,2]; YANG Qin; WANG Liang-liang

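Affiliations: [1] Modern Education Technology Center, Xinjiang Education Institute, Urumqi 830043, Xinjiang, China; [2] Xinjiang Laboratory of Education Cloud Technology and Resources, Urumqi 830043, Xinjiang, China; [3] School of Information Science and Technology, Xinjiang Education Institute, Urumqi 830043, Xinjiang, China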

Source: Computer Technology and Development, 2020, No. 5, pp. 201-205, 210 (6 pages)

Funding: Open Project of the Key Laboratory of Xinjiang Uygur Autonomous Region (2019D04024).

Abstract: This paper studies algorithms for computing the coverage rate, usage rate, and character frequency of commonly used Chinese characters in the electronic texts of educational resources, and implements a statistical analysis system based on these algorithms. For a given text, the system reports the coverage rate of commonly used characters, the number of Chinese characters in the text, the frequency of each commonly used character, and the usage rate of commonly used characters, and displays the results as pie charts. To examine how well commonly used characters cover real text and how they are used, the system was applied to a set of electronic texts. The results show that both coverage and usage rates are high: on average, the 581 commonly used characters cover at least 68.9% of a text, the 1,000 commonly used characters at least 81.4%, and the 2,500 commonly used characters at least 96%. The frequency of individual commonly used characters also differs across the texts analyzed.
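The abstract names the statistics but does not give their formulas, so the following is only a minimal Python sketch under stated assumptions, not the authors' algorithm. It assumes that coverage is the share of Chinese character occurrences in a text that belong to the common-character list, and that usage rate is the share of list characters that appear at least once in the text. The function names, the five-character toy list, and the restriction to the basic CJK Unified Ideographs block (U+4E00 to U+9FFF) are all illustrative choices.

```python
from collections import Counter


def is_han(ch: str) -> bool:
    """Rough test for a Chinese character (assumption: basic CJK block only)."""
    return "\u4e00" <= ch <= "\u9fff"


def common_char_stats(text: str, common_chars: set) -> dict:
    """Compute coverage, usage rate, and frequencies under the assumed definitions."""
    # Keep only Chinese characters; punctuation, digits, and Latin letters are skipped.
    han = [ch for ch in text if is_han(ch)]
    total = len(han)

    # Frequency of each common character that occurs in the text.
    freq = Counter(ch for ch in han if ch in common_chars)
    covered = sum(freq.values())

    return {
        "total_han": total,
        # Assumed definition: common-character occurrences / all Chinese characters.
        "coverage": covered / total if total else 0.0,
        # Assumed definition: distinct common characters seen / size of the list.
        "usage": len(freq) / len(common_chars) if common_chars else 0.0,
        "freq": freq,
    }


# Toy example with a hypothetical five-character "common" list.
common = set("的一是不了")
stats = common_char_stats("我的书是一本好书。", common)
print(stats["total_han"], stats["coverage"], stats["usage"])
```

In this toy text, 3 of the 8 Chinese characters are on the list, and 3 of the 5 list characters occur. A real run would replace `common` with the 581-, 1,000-, or 2,500-character list used in the study.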

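Keywords: commonly used characters; statistical algorithm; coverage statistics; usage-rate statistics; character-frequency statistics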

Classification code: TP391.1 [Automation and Computer Technology: Computer Application Technology]

 
