实现对教育课件关键信息的提取——以“数据库原理”课程为例  被引量:1

Realization of the key information of the education courseware:taking the “Database Principle” course as an example

在线阅读下载全文

作  者:谢志庆 张晓天 闫秋艳[1] 胡妍 高淑娟 Xie Zhiqing;Zhang Xiaotian;Yan Qiuyan;Hu Yan;Gao Shujuan(School of Computer Science and Technology,China University of Mining and Technology,Xuzhou 221116,China)

机构地区:[1]中国矿业大学计算机科学与技术学院

出  处:《无线互联科技》2019年第12期66-69,共4页Wireless Internet Technology

摘  要:目前,雨课堂的使用产生了大量学生观看演示文稿的数据,如何更加高效地利用这些数据成了文章的研究起点。为此,需要按页提取演示文稿中的关键信息。文章通过分析演示文稿文件的设计特点,建立一个评价体系,对演示文稿中的文本内容依据文本特征(颜色、字号、字体、粗体、斜体)进行分析从而估计重要指数。结合重要指数的评分,选取最大的k个_Run对象提取关键词或是结合TF-IDF算法,根据词频提取关键词,以实现对教学课件按页提取关键信息。借助Python的pptx模块和jieba模块,实现教学课件关键词的提取。最后,以“数据库原理”课程为例,进行关键词的提取,以此进行有效性的检验。结果表明,文章所提出的基于演示文稿文本属性的关键词提取算法准确率可以达到82.32%。At present,the use of rain classroom has produced a large number of students to watch the data of the presentation,and how to make more efficient use of these data becomes the starting point of this paper.It needs to extract key information from the presentation by page to do this.By analyzing the design features of the presentation file,the article sets up an evaluation system,and the text content in the presentation is analyzed according to the text features(color,font size,font,bold,italic)to estimate the important index.Combining the scores of the important indexes,selecting the largest k_Run object to extract keywords or combining the TF-IDF algorithm,extracting the keywords according to the word frequency,In this paper,the key information is extracted by page for the teaching courseware.With the help of pptx module and jieba module of Python,the keyword extraction of teaching courseware is realized.Finally,taking the“Database Principle”course as an example,the keyword extraction is carried out to test the effectiveness.The results show that the accuracy of the keyword extraction algorithm based on presentation text attributes can reach 82.32%.

关 键 词:教学课件 关键词提取 文本特征 PYTHON 雨课堂 

分 类 号:TP3[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象