Application of Human-Computer Collaboration Text Mining Technology in Educational Qualitative Research: Based on Chinese Text Data of Teaching Evaluation in a University

Cited by: 6


Authors: 王金羽 (WANG Jin-yu), 詹逸思 (ZHAN Yi-si)[2], 冯起 (FENG Qi), 李曼丽 (LI Man-li)[1]

Affiliations: [1] Institute of Education, Tsinghua University, Beijing 100084; [2] Center for Student Learning and Development, Tsinghua University, Beijing 100084; [3] Department of Electrical Engineering, Tsinghua University, Beijing 100084

Source: Tsinghua Journal of Education (《清华大学教育研究》), 2022, No. 2, pp. 56-63 (8 pages)

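Funding: Tsinghua University Initiative Scientific Research Program, "Preliminary Research on the Design of Social Experiment Methods in Education under Artificial Intelligence Conditions" (2019THZWYY05).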

Abstract: The massive growth of text material in the information age has given qualitative researchers a wealth of data, yet this material remains underexamined because effective methods for analyzing massive Chinese text data have yet to be developed. This article pioneers the use, within the qualitative research paradigm, of a human-computer collaboration method represented by the structural topic model (STM) in the R language, and applies it to mine classroom observation records from a university's evaluation of online teaching effectiveness. Taking the analysis of this teaching evaluation data as an example, it presents in full the four steps of applying STM to data mining in educational qualitative research and analyzes the method's distinctive advantages in mining massive Chinese text data. The study indicates that trying interdisciplinary research methods helps address the inherent difficulties of qualitative analysis of massive Chinese texts in education and even in the wider humanities and social sciences.

Keywords: structural topic model (STM); mining of very large text corpora; educational qualitative research

Classification code: G40-034 [Culture and Science: Principles of Education]

 
