检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:许嘉 杨攀原 吕品 刘恒 XU Jia;YANG Panyuan;LYU Pin;LIU Heng(School of Cyberspace Security,Guangzhou University,Guangzhou 510006 China;School of Computer Electronics and Information,Guangxi University,Nanning 530004,China;School of Information and Management,Guangxi Medical University,Nanning 530021,China)
机构地区:[1]广州大学网络空间安全学院,广东广州510006 [2]广西大学计算机与电子信息学院,广西南宁530004 [3]广西医科大学信息与管理学院,广西南宁530021
出 处:《工程科学与技术》2025年第1期80-88,共9页Advanced Engineering Sciences
基 金:国家自然科学基金项目(62067001)。
摘 要:随着大量中文MOOC平台的兴起,批改大规模学生提交的主观题作业成为教育研究领域亟待解决的问题。同行互评要求学生作为同行评价者来批改同伴的作业,是解决该挑战问题的主流方法。近年来,研究人员基于概率图模型对同行评价者的评分可靠性和偏见建模,有效提升了基于同行评价打分估计主观题作业真实分数的准确性。然而,现有概率图模型只考虑学生在本次作业上的得分对其评分可靠性的影响,未对可以直接衡量评价者评分可靠性的学生评分偏差进行建模,存在局限性。鉴于此,本文结合教师抽查的方式,基于学生评分偏差对评价者评价能力进行有效量化,并以此为基础提出两种新颖的同行互评概率图模型,即RPG_(6)(reliability-aware peer grading 6)和RPG_(7)(reliability-aware peer grading 7)。这两个模型在现有概率图模型的基础上,在学生的评分可靠性建模中添加了基于评分偏差感知的学生评价能力,以提高模型对作业真实分数的估计准确性。真实课堂实验表明,本文提出的RPG_(6)和RPG_(7)模型在同行互评活动中对作业真实分数的估计更为准确,比现有最好技术在均方根误差方面平均降低了11.75%。With the proliferation of many MOOC platforms,grading open-ended assignments submitted by many students presents a significant challenge in educational research.Peer assessment,which requires students to act as peer graders and evaluate their peers’submissions of assignments,is the mainstream solution to address this issue.Researchers have recently proposed various probabilistic graph models to evaluate peer graders’grading reliability and bias,effectively improving the estimated actual scores of assignments based on peer grades.However,the existing probabilistic graph models consider only the impact of students’scores on the current assignment regarding their grading reliability,failing to account for their scoring deviation,which directly measures their reliability.This limitation affects the performance of these models.Therefore,this study proposes two novel probabilistic graph models,RPG_(6) and RPG_(7),which incorporate the peer graders’grading ability,quantified based on their score deviation within a small proportion of submissions being spot-checked by teachers.These models,constructed on the foundation of two existing probabilistic graph models,represent the grading reliability of peer graders as a variable dependent on their scoring deviation-aware grading ability rather than their scores for the current assignment.This approach enhances the estimation of the true scores of assignments.Real classroom experiments demonstrated that the proposed RPG_(6) and RPG_(7) models achieve greater accuracy in estimating the true scores of assignments in peer assessment activities.Specifically,the RMSE values of RPG_(6) and RPG_(7) are,on average,11.75%lower than those of the state-of-theart method.
关 键 词:同行互评 概率图模型 真实分数估计 评分偏差 评价能力 抽查
分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222