FNNWV: farthest-nearest neighbor-based weighted voting for class-imbalanced crowdsourcing


Authors: Wenjun ZHANG, Liangxiao JIANG, Ziqi CHEN, Chaoqun LI

Affiliations: [1] School of Computer Science, China University of Geosciences, Wuhan 430074, China; [2] Key Laboratory of Artificial Intelligence, Ministry of Education, Shanghai 200240, China; [3] School of Mathematics and Physics, China University of Geosciences, Wuhan 430074, China

Source: Science China (Information Sciences) [中国科学(信息科学), English edition], 2024, No. 10, pp. 168-184 (17 pages)

Funding: Partially supported by the National Natural Science Foundation of China (Grant No. 62276241) and the Foundation of Key Laboratory of Artificial Intelligence, Ministry of Education, China (Grant No. AI2022004).

Abstract: In crowdsourcing scenarios, crowd workers are hired to label crowdsourced tasks, and label integration algorithms then infer an integrated label for each instance in those tasks. As more and more label integration algorithms have been proposed, the performance of inference based only on the information of the inferred instance has gradually converged. Recent algorithms therefore also exploit the information of the inferred instance's nearest neighbors and achieve good performance. However, when crowdsourced tasks are class-imbalanced, negative instances are more likely to appear among the nearest neighbors because they are the majority, so these algorithms are more easily biased toward the negative class. To this end, this paper proposes a novel label integration algorithm called farthest-nearest neighbor-based weighted voting (FNNWV) for class-imbalanced crowdsourcing. Specifically, FNNWV considers the nearest neighbors to be more similar to the inferred instance and thus uses them to vote aye in weighted voting. At the same time, FNNWV considers the farthest neighbors to be more different from the inferred instance and thus uses them to vote nay in weighted voting. Since negative instances are more likely to appear among both the nearest neighbors and the farthest neighbors, FNNWV weakens the effect of negative instances by combining aye and nay votes. Experimental results on 22 simulated datasets and one real-world crowdsourced dataset show that FNNWV significantly outperforms all the other state-of-the-art competitors.
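The aye/nay idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the weighting formula, so the sketch assumes Euclidean distance, equal weights for all neighbors, and the same number `k` of nearest and farthest neighbors. The function name `fnn_weighted_vote` and its parameters are hypothetical.

```python
import numpy as np


def fnn_weighted_vote(features, label_counts, k=3):
    """Illustrative farthest-nearest neighbor voting (assumed details, see lead-in).

    features:     (n, d) array of instance feature vectors.
    label_counts: (n, c) array; entry [i, j] counts workers who gave class j to instance i.
    k:            number of nearest and of farthest neighbors (hypothetical parameter).
    Returns an (n,) integer array with one integrated label per instance.
    """
    features = np.asarray(features, dtype=float)
    label_counts = np.asarray(label_counts, dtype=float)
    n = features.shape[0]
    if n < 2:
        raise ValueError("need at least two instances to have neighbors")

    # Turn each instance's worker labels into a class distribution.
    totals = label_counts.sum(axis=1, keepdims=True)
    dist = label_counts / np.maximum(totals, 1.0)

    # Pairwise Euclidean distances (assumption: the paper may use another measure).
    diffs = features[:, None, :] - features[None, :, :]
    distances = np.sqrt((diffs ** 2).sum(axis=2))

    integrated = np.empty(n, dtype=int)
    for i in range(n):
        others = np.delete(np.arange(n), i)
        order = others[np.argsort(distances[i, others], kind="stable")]
        nearest, farthest = order[:k], order[-k:]

        score = dist[i].copy()                # the instance's own noisy labels
        score += dist[nearest].mean(axis=0)   # nearest neighbors vote aye
        score -= dist[farthest].mean(axis=0)  # farthest neighbors vote nay
        integrated[i] = int(np.argmax(score))
    return integrated
```

Because the majority class tends to dominate both neighbor sets, its aye and nay contributions partly cancel, which is the effect the abstract describes. In this sketch the two neighbor sets overlap when fewer than `2 * k` other instances are available, so `k` should be chosen small relative to the dataset size.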

Keywords: crowdsourcing; label integration; nearest neighbor; farthest neighbor; weighted voting

Classification code: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]

 
