Improving Multi-task GNNs for Molecular Property Prediction via Missing Label Imputation  

在线阅读下载全文

作  者:Fenyu Hu Dingshuo Chen Qiang Liu Shu Wu 

机构地区:[1]University of Chinese Academy of Sciences,Beijing,100190,China [2]Center for Research on Intelligent Perception and Computing,State Key Laboratory of Multimodal Artificial Intelligence Systems,Institute of Automation,Chinese Academy of Sciences,Beijing,100190,China

出  处:《Machine Intelligence Research》2025年第1期131-144,共14页机器智能研究(英文版)

基  金:supported by the National Natural Science Foundation of China(Nos.62141608 and U19B 2038),the CAAI Huawei MindSpore Open Fund.

摘  要:The prediction of molecular properties is a fundamental task in the field of drug discovery.Recently,graph neural networks(GNNs)have been gaining prominence in this area.Since a molecule tends to have multiple correlated properties,there is a great need to develop the multi-task learning ability of GNNs.However,limited by expensive and time-consuming human annotations,collecting complete labels for each task is difficult.As a result,most existing benchmarks involve many missing labels in training data,and the performance of GNNs is impaired due to the lack of sufficient supervision information.To overcome this obstacle,we propose to improve multi-task molecular property prediction by missing label imputation.Specifically,a bipartite graph is first introduced to model the molecule-task co-occurrence relationships.Then,the imputation of missing labels is transformed into predicting missing edges on this bipartite graph.To predict the missing edges,a graph neural network is devised,which can learn the complex molecule-task co-occurrence relationships.After that,we select reliable pseudo labels according to the uncertainty of the prediction results.Boosting with enough and reliable supervision information,our approach achieves state-of-the-art performance on a variety of real-world datasets.

关 键 词:Graph classification imbalance learning prediction bias mixture of experts multiview representations 

分 类 号:TP183[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象