基于晶体图卷积神经网络的晶格能回归模型  

Lattice energy regression model based on crystal graph convolutional neural networks

作  者:郑欣雨 任泽华 周利 柴士阳 吉旭[1] ZHENG Xinyu;REN Zehua;ZHOU Li;CHAI Shiyang;JI Xu(School of Chemical Engineering,Sichuan University,Chengdu 610000,Sichuan,China)

机构地区:[1]四川大学化学工程学院,四川成都610000

出  处:《化工学报》2025年第3期1084-1092,F0004,共10页CIESC Journal

基  金:国家自然科学基金项目(22308228)。

摘  要:晶格能是决定晶体热力学稳定性的关键物理性质,对药物多晶型稳定性的筛选具有指导意义。晶格能的获取方式通常为实验试错和基于分子/量子力学的理论计算,对于数量庞大的晶型结构,两种方法均费时费力。提出一种基于密度泛函理论(density functional theory,DFT)和晶体图卷积神经网络(crystal graph convolutional neural networks,CGCNN)的晶格能回归模型。首先采用自洽屏蔽多体色散校正的DFT方法计算晶格能,建立包含酸、醇、酰胺、氨基酸、酸酐等248种晶型的晶格能数据集;基于所建立的数据集,采用CGCNN进一步建立晶型和晶格能之间的定量回归模型,该模型训练集和测试集的MAPE分别为1.24%和5.04%,R2分别为0.9978和0.9750,表明该模型具有较好的预测效果,可以为高通量筛选稳定的晶型提供理论指导。The lattice energy is a critical physical property determining the thermodynamic stability of crystals and holds instructive significance in screening the stability of polymorphism.Lattice energy is usually obtained by experimental trial and error as well as theoretical calculation based on molecular/quantum mechanics.For a large number of crystal structures,both methods are time-consuming and laborious.In this paper,a lattice energy regression model based on density functional theory(DFT)and crystal graph convolutional neural networks(CGCNN)is proposed.First,the lattice energies were calculated using the range-separated self-consistent screened many-body dispersion corrected DFT method.A dataset comprising lattice energies for 248 crystal structures including acids,alcohols,amides,amino acids,and anhydrides was established.Subsequently,leveraging this dataset,a crystal graph convolutional neural networks model was employed to establish a quantitative regression model for the relationship between crystal structures and lattice energies,which demonstrated promising predictive performance with mean absolute percentage error(MAPE)values of 1.24%for the training set and 5.04%for the test set,and R2 values of 0.9978 and 0.9750,respectively.The results show that the model has a good predictive performance which can provide theoretical guidance and technical support for high-throughput screening of stable crystal forms.

关 键 词:晶格能 多晶型 密度泛函理论 神经网络 回归模型 

分 类 号:TP183[自动化与计算机技术—控制理论与控制工程] TP391[自动化与计算机技术—控制科学与工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象