Prediction of Chemical Bond Dissociation Energies of Small Organic Molecules Based on Random Forest

Authors: LUAN Yue; KONG Dingling; GUO Lili; ZHANG Qingyou*; ZHOU Yanmei* [1]

Affiliation: [1] Henan Engineering Research Center of Industrial Circulating Water Treatment, College of Chemistry and Molecular Sciences, Henan University, Kaifeng 475004, China

Source: Chemical Journal of Chinese Universities (高等学校化学学报), 2025, No. 3, pp. 44-52 (9 pages)

Funding: Supported by the National Natural Science Foundation of China (Grant No. 22278112).

Abstract: A total of 1208 organic molecules containing C, H, O, N, and S atoms were manually collected from the iBonD organic bond energy database, and the corresponding experimental bond dissociation energies were recorded. Chemical bond type descriptors, heteroatom descriptors, and branching-degree descriptors were proposed and combined with the previously proposed atom type descriptors to describe the environment around the target chemical bond more comprehensively. Prediction models for bond dissociation energy were built with random forest. The results show that the combination of atom type and chemical bond type descriptors around the target bond gives the best predictions, and that good predictions were obtained without the assistance of quantum chemistry. Compared with previously reported predictions, the results here are better than the corresponding results in the literature. In addition, an applicability domain algorithm was designed to give a preliminary judgment of the quality of a prediction, the training and test sets were randomly re-partitioned to verify the stability of the model, and the model was compared with a zero model to judge its feasibility.
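The abstract does not define the descriptors, so the following is only a minimal sketch of one way atom types and bond types around a target bond might be counted. It assumes a molecule given as a plain graph with explicit hydrogens, and it groups atoms and bonds into shells by topological distance from the target bond. The function name `bond_environment`, the shell scheme, and the toy ethanol input are hypothetical illustrations, not the authors' definitions.

```python
from collections import Counter, deque

def bond_environment(atoms, bonds, target, max_shell=2):
    """Count atom types and bond types in shells around a target bond.

    Hypothetical scheme (an assumption, not taken from the paper):
    an atom's shell is its topological distance to the nearer end of
    the target bond; a bond's shell is the larger of its two atoms' shells.
    """
    # Build an adjacency list from the bond table.
    adj = {i: [] for i in atoms}
    for a, b in bonds:
        adj[a].append(b)
        adj[b].append(a)

    # Multi-source breadth-first search starting from both ends of the target bond.
    dist = {target[0]: 0, target[1]: 0}
    queue = deque(target)
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)

    # Atom type counts per shell, e.g. (1, 'C') -> one carbon in shell 1.
    atom_counts = Counter(
        (d, atoms[i]) for i, d in dist.items() if 1 <= d <= max_shell
    )

    # Bond type counts per shell, keyed by (shell, element pair, bond order).
    bond_counts = Counter()
    for (a, b), order in bonds.items():
        if a not in dist or b not in dist:
            continue  # bond lies in a fragment not connected to the target
        d = max(dist[a], dist[b])
        if 1 <= d <= max_shell:
            pair = "-".join(sorted((atoms[a], atoms[b])))
            bond_counts[(d, pair, order)] += 1
    return atom_counts, bond_counts

# Toy input: ethanol, CH3-CH2-OH, with explicit hydrogens.
atoms = {0: "C", 1: "C", 2: "O", 3: "H", 4: "H", 5: "H", 6: "H", 7: "H", 8: "H"}
bonds = {
    (0, 1): 1, (1, 2): 1,
    (0, 3): 1, (0, 4): 1, (0, 5): 1,
    (1, 6): 1, (1, 7): 1,
    (2, 8): 1,
}

# Describe the environment of the O-H bond (atoms 2 and 8).
atom_counts, bond_counts = bond_environment(atoms, bonds, target=(2, 8))
print(dict(atom_counts))
print(dict(bond_counts))
```

Counts of this kind could be flattened into a fixed-length feature vector and passed to a random forest regressor; the actual descriptor layout, shell depth, and model settings used in the paper are not stated in the abstract.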

Keywords: bond dissociation energy; random forest; iBonD; atom type; chemical bond type

Classification No.: O657 [Science: Analytical Chemistry]

 
