基于PPO算法的逻辑综合序列优化通用框架设计

PPO-based universal framework design for logic synthesis sequence optimization

作　　者：王梦可杨朝晖查晓婧夏银水[1] WANG Mengke;YANG Zhaohui;ZHA Xiaojing;XIA Yinshui(Faculty of Electrical Engineering and Computer Science,Ningbo University,Ningbo 315211,China)

机构地区：[1]宁波大学信息科学与工程学院,浙江宁波315211

出　　处：《宁波大学学报(理工版)》2025年第2期78-85,共8页Journal of Ningbo University:Natural Science and Engineering Edition

基　　金：国家自然科学基金(62131010,U22A2013);国家自然科学基金青年项目(62304115);浙江省自然科学基金创新群体课题(LDT23F04021F04);浙江省科研计划一般项目(Y202248965).

摘　　要：逻辑综合通常采用启发式方法将逻辑优化算法组成为序列进行电路性能优化,而启发式方法难以根据电路和优化目标的差异进行序列自动化调节,影响了电路优化质量.为了在集成电路设计中提升序列的自适应生成能力,将序列优化问题建模为马尔可夫决策过程,提出一种面向多种逻辑表示的强化学习框架,利用近端策略优化(Proximal Policy Optimization,PPO)指导智能体来探索序列优化空间,改善其生成序列的泛化能力.并将EPFL基准电路转变为与-非图(And-Inverter Graph,AIG)和异或多数图(Xor-MajorityGraph,XMG)形式,分别经由所提出的框架进行实验,AIG形式下本文方法与DRiLLS和BOiLS方法相比分别有18.66百分点和27.67百分点的性能提升;XMG形式下则可提升原始电路性能约37.34%.实验结果表明,由本文方法生成的算法序列对电路性能有较大改进.Logic synthesis typically employs heuristic methods to compose a sequence of logic optimization algorithms for circuit performance improvement.However,heuristic methods face challenges in automatically adjusting sequences based on circuit and optimization objectives,thus affecting the quality of circuit optimization.In order to enhance the capability of adaptive sequence generation in integrated circuit design,this paper models the sequence optimization problem as a Markov Decision Process and proposes a reinforcement learning framework for multiple logic representations.The framework utilizes Proximal Policy Optimization(PPO)to guide the agent in exploring the sequence optimization space and improve the generalization ability of its sequence generation.The paper transforms EPFL benchmark circuits into AIG and XMG forms,respectively,and conducts experiments on them by using the proposed framework.Compared to DRiLLS and BOiLS methods,the proposed method achieves performance improvements of 18.66 percentages and 27.67 percentages in the AIG form,and approximately 37.34% improvement in the original circuit performance in the XMG form.The experimental results demonstrate significant improvements in circuit performance achieved by the algorithm sequences generated with the proposed method.

关键词：逻辑综合序列优化强化学习近端策略优化

分类号：TP331.2[自动化与计算机技术—计算机系统结构] TN431.2[自动化与计算机技术—计算机科学与技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于PPO算法的逻辑综合序列优化通用框架设计

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于PPO算法的逻辑综合序列优化通用框架设计

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索