Authors: MA Nan (马楠), LI Hongqi (李洪奇) [1], LIU Hualin (刘华林) [2,3], YANG Lei (杨磊) [2,3]
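Affiliations: [1] School of Information Science and Engineering, China University of Petroleum (Beijing), Beijing 102249, China; [2] PetroChina Planning and Engineering Institute, Beijing 100083, China; [3] Key Laboratory of Oil and Gas Business Chain Optimization, CNPC, Beijing 100083, China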
Source: Chemical Industry and Engineering Progress (《化工进展》), 2024, Issue 3, pp. 1167-1177 (11 pages)
Funding: Basic Research and Strategic Reserve Technology Research Fund for Directly Affiliated Institutes (KJ2021-316).
Abstract: Most current research on refinery crude oil storage and transportation scheduling adopts static scheduling schemes based on mathematical programming, which take a long time to solve and cannot optimize scheduling efficiently in real time as the environment changes. To address this, this paper combines deep reinforcement learning with refinery production constraints to build a dynamic, real-time scheduling decision algorithm for crude oil storage and transportation. The algorithm first transforms the refinery crude oil resource scheduling problem into a Markov decision process. It then proposes a deep reinforcement learning algorithm based on soft actor-critic (SAC) that simultaneously determines discrete decisions, such as the transmission target, and continuous decisions, such as the transmission speed. The results show that the learned policy has good feasibility. Compared with the baseline algorithm, it achieves better results on key indicators such as tanker time in port, the number of events in the scheduling plan, and the processing plan execution rate; it reduces solution time to the millisecond level; and it effectively limits how far random events affect the overall decision. The algorithm can provide new ideas for rapid decision-making in crude oil storage and transportation scheduling at coastal refineries.
Keywords: refinery crude oil storage and transportation; resource scheduling; deep reinforcement learning; soft actor-critic
Classification: TE624 [Petroleum and Natural Gas Engineering: Oil and Gas Processing Engineering]
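
Illustrative note: the abstract describes a policy that outputs a discrete decision (transmission target) and a continuous decision (transmission speed) at the same time. The sketch below shows one common way such a hybrid action might be sampled in an SAC-style policy: a softmax over candidate targets plus a tanh-squashed Gaussian for the rate. It is not taken from the paper; the function name `sample_hybrid_action`, its parameters, and all numeric values are hypothetical.

```python
import numpy as np


def sample_hybrid_action(logits, mean, log_std, max_rate, rng):
    """Sample (target index, transfer rate) from hypothetical policy outputs.

    logits   : unnormalized scores, one per candidate target (assumed)
    mean     : Gaussian mean of the raw rate, one per target (assumed)
    log_std  : Gaussian log standard deviation, one per target (assumed)
    max_rate : upper bound on the transfer rate (assumed units)
    """
    # Discrete part: numerically stable softmax, then a categorical draw.
    z = logits - np.max(logits)
    probs = np.exp(z) / np.sum(np.exp(z))
    target = int(rng.choice(len(probs), p=probs))

    # Continuous part: Gaussian sample squashed by tanh into [-1, 1],
    # then rescaled to [0, max_rate].
    raw = rng.normal(mean[target], np.exp(log_std[target]))
    rate = 0.5 * (np.tanh(raw) + 1.0) * max_rate
    return target, float(rate)


rng = np.random.default_rng(0)
logits = np.array([0.2, 1.5, -0.3])    # made-up scores for three targets
mean = np.array([0.0, 0.5, -0.5])      # made-up rate means
log_std = np.array([-1.0, -1.0, -1.0])  # made-up log standard deviations
target, rate = sample_hybrid_action(logits, mean, log_std, max_rate=1000.0, rng=rng)
print(target, rate)
```

The tanh squashing keeps the sampled rate inside fixed bounds, which is the usual SAC treatment of bounded continuous actions; how the paper itself couples the discrete and continuous parts is not stated in the abstract.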