基于强化学习的互联电网CPS自校正控制  被引量:18

Reinforcement learning based CPS self-tuning control methodology for interconnected power systems

在线阅读下载全文

作  者:余涛[1] 周斌[1] 

机构地区:[1]华南理工大学电力学院,广东广州510640

出  处:《电力系统保护与控制》2009年第10期33-38,共6页Power System Protection and Control

基  金:国家自然科学基金项目(50807016);广东省自然科学基金博士启动基金项目(06300091)~~

摘  要:AGC是一个动态多级决策问题——马尔可夫决策过程(MDP),应用强化学习算法可有效地实现控制策略的在线学习和动态优化决策。引入Q学习算法作为强化学习核心算法,将CPS值看作包含AGC的电力系统"环境"所给的"奖励",依靠奖励值Q函数与CPS控制动作形成的闭环控制结构实现在线学习。学习目标是使CPS控制动作从环境获得的长期积累奖励值最大,从而快速自动地在线优化CPS控制系统的输出。仿真研究显示,引入强化学习自校正控制后显著增强了整个AGC系统的鲁棒性和适应性,有效提高了CPS考核合格率。The automatic generation control (AGC) problem is a stochastic multistage decision problem, which can be modeled as a Markovian Decision Process (MDP). The paper introduces the Q-learning method as the core algorithm of reinforcement learning (RL), and regards the CPS values as the rewards from the interconnected power systems. By regulating a closed-loop CPS control rule to maximize the total reward in the procedure of on-line learning, the optimal CPS control strategy can be gradually obtained. The case study shows that after adding the RL control, the robustness and adaptability of AGC system is enhanced obviously and the CPS compliance is ensured. This work is supported by National Natural Science Foundation of China(No.50807016) and Natural Science Funds of Guangdong Province (No. 06300091).

关 键 词:强化学习 Q学习算法 自动发电控制 CPS标准 自校正控制 

分 类 号:TM76[电气工程—电力系统及自动化]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象