检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:Chao Huang Hongcai Zhang Long Wang Xiong Luo Yonghua Song
机构地区:[1]IEEE [2]State Key Laboratory of Internet of Things for Smart City,University of Macao,Macao S.A.R.,China [3]School of Computer and Communication Engineering,University of Science and Technology Beijing,Beijing 100083,China [4]Shunde Graduate School,University of Science and Technology Beijing,Foshan 528399,China [5]State Key Laboratory of Internet of Things for Smart City and Department of Electrical and Computer Engineering,University of Macao,Macao S.A.R.,China
出 处:《Journal of Modern Power Systems and Clean Energy》2022年第3期743-754,共12页现代电力系统与清洁能源学报(英文)
基 金:supported by the National Natural Science Foundation of China(No.62002016);the Science and Technology Development Fund,Macao S.A.R.(No.0137/2019/A3);the Beijing Natural Science Foundation(No.9204028);the Guangdong Basic and Applied Basic Research Foundation(No.2019A1515111165)。
摘 要:This paper develops deep reinforcement learning(DRL)algorithms for optimizing the operation of home energy system which consists of photovoltaic(PV)panels,battery energy storage system,and household appliances.Model-free DRL algorithms can efficiently handle the difficulty of energy system modeling and uncertainty of PV generation.However,discretecontinuous hybrid action space of the considered home energy system challenges existing DRL algorithms for either discrete actions or continuous actions.Thus,a mixed deep reinforcement learning(MDRL)algorithm is proposed,which integrates deep Q-learning(DQL)algorithm and deep deterministic policy gradient(DDPG)algorithm.The DQL algorithm deals with discrete actions,while the DDPG algorithm handles continuous actions.The MDRL algorithm learns optimal strategy by trialand-error interactions with the environment.However,unsafe actions,which violate system constraints,can give rise to great cost.To handle such problem,a safe-MDRL algorithm is further proposed.Simulation studies demonstrate that the proposed MDRL algorithm can efficiently handle the challenge from discrete-continuous hybrid action space for home energy management.The proposed MDRL algorithm reduces the operation cost while maintaining the human thermal comfort by comparing with benchmark algorithms on the test dataset.Moreover,the safe-MDRL algorithm greatly reduces the loss of thermal comfort in the learning stage by the proposed MDRL algorithm.
关 键 词:Demand response deep reinforcement learning discrete-continuous action space home energy management safe reinforcement learning
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:52.14.187.136