Authors: He Shanshan; Zhou Yalan; Guo Yuyang (School of Information Science, Guangdong University of Finance and Economics, Guangzhou, 510320, China)
Source: Journal of Nanjing University (Natural Science), 2024, No. 6, pp. 940-953 (14 pages)
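Funding: Natural Science Foundation of Guangdong Province (2021A1515012298); Humanities and Social Sciences Project of the Ministry of Education (24YJAZH042)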
Abstract: With the development of artificial intelligence applications, finding an optimal automatic stock trading strategy that helps investors achieve considerable returns in a volatile financial market has become a research hotspot. This paper proposes a stock trading decision-making algorithm, LSTM-DDPG (Long Short-Term Memory Network-Deep Deterministic Policy Gradient). The algorithm integrates an LSTM network, which is good at capturing time-series features, into the DDPG algorithm, which is good at processing high-dimensional data, and adds a Dropout operation to reduce overfitting. To better capture the dynamic changes of the market, six classic stock-market technical indicators are introduced to expand the state-space dimension of LSTM-DDPG. In addition, two reward functions, cumulative return and Sharpe ratio, are used with LSTM-DDPG to offer investors a choice of investment schemes. To verify its effectiveness, the proposed algorithm is applied to two trading tasks: single-stock trading and stock-portfolio trading. The datasets for both tasks include data from the US market and the Chinese market. The experimental results show that, in both tasks and in both domestic and foreign markets, the proposed algorithm performs well on multiple evaluation metrics, including cumulative return, Sharpe ratio, and Calmar ratio.
Keywords: deep reinforcement learning; trading decision; DDPG; LSTM; Sharpe ratio; single-stock trading; stock portfolio
Classification: TP391 [Automation and Computer Technology: Computer Application Technology]
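The abstract names cumulative return and Sharpe ratio as the two reward functions but this record does not give their exact formulation. As a minimal sketch under the textbook definitions (not necessarily the ones used in the paper), the two quantities could be computed from a series of portfolio values as follows. The function names, the 252-day annualization factor, and the zero default risk-free rate are illustrative assumptions.

```python
import math


def cumulative_return(values):
    """Fractional gain from the first to the last portfolio value."""
    return values[-1] / values[0] - 1.0


def sharpe_ratio(values, risk_free_rate=0.0, periods_per_year=252):
    """Annualized Sharpe ratio of the per-period simple returns.

    Assumptions (not taken from the paper): simple returns, sample
    standard deviation, and 252 trading periods per year.
    """
    # Per-period simple returns derived from consecutive portfolio values.
    returns = [values[i] / values[i - 1] - 1.0 for i in range(1, len(values))]
    if len(returns) < 2:
        return 0.0  # not enough data to estimate volatility

    # Excess return over the per-period risk-free rate.
    excess = [r - risk_free_rate / periods_per_year for r in returns]
    mean = sum(excess) / len(excess)
    variance = sum((r - mean) ** 2 for r in excess) / (len(excess) - 1)
    std = math.sqrt(variance)
    if std == 0.0:
        return 0.0  # flat series: define the ratio as zero to avoid division by zero
    return mean / std * math.sqrt(periods_per_year)


if __name__ == "__main__":
    portfolio = [100.0, 102.0, 101.0, 104.0]
    print("cumulative return:", cumulative_return(portfolio))
    print("Sharpe ratio:", sharpe_ratio(portfolio))
```

In a reinforcement learning setting, either quantity could serve as the scalar reward: cumulative return rewards raw profit, while the Sharpe ratio penalizes volatility, which is one plausible reading of why the abstract offers both as alternative investment schemes.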