云南高校图书馆联盟文献共享服务平台- LAY

公共卫生与预防医学

营养与食品卫生学

人体解剖和组织胚胎学

航空、航天与航海医学

影像医学与核医学

血液循环系统疾病

神经病学与精神病学

皮肤病学与性病学

微生物与生化药学

农业机械化工程

农业电气化与自动化

作物栽培与耕作技术

农业昆虫与害虫防治

木材科学与技术

特种经济动物饲养

材料科学与工程

矿井通风与安全

石油与天然气工程

油气田开发工程

冶金机械及自动化

金属切削加工及机床

机械设计及理论

机械制造及自动化

仪器科学与技术

精密仪器及机械

测试计量技术及仪器

兵器科学与技术

兵器发射理论与技术

武器系统与运用工程

火炮、自动武器与弹药工程

军事化学与烟火技术

动力工程及工程热物理

动力机械及工程

流体机械及工程

核燃料循环与材料

辐射防护及环境保护

电工理论与新技术

电力系统及自动化

高电压与绝缘技术

电力电子与电力传动

微电子学与固体电子学

信息与通信工程

通信与信息系统

信号与信息处理

自动化与计算机技术

控制科学与工程

控制理论与控制工程

检测技术与自动化装置

计算机科学与技术

计算机系统结构

计算机软件与理论

计算机应用技术

合成树脂塑料工业

轻工技术与工程

纺织科学与工程

纺织材料与纺织品设计

纺织化学与染整工程

服装设计与工程

食品科学与工程

粮食、油脂及植物蛋白工程

农产品加工及贮藏工程

水产品加工及贮藏工程

皮革化学与工程

建筑设计及理论

城市规划与设计

供热、供燃气、通风及空调工程

桥梁与隧道工程

水文学及水资源

水力学及河流动力学

道路与铁道工程

交通信息工程及控制

交通运输规划与管理

载运工具运用工程

船舶与海洋工程

船舶及航道工程

港口、海岸及近海工程

航空宇航科学技术

航空宇航推进理论与工程

航空宇航制造工程

人机与环境工程

环境科学与工程

概率论与数理统计

运筹学与控制论

一般力学与力学基础

热学与物质分子运动论

原子与分子物理

粒子物理与原子核物理

测绘科学与技术

大地测量学与测量工程

摄影测量与遥感

地图制图学与地理信息工程

固体地球物理学

大气科学及气象学

大气物理学与大气环境

古生物学与地层学

职业技术教育学

国际共产主义运动

宪法学与行政法学

环境与资源保护法学

马克思主义哲学

发展与教育心理学

考古学及博物馆学

时间限定

时间：

更新时间：

期刊范围

全部期刊核心期刊 EI来源期刊 SCI来源期刊 CAS来源期刊 CSCD来源期刊 CSSCI来源期刊

学科限定全选

LAY: 作品数：233被引量：393H指数：10; 导出分析报告; 相关领域：文化科学更多>>; 相关作者：岳前进张向锋梁辉王辉何宁更多>>; 相关机构：中国海洋石油总公司大连理工大学中海石油(中国)有限公司海南分公司浙江大学更多>>; 相关期刊：更多>>; 相关基金：国家自然科学基金国家重点基础研究发展计划国家科技重大专项国家高技术研究发展计划更多>>

在结果中检索

检索结果分析

选择条件：

主题=F-P

共条记录，以下是1-6

全选清除导出

参考文献引证文献引用追踪

视图：

排序：

Distributed Deep Reinforcement Learning:A Survey and a Multi-player Multi-agent Learning Toolbox: 《Machine Intelligence Research》2024年第3期411-430,共20页Qiyue Yin Tongtong Yu Shengqi Shen Jun Yang Meijing Zhao Wancheng Ni Kaiqi Huang Bin Liang Liang Wang; supported by Open Fund/Postdoctoral Fund of the Laboratory of Cognition and Decision Intelligence for Complex Systems,Institute of Automation,Chinese Academy of Sciences,China(No.CASIA-KFKTXDA27040809).; With the breakthrough of AlphaGo,deep reinforcement learning has become a recognized technique for solving sequential decision-making problems.Despite its reputation,data inefficiency caused by its trial and error lea...; 关键词：Deep reinforcement learning distributed machine learning self-play population-play TOOLBOX

Optimal Strategy for Aircraft Pursuit-evasion Games via Self-play Iteration: 《Machine Intelligence Research》2024年第3期585-596,共12页Xin Wang Qing-Lai Wei Tao Li Jie Zhang; In this paper,the pursuit-evasion game with state and control constraints is solved to achieve the Nash equilibrium of both the pursuer and the evader with an iterative self-play technique.Under the condition where th...; 关键词：Differential games pursuit-evasion games nonlinear control optimal control Nash equilibrium solution

AI in Human-computer Gaming:Techniques,Challenges and Opportunities被引量：2: 《Machine Intelligence Research》2023年第3期299-317,共19页Qi-Yue Yin Jun Yang Kai-Qi Huang Mei-Jing Zhao Wan-Cheng Ni Bin Liang Yan Huang Shu Wu Liang Wang; National Natural Science Foundation of China(No.61906197).; With the breakthrough of AlphaGo,human-computer gaming AI has ushered in a big explosion,attracting more and more researchers all over the world.As a recognized standard for testing artificial intelligence,various hum...; 关键词：Human-computer gaming AI intelligent decision making deep reinforcement learning self-play

Autonomous air combat decision-making of UAV based on parallel self-play reinforcement learning被引量：5: 《CAAI Transactions on Intelligence Technology》2023年第1期64-81,共18页Bo Li Jingyi Huang Shuangxia Bai Zhigang Gan Shiyang Liang Neretin Evgeny Shouwen Yao; National Natural Science Foundation of China,Grant/Award Number:62003267;Fundamental Research Funds for the Central Universities,Grant/Award Number:G2022KY0602;Technology on Electromagnetic Space Operations and Applications Laboratory,Grant/Award Number:2022ZX0090;Key Core Technology Research Plan of Xi'an,Grant/Award Number:21RGZN0016。; Aiming at addressing the problem of manoeuvring decision-making in UAV air combat,this study establishes a one-to-one air combat model,defines missile attack areas,and uses the non-deterministic policy Soft-Actor-Crit...; 关键词：air combat decision deep reinforcement learning parallel self-play SAC algorithm UAV

A Monte Carlo Neural Fictitious Self-Play approach to approximate Nash Equilibrium in imperfect-information dynamic games被引量：5: 《Frontiers of Computer Science》2021年第5期137-150,共14页Li ZHANG Yuxuan CHEN Wei WANG Ziliang HAN Shijian Li Zhijie PAN Gang PAN; National Key Research and Development Program of China(2017YFB1002503);Science and Technology Innovation 2030-“New Generation Artificial Intelligence”Major Project(2018AAA0100902),China.; Solving the optimization problem to approach a Nash Equilibrium point plays an important role in imperfect information games,e.g.,StarCraft and poker.Neural Fictitious Self-Play(NFSP)is an effective algorithm that lea...; 关键词：approximate Nash Equilibrium imperfect-information games dynamic games Monte Carlo tree search Neural Fictitious Self-Play reinforcement learning

Self-Play and Using an Expert to Learn to Play Backgammon with Temporal Difference Learning: 《Journal of Intelligent Learning Systems and Applications》2010年第2期57-68,共12页Marco A. Wiering; A promising approach to learn to play board games is to use reinforcement learning algorithms that can learn a game position evaluation function. In this paper we examine and compare three different methods for genera...; 关键词：Board GAMES Reinforcement LEARNING TD(λ) Self-Play LEARNING From Demonstration

全选清除导出

共1页<1>

检索报告对象比较聚类工具使用帮助返回顶部

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

LAY

检索结果分析

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

LAY

检索结果分析

下载全文

用户登录

高级检索检索式检索