REINFORCEMENT_LEARNING

作品数:938被引量:1620H指数:17
导出分析报告
相关作者:孟巧荣廉自生辛英尹晓虎王长缨更多>>
相关机构:电子科技大学沈阳理工大学北京工业大学太原理工大学更多>>
相关期刊:更多>>
相关基金:国家自然科学基金北京市自然科学基金中国博士后科学基金广东省自然科学基金更多>>
-

检索结果分析

结果分析中...
条 记 录,以下是1-10
视图:
排序:
Exploring & exploiting high-order graph structure for sparse knowledge graph completion
《Frontiers of Computer Science》2025年第2期31-42,共12页Tao HE Ming LIU Yixin CAO Zekun WANG Zihao ZHENG Bing QIN 
supported by the National Key R&D Program of China(2022YFF0903301);the National Natural Science Foundation of China(Grant Nos.U22B2059,61976073,62276083);the Shenzhen Foundational Research Funding(JCYJ20200109113441941);the Major Key Project of PCL(PCL2021A06).
Sparse Knowledge Graph(KG)scenarios pose a challenge for previous Knowledge Graph Completion(KGC)methods,that is,the completion performance decreases rapidly with the increase of graph sparsity.This problem is also ex...
关键词:knowledge graph completion graph neural networks reinforcement learning 
MARCS:A Mobile Crowdsensing Framework Based on Data Shapley Value Enabled Multi-Agent Deep Reinforcement Learning
《Computers, Materials & Continua》2025年第3期4431-4449,共19页Yiqin Wang Yufeng Wang Jianhua Ma Qun Jin 
sponsored by Qinglan Project of Jiangsu Province,and Jiangsu Provincial Key Research and Development Program(No.BE2020084-1).
Opportunistic mobile crowdsensing(MCS)non-intrusively exploits human mobility trajectories,and the participants’smart devices as sensors have become promising paradigms for various urban data acquisition tasks.Howeve...
关键词:Mobile crowdsensing online data acquisition data Shapley value multi-agent deep reinforcement learning centralized training and decentralized execution(CTDE) 
Passive homing method with reinforcement learning for a single hydrophone
《Chinese Journal of Acoustics》2025年第1期18-31,共14页YIN Han CHEN Jianfeng ZHANG Dongzhe 
In order to reduce the stringent volume and cost requirements for underwater unmanned autonomous vehicles(AUVs)when equipped with homing systems,this paper proposes a simple and feasible passive homing method.The prop...
关键词:Single hydrophone Passive homing Reinforcement learning Doppler frequency shift 
RIS Enabled Simultaneous Transmission and Key Generation with PPO:Exploring Security Boundary of RIS Phase Shift
《ZTE Communications》2025年第1期11-17,共7页FAN Kaiqing YAO Yuze GAO Ning LI Xiao JIN Shi 
supported in part by the National Science Foundation of China(NSFC)under Grant No.62371131;in part by the National Key R&D Program of China under Grant No.2024YFE0200700;in part by the program of Zhishan Young Scholar of Southeast University under Grant No.2242024RCB0030。
Due to the broadcast nature of wireless channels and the development of quantum computers,the confidentiality of wireless communication is seriously threatened.In this paper,we propose an integrated communications and...
关键词:reconfigurable intelligent surfaces physical layer key generation integrated communications and security one-time pad deep reinforcement learning 
Optimization of Intelligent Education Systems Based on Reinforcement Learning
《Artificial Intelligence Education Studies》2025年第1期53-69,共17页Sophia LI 
This paper explores how reinforcement learning(RL)can improve intelligent education systems.RL helps make learning personal,flexible,and efficient by choosing actions based on student needs and rewards like better sco...
关键词:Reinforcement Learning Intelligent Education Personalized Learning Adaptive Assessment Teacher Support 
A dynamic control decision approach for fixed-wing aircraft games via hybrid action reinforcement learning
《Science China(Information Sciences)》2025年第3期193-215,共23页Xing ZHUANG Dongguang LI Hanyu LI Yue WANG Jihong ZHU 
supported by China National Defense Basic Research Programs(Grant No.JCKY2021204B104)。
Autonomous decision-making is crucial for aircraft to achieve quick victories in diverse scenarios.Based on a 6-degree-of-freedom aircraft model,this paper proposes a decoupled guidance and control theory for autonomo...
关键词:intelligent air combat unmanned aerial vehicle game dynamic control reinforcement learning 
Intelligent integrated sensing and communication:a survey
《Science China(Information Sciences)》2025年第3期1-42,共42页Jifa ZHANG Weidang LU Chengwen XING Nan ZHAO Naofal AL-DHAHIR George K.KARAGIANNIDIS Xiaoniu YANG 
supported by National Natural Science Foundation of China(Grant Nos.U23A20271,62325103);Application and Fundamental Research Planning Project in Liaoning Province(Grant No.2023TH2/101300197)。
Integrated sensing and communication(ISAC)is a promising technique to increase spectral efficiency and support various emerging applications by sharing the spectrum and hardware between these functionalities.However,t...
关键词:artificial intelligence deep learning deep reinforcement learning federated learning generative artificial intelli-gence integrated sensing and communication machine learning transfer learning 
Improving Machine Translation Formality with Large Language Models
《Computers, Materials & Continua》2025年第2期2061-2075,共15页Murun Yang Fuxue Li 
Preserving formal style in neural machine translation (NMT) is essential, yet often overlooked as an optimization objective of the training processes. This oversight can lead to translations that, though accurate, lac...
关键词:Neural machine translation FORMALITY large language model text style transfer style evaluation reinforcement learning 
Cooperative output regulation of heterogeneous directed multi-agent systems:a fully distributed model-free reinforcement learning framework
《Science China(Information Sciences)》2025年第2期166-181,共16页Xiongtao SHI Yanjie LI Chenglong DU Huiping LI Chaoyang CHEN Weihua GUI 
supported by National Natural Science Foundation of China(Grant Nos.62303492,61977019,62222306);Shenzhen Basic Research Program(Grant Nos.JCYJ20220818102415033,JSGG20201103093802006,KJZD2023092311-4222045);Natural Science Foundation of Hunan Province(Grant No.2023JJ40765);Natural Science Foundation of Changsha(Grant No.kq2208287);Science and Technology Innovation Program of Hunan Province(Grant No.2022WZ1001);China Postdoctoral Innovation Talents Support Program(Grant No.BX20230430)。
In this paper,the cooperative output regulation(COR)problem of a class of unknown heterogeneous multi-agent systems(MASs)with directed graphs is studied via a model-free reinforcement learning(RL)based fully distribut...
关键词:model-free reinforcement learning unknown heterogeneous multi-agent systems fully distributed event-triggered control directedgraph 
Graph reinforcement learning with relational priors for predictive power allocation
《Science China(Information Sciences)》2025年第2期226-243,共18页Jianyu ZHAO Chenyang YANG 
supported in part by National Key R&D Program of China(Grant No.2022YFB2902002);National Natural Science Foundation of China(Grant No.62271024)。
Deep reinforcement learning for resource allocation has been investigated extensively owing to its ability of handling model-free and end-to-end problems.However,its slow convergence and high time complexity during on...
关键词:reinforcement learning graph neural network relational priors resource allocation 
检索报告 对象比较 聚类工具 使用帮助 返回顶部