Yunnan University Library Alliance Literature Sharing Service Platform - OFFLINE

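OFFLINE: Works: 251; Citations: 313; H-index: 8; Related authors: Pan Liping, Hou Song, Zhang Qing, Liu Yanlei, Zhang Liubin; Related institutions: Jiangsu Food & Pharmaceutical Science College, Chery Automobile Co., Ltd., University of Electronic Science and Technology of China, Zhejiang Normal University; Related funds: National Natural Science Foundation of China, National Basic Research Program of China, National High Technology Research and Development Program of China, National Social Science Fund of China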

Offline model-based reinforcement learning with causal structured world models: Frontiers of Computer Science, 2025, No. 4, pp. 77-90 (14 pages); Zhengmao ZHU, Honglong TIAN, Xionghui CHEN, Kun ZHANG, Yang YU; Model-based methods have recently been shown promising for offline reinforcement learning (RL), which aims at learning good policies from historical data without interacting with the environment. Previous model-based off...; Keywords: reinforcement learning, offline reinforcement learning, model-based reinforcement learning, causal discovery

A ROOT-based detector geometry and event visualization system for JUNO-TAO: Nuclear Science and Techniques, 2025, No. 3, pp. 50-59 (10 pages); Ming-Hua Liao, Kai-Xuan Huang, Yu-Mei Zhang, Jia-Yang Xu, Guo-Fu Cao, Zheng-Yun You; Supported by the National Natural Science Foundation of China (Nos. 12175321, 11975021, and 11675275); the Strategic Priority Research Program of the Chinese Academy of Sciences (No. XDA10010900).; The Taishan Antineutrino Observatory (TAO) is a satellite experiment of the Jiangmen Underground Neutrino Observatory, located near the Taishan nuclear power plant (NPP). The TAO aims to measure the energy spectrum of reac...; Keywords: Visualization, Geometry, Offline software, JUNO, TAO

Effectiveness of the "online + offline + practice" trinity teaching model on the learning of junior nursing students: an exploratory study: Nursing Communications, 2024, No. 12, pp. 1-6 (6 pages); Cai-Xia Liu, Ling Wang; Supported by Social Science Research Project of Yichang (ysk24ybkt011).; Background: Surgical Nursing is a main course of nursing specialty and a large course lasting 96 credit hours. In response to the teaching pain points such as the complicated and boring content of the surgical nursing c...; Keywords: BOPPPS, Rain Classroom, Surgical Nursing, teaching

Robust Offline Actor-Critic With On-policy Regularized Policy Evaluation: IEEE/CAA Journal of Automatica Sinica, 2024, No. 12, pp. 2497-2511 (15 pages); Shuo Cao, Xuesong Wang, Yuhu Cheng; Supported in part by the National Natural Science Foundation of China (62176259, 62373364); the Key Research and Development Program of Jiangsu Province (BE2022095).; To alleviate the extrapolation error and instability inherent in Q-function directly learned by off-policy Q-learning (QL-style) on static datasets, this article utilizes the on-policy state-action-reward-state-action (SA...; Keywords: Offline reinforcement learning, off-policy QL-style, on-policy SARSA-style, policy evaluation (PE), Q-value estimation

Federated Offline Reinforcement Learning with Proximal Policy Evaluation: Chinese Journal of Electronics, 2024, No. 6, pp. 1360-1372 (13 pages); Sheng YUE, Yongheng DENG, Guanbo WANG, Ju REN, Yaoxue ZHANG; Supported by the National Natural Science Foundation of China (Grant Nos. 62341201, 62122095, 62072472, 62172445, 62302260, and 62202256); the National Key R&D Program of China (Grant No. 2022YFF0604502); the China Postdoctoral Science Foundation (Grant No. 2023M731956); a grant from the Guoqiang Institute, Tsinghua University.; Offline reinforcement learning (RL) has gathered increasing attention in recent years, which seeks to learn policies from static datasets without active online exploration. However, the existing offline RL approaches often...; Keywords: Offline reinforcement learning, Batch reinforcement learning, Federated learning, Reinforcement learning

Implicit policy constraint for offline reinforcement learning: CAAI Transactions on Intelligence Technology, 2024, No. 4, pp. 973-981 (9 pages); Zhiyong Peng, Yadong Liu, Changlin Han, Zongtan Zhou; National Natural Science Foundation of China, Grant/Award Number: U19A2083.; Offline reinforcement learning (RL) aims to learn policies entirely from passively collected datasets, making it a data-driven decision method. One of the main challenges in offline RL is the distribution shift problem, w...; Keywords: artificial intelligence, artificial neural network, learning (artificial intelligence), planning (artificial intelligence)

Model-based offline reinforcement learning framework for optimizing tunnel boring machine operation: Underground Space, 2024, No. 6, pp. 47-71 (25 pages); Yupeng Cao, Wei Luo, Yadong Xue, Weiren Lin, Feng Zhang; Research on automation and intelligent operation of tunnel boring machine (TBM) is receiving more and more attention, benefiting from the increasing construction data. However, most studies on TBM operations optimization w...; Keywords: TBM, Reinforcement learning, Transformer, Data mining

Construction and Application of a Blended Online and Offline University English Course: Journal of Contemporary Educational Research, 2024, No. 11, pp. 257-263 (7 pages); Yuanli Chen; Guangdong Ocean University Undergraduate Teaching Quality and Teaching Reform Project "Integrated English 3 Blended Online and Offline Course" (PX-112024042); Guangdong Ocean University Research Initiation Project (060302162402).; This paper explores the design, implementation, and evaluation of the Integrated English 3 blended course, which integrates online learning through massive open online courses (MOOCs) and face-to-face classroom instruction...; Keywords: Blended learning, Massive open online courses, Production-oriented approach, Intercultural communication, Course design

Data-driven offline reinforcement learning approach for quadrotor's motion and path planning: Chinese Journal of Aeronautics, 2024, No. 11, pp. 386-397 (12 pages); Haoran ZHAO, Hang FU, Fan YANG, Che QU, Yaoming ZHOU; Supported by the National Natural Science Foundation of China (No. 52272382); the Aeronautical Science Foundation of China (No. 20200017051001); the Fundamental Research Funds for the Central Universities, China.; Non-learning based motion and path planning of an Unmanned Aerial Vehicle (UAV) is faced with low computation efficiency, mapping memory occupation and local optimization problems. This article investigates the challenge ...; Keywords: Motion planning, Unmanned aerial vehicle, Reinforcement learning, Data-driven learning, Markov decision process

Exploration of the Teaching Reform of Computer Composition Principles with Ideological and Political Education at Anqing Normal University: Journal of Contemporary Educational Research, 2024, No. 10, pp. 294-299 (6 pages); Liangliang Zhang, Liefu Ai, Xin Zheng, Xianyang Wang, Qingfeng Tang; National Natural Science Foundation of China (62302014); Key Project of Science Research in Universities of Anhui Province of China (2023AH050492, 2023AH050497); Anhui Province Graduate Education Teaching Key Project (2023jyjxggyjY193); Anqing Normal University Undergraduate Education Teaching Key Project (2023aqnujyxm15, 2023aqnujyxm12); Anqing Normal University Undergraduate Education Teaching General Project (2023aqnujyxm34).; The teaching mode of the Computer Composition Principles includes theoretical and practical teaching. At present, there is a problem of inconsistency in the teaching content of the two methods in our school. To this end, ...; Keywords: Theoretical teaching, Practical teaching, Ideological and political education, Online and offline
