云南高校图书馆联盟文献共享服务平台- POMDP

POMDP: 作品数：91被引量：177H指数：6; 导出分析报告; 相关领域：自动化与计算机技术电子电信更多>>; 相关作者：殷保群唐伦陈前斌陈小平冯妍更多>>; 相关机构：中国科学技术大学南京大学重庆邮电大学南京邮电大学更多>>; 相关期刊：更多>>; 相关基金：国家自然科学基金国家高技术研究发展计划国家教育部博士点基金中国航空科学基金更多>>

Near Optimal Approximations and Finite Memory Policies for POMPDs with Continuous Spaces Dedicated to Professor Peter E. Caines, on the occasion of his 80th birthday: 《Journal of Systems Science & Complexity》2025年第1期238-270,共33页KARA Ali Devran BAYRAKTAR Erhan YUKSEL Serdar; partially supported by the National Science Foundation under Grant No.DMS-2106556;by the Susan M.Smith chair;partially supported by the Natural Sciences and Engineering Research Council(NSERC)of Canada。; The authors study an approximation method for partially observed Markov decision processes(POMDPs)with continuous spaces.Belief MDP reduction,which has been the standard approach to study POMDPs requires rigorous appr...; 关键词：Filter stability POMDP reinforcement learning stochastic control

Adaptive cache policy optimization through deep reinforcement learning in dynamic cellular networks: 《Intelligent and Converged Networks》2024年第2期81-99,共19页Ashvin Srinivasan Mohsen Amidzadeh Junshan Zhang Olav Tirkkonen; We explore the use of caching both at the network edge and within User Equipment(UE)to alleviate traffic load of wireless networks.We develop a joint cache placement and delivery policy that maximizes the Quality of S...; 关键词：wireless caching deep reinforcement learning advantageous actor critic long short term memory non-stationary Partial Observable Markov Decision Process(POMDP)

Analysis of a POMDP Model for an Optimal Maintenance Problem with Multiple Imperfect Repairs: 《American Journal of Operations Research》2023年第6期133-146,共14页Nobuyuki Tamura; I consider a system whose deterioration follows a discrete-time and discrete-state Markov chain with an absorbing state. When the system is put into practice, I may select operation (wait), imperfect repair, or replac...; 关键词：Partially Observable Markov Decision Process Imperfect Repair Stochastic Order Monotone Property Optimal Maintenance Policy

区分多业务的跨层优化无线网络协议头压缩算法被引量：1: 《重庆邮电大学学报（自然科学版）》2023年第2期316-327,共12页张明鑫李云夏世超; 国家自然科学基金(62071077)。; 在网络中,鲁棒性协议头压缩(robust header compression,ROHC)算法需要压缩端和解压端的状态同步,才能成功解压ROHC数据包,但ROHC算法的双向可靠R模式和双向优化O模式需要单独的反馈信道,增加了网络成本。针对ROHC算法的单向U模式,当无...; 关键词：鲁棒性协议头压缩(ROHC) 跨层优化部分可观测马尔科夫过程(POMDP) 多业务

Data Analytics of an Information System Based on a Markov Decision Process and a Partially Observable Markov Decision Process: 《Journal of Computer Science Research》2023年第1期21-30,共10页Lidong Wang Reed L.Mosher Terril C.Falls Patti Duett; Data analytics of an information system is conducted based on a Markov decision process(MDP)and a partially observable Markov decision process(POMDP)in this paper.Data analytics over a finite planning horizon and an i...; 关键词：Predictive modelling Information system MDP POMDP CYBERSECURITY Q-LEARNING

面向不确定性环境的自动驾驶运动规划:机遇与挑战被引量：3: 《模式识别与人工智能》2023年第1期1-21,共21页张晓彤王嘉诚何景涛陈仕韬郑南宁; 运动规划算法作为自动驾驶系统中的重要研究内容,愈发受到研究者们关注.然而目前多数算法仅考虑在确定性结构化环境中的应用,忽视动态交通环境中潜在的不确定性因素.文中面向不确定性环境,将运动规划算法总结为两类:部分可观测马尔可夫...; 关键词：自动驾驶运动规划部分可观测马尔可夫决策过程(POMDP) 概率占用栅格图(POGM)

基于POMDP的电梯群控调度策略被引量：1: 《闽江学院学报》2022年第5期104-111,共8页彭诚姚进发董正山; 安徽省重点自然科学项目(KJ2021A1403)。; 针对电梯群组系统的随机性和复杂性,以离散事件动态系统和分布式部分可观马尔可夫决策过程为理论基础,将电梯群组的调度问题建模为基于事件驱动的部分可观马尔可夫决策模型,并利用多智能体强化学习算法求解最优调度策略。仿真实验结果表...; 关键词：电梯群控系统离散事件动态系统马尔可夫决策过程强化学习

基于POMDP的多机无源传感器协同任务规划被引量：2: 《无线电工程》2022年第7期1260-1265,共6页马玲左燕彭冬亮任金磊; 国家自然科学基金(61673146,61771028,61973102);电子信息控制重点实验室基金(6142105200110)。; 针对多机无源传感器协同跟踪任务规划问题,提出了一种基于部分可观察马尔可夫决策过程(Partially Observable Markov Decision Process,POMDP)的多无人机无源传感器调度算法。在POMDP框架下建立了多无人机协同跟踪规划模型。考虑量测噪...; 关键词：机载无源传感器部分可观察马尔可夫决策广义克拉美-罗下界分布式决策任务规划

Local Observations-Based Energy-Efficient Multi-Cell Beamforming via Multi-Agent Reinforcement Learning: 《Journal of Communications and Information Networks》2022年第2期170-180,共11页Kaiwen Yu Gang Wu Shaoqian Li Geoffrey Ye Li; Fundamental Research Funds for the Central Universities(ZYGX2020ZB042)。; With affordable overhead on information exchange,energy-efficient beamforming has potential to achieve both low power consumption and high spectral efficiency.This paper formulates the problem of joint beamforming and...; 关键词：distributed beamforming energy efficiency deep reinforcement learning interference-cooperation POMDP

一种无人车无信号保护路口左转规划方法: 《合肥工业大学学报（自然科学版）》2022年第5期665-672,共8页夏志远黄妙华李其仲; 国家重点研发计划资助项目(2018YFC0808405);政府间国际科技创新合作重点专项资助项目(SQ2018YFGH000405);中央高校基本科研业务费资助项目(205207002)。; 为解决无人驾驶车辆在无信号保护路口左转规划中高效性与安全性相矛盾的问题,文章参考路径-速度解耦规划思路,提出一种左转规划区对角线分割(diagonal division of the planning area of left turns,DDPALT)的路径生成方法,结合基于部...; 关键词：无人驾驶车辆无信号路口左转规划交通安全部分可观察马尔可夫决策过程(POMDP)

POMDP