基于强化学习的订单生产型企业的订单接受策略被引量：10

Reinforcement learning based order acceptance policy in make-to-order enterprises

出　　处：《系统工程理论与实践》2014年第12期3121-3129,共9页Systems Engineering-Theory & Practice

基　　金：国家自然科学基金(71201020);中央高校基本科研业务经费(N120406002);中国博士后科学基金(2013M540233)

摘　　要：针对订单生产型企业在订单接受决策过程中的不确定性,基于强化学习的思想,在考虑生产成本、延迟惩罚成本以及拒绝成本的前提下,引入顾客等级这一要素,从收益管理的角度建立了基于半马尔可夫决策过程的订单接受模型.在此基础上,提出了基于SMART算法的最优订单接受策略求解方法,旨在最大化订单生产型企业的长期利润.仿真实验结果表明:基于SMART算法得到的订单接受策略要优于基于先来先服务方法得到的订单接受策略;同时,针对考虑顾客等级的仿真实验及数据分析结果,也验证了引入顾客等级这一要素的必要性和重要性.From the perspective of revenue management, a semi-Markov decision process based order acceptance model （SMDP-OA model） is proposed on the basis of reinforcement learning. This model is to solve the uncertainties during order accepting decision processes for make-to-order （MTO） compa- nies, not only taking into account the production cost, delay cost and reject cost of the incoming order, but also the factor of customer level. Besides, SMART-based optimal order acceptance algorithm is pre- sented, aiming at maximizing the profit of MTO companies. The simulation experiments indicate that the proposed SMART-based algorithm performs better than the algorithm based on the first-come-first-serve （FCFS） order acceptance strategy. Moreover, the experiments also justify the necessity and importance of incorporating the customer level factor during the determination of the optimal order acceptance policy.

关键词：收益管理订单接受 SMART算法平均利润强化学习

分类号：C934[经济管理—管理学]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于强化学习的订单生产型企业的订单接受策略被引量：10

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于强化学习的订单生产型企业的订单接受策略 被引量：10

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于强化学习的订单生产型企业的订单接受策略被引量：10