检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:Mathias Oster Leon Sallandt Reinhold Schneider
机构地区:[1]Technische Universität Berlin,Strasse des 17.Juni 135,10623 Berlin,Germany
出 处:《Journal of Computational Mathematics》2024年第3期638-661,共24页计算数学(英文)
基 金:support from the Research Training Group“Differential Equation-and Data-driven Models in Life Sciences and Fluid Dynamics:An Interdisciplinary Research Training Group(DAEDALUS)”(GRK 2433)funded by the German Research Foundation(DFG).
摘 要:We treat infinite horizon optimal control problems by solving the associated stationary Bellman equation numerically to compute the value function and an optimal feedback law.The dynamical systems under consideration are spatial discretizations of non linear parabolic partial differential equations(PDE),which means that the Bellman equation suffers from the curse of dimensionality.Its non linearity is handled by the Policy Iteration algorithm,where the problem is reduced to a sequence of linear equations,which remain the computational bottleneck due to their high dimensions.We reformulate the linearized Bellman equations via the Koopman operator into an operator equation,that is solved using a minimal residual method.Using the Koopman operator we identify a preconditioner for operator equation,which deems essential in our numerical tests.To overcome computational infeasability we use low rank hierarchical tensor product approximation/tree-based tensor formats,in particular tensor trains(TT tensors)and multi-polynomials,together with high-dimensional quadrature,e.g.Monte-Carlo.By controlling a destabilized version of viscous Burgers and a diffusion equation with unstable reaction term numerical evidence is given.
关 键 词:Feedback control Dynamic programming Hamilton-Jacobi-Bellman Tensor product approximation Variational Monte-Carlo
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222