webcal

Description

Seminar on control theory

Wednesday, 24 January, 2.30-3.30

====================

Venue: Room 105, Dept. of Mathematics

Host: Sudhir Ghorpade

Speaker: Chandrajit Bajaj

Affiliation: University of Texas at Austin, USA

Title: Optimized Decision-Making via Active Learning of Stochastic Hamiltonians

Abstract: A Hamiltonian represents the energy of a dynamical system in phase space with coordinates of position and momentum. Hamilton’s equations of motion are obtainable as coupled symplectic differential equations. In this talk, I shall show how optimized decision-making (action sequences) can be obtained via a reinforcement learning problem wherein the agent interacts with the unknown environment to simultaneously learn a Hamiltonian surrogate and the optimal action sequences using Hamilton dynamics, by invoking the Pontryagin Maximum Principle. We use optimal control theory to define an optimal control gradient flow, which guides the reinforcement learning process of the agent to progressively optimize the Hamiltonian while simultaneously converging to the optimal action sequence. Extensions to stochastic Hamiltonians leading to stochastic action sequences and the free-energy principle shall also be discussed.

This is joint work with Taemin Heo, Minh Nguyen
Brief Bio: Professor Chandrajit Bajaj is a Computational Applied Mathematics Chair in Visualization at the Department of Computer Science and the Oden Institute of Computational Engineering and Sciences at the University of Texas at Austin, USA. He is a Fellow of AAAS, ACM. IEEE and SIAM. More information about him is available at: http://www.cs.utexas.edu/~bajaj

Chandrajit Bajaj, University of Texas at Austin, USA