Generative Intrinsic Optimization: Intrinsic Control with Model Learning

This paper is a preprint and has not been certified by peer review.


Authors

Jianfei Ma

Abstract

The future sequence represents the outcome of executing an action in the environment. When driven by the information-theoretic notion of mutual information, an agent seeks maximally informative consequences. The explicit outcome may vary across state, return, or trajectory, serving different purposes such as credit assignment or imitation learning. However, the problem of combining intrinsic motivation with reward maximization is often neglected. In this work, we propose a variational approach that jointly learns the quantities needed to estimate the mutual information and the dynamics model, providing a general framework for incorporating different forms of outcomes of interest. Integrated into a policy iteration scheme, our approach is guaranteed to converge to the optimal policy. While we focus mainly on theoretical analysis, our approach opens up the possibility of leveraging intrinsic control with model learning to enhance sample efficiency and to incorporate the uncertainty of the environment into decision-making.
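
For context, intrinsic-control objectives of this kind are commonly handled with a variational lower bound on the mutual information between actions and future outcomes given the state. The sketch below shows the standard Barber-Agakov-style bound, not necessarily the paper's exact formulation; the outcome variable $W$, the variational posterior $q_\phi$, and the policy $\pi$ are illustrative notation:

$$
I(W; A \mid S) \;\geq\; \mathbb{E}_{s,\; a \sim \pi(\cdot \mid s),\; w \sim p(\cdot \mid s, a)}\Big[\log q_\phi(a \mid w, s) - \log \pi(a \mid s)\Big],
$$

where $p(w \mid s, a)$ is the (possibly learned) dynamics over outcomes. The bound holds because replacing the true posterior $p(a \mid w, s)$ with $q_\phi$ drops a nonnegative KL term. Maximizing the right-hand side with respect to $q_\phi$ tightens the estimate, while maximizing it with respect to the policy produces an intrinsic signal that can be combined with the extrinsic reward.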
