An MILP-Based Solution Scheme for Factored Markov Decision Processes_Tongji SEM An MILP-Based Solution Scheme for Factored Markov Decision Processes

An MILP-Based Solution Scheme for Factored Markov Decision Processes

Thu, Apr 23, 2026

SPEAKER: 余文忠副教授, 香港理工大学

TIME/DATE: 2026.4.27 10：00

CLASSROOM: A308

ABSTRACT:

Factored Markov decision processes (MDPs) are a prominent paradigm within the artificial intelligence community for modeling and solving large-scale MDPs whose rewards and dynamics decompose into smaller, loosely interacting components. Through the use of value function approximations, dynamic Bayesian networks and context-specific independence, factored MDPs can achieve an exponential reduction in the state space of an MDP and thus scale to problem sizes that are beyond the reach of classical MDP algorithms. However, factored MDPs are typically solved using custom-designed algorithms that can require meticulous implementations and considerable fine-tuning. In this paper, we propose a mathematical programming approach to solving factored MDPs. Unlike existing solution schemes, our approach leverages off-the-shelf solvers, which enables streamlined implementation and maintenance. Moreover, we exploit the factored structure in both state and action spaces, and we employ feature representations to unify existing methods while taking advantage of context-specific independence in problem classes that other approaches cannot solve efficiently. To further enhance scalability, we introduce a feature learning scheme that automatically identifies informative features and a dynamic basis approximation scheme that adaptively refines our value function approximations. Our numerical experiments demonstrate the potential of our approach.

GUEST BIO:

Dr. Man-Chung Yue is currently an Associate Professor at the Department of Applied Mathematics of The Hong Kong Polytechnic University. He received his B.Sc. degree in Mathematics and Ph.D. degree in Systems Engineering and Engineering Management, both from The Chinese University of Hong Kong. He also worked at The University of Hong Kong as an Assistant Professor and Imperial College London as a Research Associate. His research focuses on continuous optimization and its interplay with decision-making under uncertainty, signal processing, machine learning, and operations research.

Privacy settings