An MILP-Based Solution Scheme for Factored Markov Decision Processes
Thu, Apr 23, 2026
SPEAKER: 余文忠 副教授, 香港理工大学
TIME/DATE: 2026.4.27 10:00
CLASSROOM: A308

ABSTRACT:
Factored Markov decision processes (MDPs) are a prominent paradigm within the artificial intelligence community for modeling and solving large-scale MDPs whose rewards and dynamics decompose into smaller, loosely interacting components. Through the use of value function approximations, dynamic Bayesian networks and context-specific independence, factored MDPs can achieve an exponential reduction in the state space of an MDP and thus scale to problem sizes that are beyond the reach of classical MDP algorithms. However, factored MDPs are typically solved using custom-designed algorithms that can require meticulous implementations and considerable fine-tuning. In this paper, we propose a mathematical programming approach to solving factored MDPs. Unlike existing solution schemes, our approach leverages off-the-shelf solvers, which enables streamlined implementation and maintenance. Moreover, we exploit the factored structure in both state and action spaces, and we employ feature representations to unify existing methods while taking advantage of context-specific independence in problem classes that other approaches cannot solve efficiently. To further enhance scalability, we introduce a feature learning scheme that automatically identifies informative features and a dynamic basis approximation scheme that adaptively refines our value function approximations. Our numerical experiments demonstrate the potential of our approach.
GUEST BIO:
Dr. Man-Chung Yue is currently an Associate Professor at the Department of Applied Mathematics of The Hong Kong Polytechnic University. He received his B.Sc. degree in Mathematics and Ph.D. degree in Systems Engineering and Engineering Management, both from The Chinese University of Hong Kong. He also worked at The University of Hong Kong as an Assistant Professor and Imperial College London as a Research Associate. His research focuses on continuous optimization and its interplay with decision-making under uncertainty, signal processing, machine learning, and operations research.
We protect your privacy. We use cookies to personalize content, provide features, and analyze traffic to our website anonymously and in a privacy compliant manner. By law, we may store cookies on your device if they are strictly necessary for the operation of this site. For all other cookie types, we need your permission. For more information, please see the privacy policy linked below.