![]() |
Craig Boutilier's Current and Recent Students |
Georgios Chalkiadakis is a Ph.D. student interested in various aspects of reinforcement learning and coalition formation. His work combines these concepts, aiming at enabling rational Bayesian agents to take rewarding actions when faced with sequential decision making under uncertainty in multiagent environments.
Bowen Hui is a Ph.D. student interested in learning a person's utility function by modeling it as an inverse reinforcement learning problem. She has done work on decision theoretic user modeling using exact belief state monitoring for POMDPs in a word prediction application. She is also interested in preference elicitation, multi-agent communication, psychological effects of preference elicitation and negotiation, among others.
Yilan Gu is a Ph.D. student interested in knowledge representation and first-order models of stochastic systems. She is currently working on problems of solving first-order MDP problem more efficiently using macro-actions. She is also interested in other problems related to reasoning about actions.
Scott Sanner is a Ph.D. student interested in knowledge representation and reasoning, planning under uncertainty, and reinforcement learning.
Nathanael Hyafil is a Ph.D. student working on computational aspects of mechanism design and game theory. In particular, recent work has focused on the design of mechanisms requiring only partial type revelation while preserving incentive properties and reasonable decision quality.
Darius Braziunas is a Ph.D. student interested in various decision-theoretic aspects of preference elicitation, in particular graphical multiattribute utility models, sequentially optimal elicitation strategies, and decision making with partial utility information.
Kevin Regan is a Ph.D. student interested in artifical intelligence, modeling reputation and trust, computational aspects of game theory, mechanism design and electronic markets, multi-agent systems and cognitive science.
Michael Pavlin is pursuing his Ph.D. at the Rotman School of Business at the University of Toronto. He is interested in computational issues that arise in economic systems. His recent work focused on ascending auctions in the presence of externalities and is motivated by applications to wireless communication networks. He completed his MSc, "Ascending Auction for Markets with Externalities and Applications to Routing in Wireless Networks," in 2006.
Pascal Poupart is an Assistant Professor of Computer Science at the University of Waterloo. Pascal's interests include decision making under uncertainty (including MDPs and POMDPs), machine learning, applications to health informatics and dialog systems. He completed his PhD, "Exploiting Structure to Efficiently Solve Large Scale Partially Observable Markov Decision Processes," in 2005.
Bob Price is Research Scientist at Xerox PARC (following a postdoc at The University of Alberta with the Alberta Ingenuity Center for Machine Learning. His interests include reinforcement learning (including imitation, teaching and knowledge transfer), machine learning, Markov decision processes, and web personalization. He completed his PhD, "Accelerating Reinforcement Learning through Imitation," in 2003.
Richard Dearden completed his Ph.D. in Fall 2000 on Markov decision processes as a model for decision theoretic planning and reinforcement learning. He developed methods for the learning and use of Bayesian methods for estimating value functions and models in reinforcement learning and developing exploration methods based on uncertainty in these estimates. He has been deeply involved in much of the work on structured policy construction algorithms for MDPs as well. He is currently a Senior Lecturer at the University of Birmingham School of Computer Science.
Alexander Kress completed his M.Sc. degree in 2004. His thesis title was "An Incremental Elicitation Approach to Limited-Precision Auctions."
Darius Braziunas finished his M.Sc. degree in 2003. His thesis title was "Stochastic Local Search for POMDP controllers".
Tianhan Wang finished an M.Sc. degree in 2003. His thesis title was "Preference Elicitation using the Minimax Regret Decision Criterion."
Return to Craig's HomePage