| Date | Speaker | Title | Downloads |
| January 21st, 2008 | Kevin Regan | "Robustness in Markov Decision Problems with Uncertain Transition
Matrices"
(Nilim & Ghaoui, 2003) Related: "Robust Control of Markov Decision Processes with Uncertain Transition Matrices" (Nilim & Ghaoui, 2004) Related: "Robust Dynamic Programming" (Iyengar, 2005) | paper, related 1, related 2 |
| February 04th, 2008 | Paolo Viappiani | "Preference-based search using Example-critiquing with Suggestions"
(Viappiani, Faltings, & Pu 2006) Related: "Conversational recommenders with adaptive suggestions" (Viappiani, Pu, Faltings 2007) Related: Preference-based search with suggestions. (Viappiani 2007) | paper, related 1, related 2 |
| February 18th, 2008 | n/a | cancelled | n/a |
| March 03rd, 2008 | Kevin Regan | Percentile Optimization in Uncertain Markov Decision Processes with Application to Efficient Exploration (Delage & Mannor 2007) | paper |
| March 04th, 2008 | Sebastian Thrun* | When will we get our robotic car? | abstract |
| March 17th, 2008 | n/a | cancelled | n/a |
| March 31st, 2008 | Laurent Charlin |
An Analytic Solution to Discrete Bayesian Reinforcement Learning
(Poupart, Vlassis, Hoey, Regan, 2006) Related: Model-based Bayesian Reinforcement Learning in Partially Observable Domains (Poupart & Vlassis, 2008) | paper,
slides,
tutorial
(parts3-5) paper |
| April 14th, 2008 | Paolo Viappiani | Selected Topics in Behavioral Decision Theory (prominence effect and preference-reversal, anchoring effect and heuristics) | paper paper |
| April 28th, 2008 | n/a | cancelled | n/a |
| May 05th, 2008 | Paolo Viappiani | Dynamic Critiquing (Reilly, McCarthy, McGinty, Smyth 2004), Incremental critiquing (Reilly, McCarthy, McGinty, Smyth 2005), Evaluating compound critiquing recommenders: a real-user study (Reilly, Zhang, McGinty, Pu, Smyth 2007) | paper paper paper |
*: External speakers. Also note different meeting day and location.
| Date | Speaker | Title | Downloads |
| November 05th, 2007 | n/a | Organizational meeting | n/a |
| November 19th, 2007 | Laurent Charlin | "Mixed Collaborative and Content-Based Filtering with User-Contributed
Semantic Features" (Garden & Dudek 2006) Related: "Semantic feedback for hybrid recommendations in Recommendz" (Garden & Dudek 2005) | paper, related |
| December 03rd, 2007 | open discussion |
"The Netflix Prize" (Bennett & Lanning 2007), "Learning a Meta-Level Prior for Feature Relevance from Multiple Related Tasks" (Lee, Chatalbashev, Vickrey & Koller 2007) |
paper1, paper2 |
| December 07th, 2007 | Mike Wellman* | Empirical Game-Theoretic Analysis for Practical Strategic Reasoning | abstract |
| December 17th, 2007 | open discussion | Active Collaborative Prediction with Maximum Margin Matrix
Factorization (Rish & Tesauro 2007) Background: Maximum Margin Matrix Factorization (Srebro, Rennie & Jaakkola 2004) | paper,
paper2 |
| December 18th, 2007 | Roni Khardon* | Learning to Classify Graphs and Hypergraphs | abstract |
*: External speakers. Also note different meeting day and location.
Winter 2007: 12:30 pm @ PT 266
| Date | Speaker | Title | Downloads |
| January 24th, 2007 | David Parkes* | Adaptive Online Allocation Mechanisms for Single-Valued Domains | abstract |
| February 05th, 2007 | Laurent Charlin* | Automated Hierarchy Discovery for Planning in Partially Observable Environments | abstract, paper |
| February 16th, 2007 | Darius Braziunas* | Introduction to Preference Elicitation | paper |
| February 19th, 2007 | Mike Pavlin | Ascending Auctions for Markets with Structured Externalities | paper |
*: External speakers. Also note different meeting day and location.
*** on hiatus until the summer or until further notice.
Fall 2006: 1:30pm @ SF 3207/PT 378/PT 266
| Date | Speaker | Title | Downloads |
| September 11th, 2006 | Georgios Chalkiadakis | A Bayesian Approach to Multiagent Reinforcement Learning and Coalition Formation under Uncertainty | n/a |
| September 18th, 2006 | n/a | Organizational meeting | n/a |
| October 02nd, 2006 | Kevin Regan (PT266, 2:00pm) | Bayesian Reputation Modeling in E-Marketplaces Sensitive to Subjectivity, Deception and Change | paper |
| October 16th, 2006 | cancelled | n/a | n/a |
| October 30th, 2006 | Nathanael Hyafil | Mechanism Design with Partial Revelation | paper |
| November 13th, 2006 | Georgios Chalkiadakis | Coalitional Bargaining with Agent Type Uncertainty | abstract |
| November 27th, 2006 | Bowen Hui | Modeling the Disruption to the User's Mental Model | abstract |
| December 11th, 2006 | all |
"Regrets Only. Online Stochastic Optimization under Time Constraints"
(Bent & Van Hentenryck 2004) "Scenario-Based Planning for Partially Dynamic Vehicle Routing with Stochastic Customers" (Bent & Van Hentenryck 2004) |
paper1, paper2 |
Summer 2006: Monday 10:00am @ PT 378
| Date | Speaker | Title | Downloads |
| May 29th, 2006 | n/a | Organizational meeting | n/a |
| June 12th, 2006 | Bowen Hui | "Bayesian Inverse Reinforcement Learning" (Ramachandran & Amir 2006) | paper |
| July 10th, 2006 (3pm, PT266) | Scott Sanner* | Practical Linear Value-approximation Techniques for First-order MDPs | paper |
| July 12th, 2006 (3pm, PT266) | Yilan Gu* | A Logic For Decidable Reasoning About Services | paper |
| July 13th, 2006 (1pm, PT266) | Nathanael Hyafil* | Regret-based Incremental Partial Revelation Mechanism Design | paper |
*: Additional/External speakers. Also note different meeting day and location.
Winter 2006: Monday 10:00am @ PT 378
| Date | Speaker | Title | Downloads |
| January 16th, 2006 | Bowen Hui | Who's Asking for Help? A Bayesian Approach to Intelligent Assistance | paper, |
| January 23rd, 2006 | n/a | cancelled | n/a |
| January 30th, 2006 | Scott Sanner |
"Samuel Meets Amarel: Automating Value Function Approximation using Global
State Space Analysis" (Mahadevan 2005), "Proto-Value Functions: Developmental Reinforcement Learning" (Mahadevan 2005) |
paper1, paper2 |
| February 06th, 2006 | n/a | cancelled | n/a |
| February 13th, 2006 | Jesse Hoey |
"Reinforcement Learning with Gaussian Processes"
(Engel, Mannor, Meir 2005), "Learning to Control an Octopus Arm with Gaussian Process Temporal Difference Methods" (Engel, Szabo, Volkinshtein 2005) |
paper 1, paper 2 |
| February 20th, 2006 | Scott Sanner | Rich Representations for Reinforcement Learning | paper, slides |
| February 27th, 2006 | Darius Braziunas | "Polyhedral Methods for Adaptive Choice-Based Conjoint Analysis," (Toubia, Hauser, & Simester 2004) | paper |
| March 06th, 2006 | Darius Braziunas | Computational Approaches to Preference Elicitation | paper |
| March 13th, 2006 | n/a | cancelled | n/a |
| March 20th, 2006 | Georgios Chalkiadakis | "Order Independent Equilibria" (Moldovanu and Winter 1995) | paper |
| March 27th, 2006 | n/a | cancelled | n/a |
| April 03rd, 2006 | Rich Sutton* | Experience-Oriented Artificial Intelligence | abstract |
| April 10th, 2006 | Georgios Chalkiadakis | "The Advantages of Compromising in Coalition Formation with Incomplete
Information" (Kraus, Shehory, & Taase 2004) "Feasible Formation of Coalitions Among Autonomous Agents in Non-Super-Additive Environments" (Shehory & Kraus 1999) |
paper1, paper2 |
| April 17th, 2006 | Craig Boutilier | "Dynamic Preferences in Multi-Criteria Reinforcement Learning"
(Natarajan & Tadepalli 2005) "Multi-criteria Reinforcement Learning" (Gabor et al. 1998) |
paper1 paper2 |
| April 24th, 2006 | Bowen Hui | Decision-Theoretic Human Computer Interaction | paper |
| April 27th, 2006 | Nathanael Hyafil* | Regret-based Incremental Partial Revelation Mechanisms | paper |
| May 01st, 2006 | n/a | cancelled | n/a |
| May 08th, 2006 | Joe Halpern* | Distributed Computing Meets Game Theory: Robus Mechanisms for Rational Secret Sharing and Multiparty Computation | abstract |
*: External speakers. Also note different meeting day and location.
Fall 2005: Monday 10:30am @ PT 378
| Date | Speaker | Title | Downloads |
| September 19th, 2005 | n/a | Organizational meeting | n/a |
| September 26th, 2005 | Jesse Hoey | "Location-Based Activity Recognition using Relational Markov Networks"
(Liao, Fox, & Kautx 2005), "Learning and Inferring Transportation Routines" (Liao, Fox & Kautz 2004) (review from last year) |
paper1, paper2 |
| October 03rd, 2005 | Jesse Hoey | "Activity recognition and abnormality detectiong with the switching
hidden semi-Markov model"
(Duong, Bui, Phung, & Vekatesh 2005) "Learning and detecting activities from movement trajectories using the hierarchical hidden Markov model" (Nguyen, Phung, Venkatesh, & Bui 2005) |
paper 1, paper 2 |
| October 10th, 2005 | n/a | cancelled due to (Canadian) Thanksgivings | n/a |
| October 11th, 2005 | Stuart Russell* | Uncertainty in an unknown world | abstract |
| October 11th, 2005 | Stuart Russell* | Reinforcement learning with partial programs | abstract |
| October 17th, 2005 | Scott Sanner, Mike Pavlin, Georgios Chalkiadakis | "Solving Transition Independent Decentralized Markov Decision Processes" (Becker, Zilberstein, Lesser, & Goldman 2004) | paper |
| October 24th, 2005 | Darius Braziunas, Scott Sanner | "Bounded Policy Iteration for Decentralized POMDPs" (Bernstein, Hansen, & Zilberstein 2005) | paper |
| October 31st, 2005 | Georgios Chalkiadakis, Mike Pavlin | "Near-Optimal Nonmyopic Value of Information in Graphical Models"
(Krause & Guestrin 2005) "Optimal Nonmyopic Value of Information in Graphical Models Efficient Algorithms and Theoretical Limits" (Krause & Guestrin 2005) |
paper1, paper2 |
| November 07th, 2005 | Nathanael Hyafil | "Intelligent Light Control using Sensor Networks" (Singhvi, Krause, Guestrin, Garrett, Matthews 2005) | background, paper |
| November 14th, 2005 | Mike Pavlin | "Charging and rate control for elastic traffic" (Kelly 1997), "Congestion Pricing and User Adaptation" (Ganesh et al 2001) |
paper1, paper2 |
| November 15th, 2005 | Peter Norvig* | AI in the Middle: Mediating between Author and Reader | abstract |
| November 21st, 2005 | Mike Pavlin and Nathanael Hyafil | "Specification Faithfulness in Networks with Rational Nodes" (Shneidman & Parkes 2004) | paper |
| November 29th, 2006 | Michael Kearns* | Behavioural Graph Colouring | abstract |
*: External speakers. Also note different meeting day and location.
Summer 2005: Monday 10:00am @ PT 378
| Date | Speaker | Title | Downloads |
| May 02nd, 2005 | n/a | cancelled | n/a |
| May 09th, 2005 | n/a | cancelled | n/a |
| May 16th, 2005 | Scott Sanner | "Linear program approximations for factored continuous-state Markov
Decision Processes" (Hauskrecht & Kveton 2003) "Solving Factored MDPs with Continuous and Discrete Variables" (Guestrin, Hauskrecht, & Kveton 2004) |
paper1, paper2 |
| May 23rd, 2005 | n/a | cancelled | n/a |
| May 30th, 2005 | Mike Pavlin | Ad Hoc Networks |
paper1, paper2 |
| June 06th, 2005 | n/a | cancelled | n/a |
Winter 2005: Monday 10:00am @ PT 378
| Date | Speaker | Title | Downloads |
| January 17th, 2005 | n/a | Organizational meeting | n/a |
| January 24th, 2005 | Nathanael Hyafil | "Approximately Efficient Online Mechanism Design" (Parkes, Singh & Yanovsky 2004) | paper |
| January 31st, 2005 | n/a | cancelled | n/a |
| February 7th, 2005 | Relu Patrascu | "Optimal Learning: Computational Procedures for Bayes-Adaptive Markov
Decision Processes" (Duff 2002) |
chapters 1,2,3 |
| February 14th, 2005 | Relu Patrascu | "Markovian Decision Processes with Uncertain Transition Probabilities" (Satia & Lave 1973) | (hardcopy) |
| February 21st, 2005 | Relu Patrascu | "Bayesian Q-Learning" (Dearden, Friedman & Russell 1998) | paper |
| February 28th, 2005 | Relu Patrascu | "Optimal Learning: Computational Procedures for Bayes-Adaptive Markov Decision Processes" (Duff 2002) | chapter 4 |
| March 07th, 2005 | n/a | cancelled | n/a |
| March 14th, 2005 | n/a | cancelled | n/a |
| March 21st, 2005 | n/a | cancelled | n/a |
| March 28th, 2005 | Relu Patrascu | New Approaches to Optimization and Utility Elicitation in Autonomic Computing (Patrascu et al. submitted) | paper |
| April 04th, 2005 | Jesse Hoey | "Solving POMDPs with Continuous or Large Discrete Observation Spaces" (Hoey & Poupart 2005) | paper |
| April 11th, 2005 | n/a | cancelled | n/a |
| April 18th, 2005 | Scott Sanner | "Approximate Linear Programming for First-order MDPs" (Sanner & Boutilier submitted) | paper |
| April 21st, 2005 | Nathanael Hyafil* | Computational Mechanism Design | paper |
| April 25th, 2005 | Darius Braziunas | "Local Utility Elicitation in GAI Models" (Braziunas & Boutilier submitted) | paper |
Fall 2004: Mondays 10:00am @ PT 378
| Date | Speaker | Title | Downloads |
| August 23rd, 2004 | n/a | Organizational meeting | n/a |
| September 13th, 2004 | Jesse Hoey | "The use of artificial intelligence in the design of an intelligent
cognitive orthosis for people with dementia" (Mihailidis et al. 2001), "A decision-theoretic approach to task assistance for persons with dementia" (Hoey et al. 2004) |
paper1, paper2 |
| September 20th, 2004 | Jesse Hoey | "Autominder: An Intelligent Cognitive Orthotic System for People with
Memory Impairment" (Pollack et al. 2003), "Adaptive Cognitive Orthotics: Combining Reinforcement Learning and Constraint-Based Temporal Reasoning" (Rudary, Singh & Pollack 2004) |
paper1, paper2 |
| September 27th, 2004 | Jesse Hoey | "An Overview of the Assisted Cognition Project" (Kautz, Fox, Etzioni,
Borriello & Arnstein 2002), "Learning and Inferring Transportation Routines" (Liao, Fox & Kautz 2004) |
paper1, paper2 |
| October 04th, 2004 | Nathanael Hyafil | Chapter 23 of "Microeconomic Theory" by Colell, Whinston, & Green, titled "Incentives and Mechanism Design" | n/a |
| October 11th, 2004 | n/a | cancelled due to (Canadian) Thanksgivings | n/a |
| October 18th, 2004 | Scott Sanner | Affine Algebraic Decision Diagrams (AADDs) and their Application to Structured Probabilistic Inference | abstract |
| October 25th, 2004 | Nathanael Hyafil | continue from October 04th | n/a |
| October 26th, 2004 | Adnan Darwiche* | A Perspective on Knowledge Compilation and its Role in Modern Propositional Inference | abstract |
| October 27th, 2004 | Adnan Darwiche* | Sensitivity Analysis in Bayesian Networks | abstract |
| November 01st, 2004 | Nathanael Hyafil | "Computational Criticisms of the Revelation Principle" (Conitzer & Sandholm 2003) | paper |
| November 08th, 2004 | Nathanael Hyafil | "Auction Design with Costly Preference Elicitation" (Parkes 2004) | paper |
| November 15th, 2004 | Scott Sanner | "Decomposable Negation Normal Form" (Darwiche 2001), "A Compiler for Deterministic Decomposable Negation Normal Form" (Darwiche 2002) |
paper1, paper2 |
| November 16th, 2004 | John Lafferty* | Data, Structure, and Geometry in Statistical Learning | abstract |
| November 22nd, 2004 | n/a | cancelled | n/a |
| November 23rd, 2004 | Jonathan Schaeffer* | Raising the Stakes | abstract |
| November 29th, 2004 | Relu Patrascu | "Predictive State Representations: A New Theory for Modeling Dynamical
Systems" (Singh, James, & Rudary 2004) "Learning Predictive State Representations" (Littman, Jong, Pardoe, & Stone 2003) |
paper1 paper2 |
| December 06th, 2004 | Darius Braziunas | "Multiattribute Utilities in Expected Utility Theory" (Fishburn 1977,
in "Conflicting Objectives in Decisions" (ed. Bell, Keeney, & Raiffa 1977)),
"Graphical models for preference and utility" (Bacchus & Grove 1995) |
paper2 |
| December 13th, 2004 | Darius Braziunas | "Reasoning With Conditional Ceteris Paribus Preference Statements"
(Boutilier, Brafman, Hoos, & Poole 1999), "UCP-Networks: A Directed Graphical Representation of Conditional Utilities" (Boutilier, Bacchus, & Brafman 2001) |
paper1, paper1 (long), paper2 |
*: External speakers. Also note different meeting day and location.
Summer 2004: (occasional) Wednesdays 12:00pm @ PT 378
| Date | Speaker | Title | Downloads |
| May 5th, 2004 | n/a | cancelled | n/a |
| May 12th, 2004 | Scott Sanner | ``Synthesis of Hierarchical Finite-State Controllers for POMDPs'', (Eric Hansen & Rong Zhou 2003) | paper |
| May 19th, 2004 | Relu Patrascu and Pascal Poupart | Repository for sharing code and data structures | notes1, notes2 |
| May 26th, 2004 | Alex Kress | ``The Communication Requirements of Efficient Allocations and Supporting Lindahl Prices'' (Nisan & Segal 2003) | paper |
| June 02nd, 2004 | Nathanael Hyafil | Regret Minimizing Equilibria and Mechanisms for Games with Strict Type Uncertainty | paper |
| June 09th, 2004 | n/a | cancelled | n/a |
| June 16th, 2004 | Bowen Hui | Probablistic and decision-theoretic user modeling in the context of software customization | paper |
| July 19th, 2004 | Darius Braziunas* | Stochastic Local Search for POMDP Controllers | abstract, paper |
*: Note irregular meeting day and location.
Winter 2004: Wednesdays 12:00pm @ PT 378
| Date | Speaker | Title | Downloads | |
| January 8th, 2004 | Ronen Brafman* | Efficient Learning in Stochastic Games | abstract | |
| January 21th, 2004 | n/a | Organizational meeting | n/a | |
| January 22th, 2004 | Jesse Hoey* | Learning POMDPs for Gesture and Facial Display Understanding | abstract, slides | |
| January 28th, 2004 | Yilan Gu | "Enveloped-based Planning in Relational MDPs", (Natalia H. Gardiol & Leslie Pack Kaelbling 2004) | paper | |
| February 4th, 2004 | Relu Patrascu | "Max-Margin Markov Networks", (Taskar, Guestrin, & Koller 2004) | paper | |
| February 11th, 2004 | Georgios Chalkiadakis | Bayesian Reinforcement Learning for Coalition Formation under Uncertainty | paper | |
| February 12th, 2004 | Pascal Poupart* | New Compression and Policy Search Techniques for Partially Observable Markov Decision Processes | abstract | |
| February 18th, 2004 | Relu Patrascu | Sequential Resource Allocation in Autonomic Systems with the Minimax Regret Decision Criterion | paper | |
| February 25th, 2004 | n/a | cancelled | n/a | |
| March 3rd, 2004 | Bowen Hui | Decision-Theoretic User Modeling: Word Prediction as a Test-Bed | slides | |
| March 10th, 2004 | n/a | cancelled | n/a | |
| March 17th, 2004 | Alex Kress | A Study of Limited-Precision, Incremental Elicitation in Auctions | paper | |
| March 24th, 2004 | Jesse Hoey | SPUDD | abstract | |
| March 31st, 2004 | Craig Boutilier | Eliciting Bid Taker Non-price Preferences in (Combinatorial) Auctions (joint with Tuomas Sandholm and Rob Shields) | abstract, paper | |
| April 1st, 2004 | Peter Marbach* | Cooperation in Wireless Ad Hoc Networks: A Market-Based Approach | abstract | |
| April 7th, 2004 | Yilan Gu | ...waiting... | ...waiting... | |
| April 14th, 2004 | Relu Patrascu | Efficient constraint-based optimization and preference elicitation with the minimax regret decision criterion | abstract | |
| April 21th, 2004 | Georgios Chalkiadakis | Using (Bayesian) Reinforcement Learning in Non-cooperative and Cooperative Game Theory | paper | |
| April 28th, 2004 | Scott Sanner | Relational and First-Order Decision-Theoretic Planning Foundations and Future Directions | paper |
*: External speakers. Also note different meeting day and location.
Fall 2003: Wednesdays 10:30am @ PT 378
| Date | Speaker | Title | Downloads |
| September 3th, 2003 | n/a | Organizational meeting | n/a |
| September 10th, 2003 | Relu Patrascu | Structured MDP Approximations using an Expanding Set of Basis Functions | ...waiting... |
| September 17th, 2003 | Craig Boutilier | Cooperative Negotiation in Autonomic Systems using Incremental Utility Elicitation | paper |
| September 24th, 2003 | Fahiem Bacchus | Value Elimination: Bayesian Inference via Backtracking Search | paper |
| October 01st, 2003 | Pascal Poupart, Yilan Gu, Scott Sanner | "Generalizing Plans to New Environments in Relational MDPs" (Guestrin, Koller, Gearhart & Kanodia, 2003) | paper |
| October 08th, 2003 | Scott Sanner | A Representation and Algorithm for Efficient Operations on First-Order Algebraic Decision Diagrams | paper |
| October 15th, 2003 | Pascal Poupart | "First-order probabilistic inference" (Poole 2003) | paper |
| October 22nd, 2003 | Relu Patrascu | "A Natural Policy Gradient" (Kakade 2002), "Covariant Policy Search" (Bagnell, Schneider 2003), "Policy Search by Dynamic Programming" (Bagnell, Kakade, Ng, Schneider 2003) |
paper, paper, paper |
| October 29th, 2003 | Scott Sanner |
"Relational Markov Models and Their Application to Adaptive Web Navigation"
(Anderson, Domingos, & Weld 2002) |
paper |
| November 05th, 2003 | Craig Boutilier | "SUPPLE: Automatically Generating User Interfaces" (Gajos & Weld,
submitted 2004) "Automatically Personalizing User Interfaces" (Weld, Anderson, Domingos, Etzioni, Gajos, Lau, & Welfman 2003) |
paper, paper |
| November 12th, 2003 | Alex Kress and Nathanael Hyafil | "Automated Mechanism Design: A New Application Area for Search
Algorithms" (Sandholm 2003), "Complexity of Mechanism Design" (Conitzer & Sandholm 2002) |
paper, paper |
| November 19th, 2003 | Pascal Poupart | Bounded Finite State Controllers | paper |
| November 26th, 2003 | Georgios Chalkiadakis | "Transition-Independent Decentrailized MDPs" (Becker, Zilberstein, Lesser, & Goldman 2003) | paper |
Summer 2003: (occasional) Wednesdays 12:30pm @ PT 378
| Date | Speaker | Title | Downloads |
| May 07th, 2003 | Pascal Poupart | Symbolic structured computation for probabilistic and decision theoretic models | abstract , slides (.ps) |
| May 14th, 2003 | John Boadway | "Preference Elicitation in Proxied Multiattribute Auctions" (Sunderam & Parkes, 2002) | paper (.pdf), slides (.ps) |
| May 21st, 2003 | Georgios Chalkiadakis | On Coalition Formation | abstract |
| May 28th, 2003 | Joe Halpern* | Great Expectations: On the Universality of Expected Utility | abstract |
| June 04th, 2003 | Nathanael Hyafil and Scott Sanner | Puterman's MDP book, Chapter 6 | n/a |
| June 25th, 2003 | Pascal Poupart | "Approximate Policy Iteration with a Policy Language Bias" (Fern, Yoon & Givan, submitted 2003) | link to paper |
| July 02th, 2003 | Georgios Chalkiadakis | "Coordination in Multiagent Reinforcement Learning: A Bayesian Approach" | abstract, paper (.pdf) |
*: External speakers. Also note different meeting day and location.
Winter 2003: Wednesdays 12:30pm @ PT 378
| Date | Speaker | Title | Downloads |
| January 8th, 2003 | Pascal Poupart | Value-Directed Compression of POMDPs | abstract |
| January 15th, 2003 | Alex Kress | Introduction to Auctions | slides (.pdf) |
| January 21st, 2003 | Mike Jordan* | Machine Learning and the Integration of Multiple Data Sources | abstract |
| January 22nd, 2003 | Darius Braziunas | POMDP Finite Policy Graphs | slides (.pdf) |
| January 28th, 2003 | Daphne Koller* | Probabilistic Models of Relational Data | abstract |
| January 29th, 2003 | Craig Boutilier | Language for Combinatorial Auctions | ...waiting... |
| February 5th, 2003 | Bob Price | Accelerating Reinforcement Learning with Imitation | slides (.pdf) |
| February 12th, 2003 | Tuomas Sandholm* | Roundtable discussion | homepage |
| February 13th, 2003 | Tuomas Sandholm* | Select issues in computing in games | abstract |
| February 19th, 2003 | Dale Schuurmans* | Monte Carlo inference via greedy importance sampling | abstract |
| February 26th, 2003 | Georgios Chalkiadakis | Multiagent Reinforcement Learning: Stochastic Games With Multiple Learning Players | abstract, paper (.ps) |
| March 5th, 2003 | Bowen Hui | Some User Models | slides (.pdf) |
| March 12th, 2003 | n/a | TAC update | n/a |
| March 19th, 2003 | Scott Sanner | An Introduction to Symbolic Dynamic Programming for First-Order MDP's | slides (.pdf) |
| March 19th, 2003 | Bob Price | The PhD Experience (meet at O'Gradys at 4:30pm!) | n/a |
| March 26th, 2003 | John Boadway and Yilan Gu | Auctions with severely bounded communication, by Blumrosen and Nisan (*postponed*) | paper |
| April 2th, 2003 | Yilan Gu and John Boadway | Auctions with severely bounded communication, by Blumrosen and Nisan | slides I (.ps), slides II (.ps) |
| April 9th, 2003 | n/a | april break | n/a |
| April 16th, 2003 | Tianhan Wang | Incremental Utility Elicitation with the Minimax Regret Decision Criterion | slides (.pdf) |
| April 23rd, 2003 | Bowen Hui | "Learning an Agent's Utility Function by Observing Behavior" (Chajewska, Koller, Ormoneit. ICML 2001) | paper (.ps), slides (.pdf) |
| April 23rd, 2003 | Bob Price | Farewell Party (meet at O'Gradys at 2:30pm!) | n/a |
| April 29th, 2003 | Yoav Shoham* | It's Not Your Father's Mechanism Design | abstract |
| April 30th, 2003 | Pascal Poupart | Bayesian learning in sequential decision processes | abstract |
*: External speakers. Also note different meeting day and location.
Fall 2002: Wednesdays 12pm @ PT 378
| Date | Speaker | Title | Downloads |
| September 12th, 2002 | n/a | Organizational meeting | n/a |
| September 18th, 2002 | n/a | Lab reconstruction meeting | n/a |
| September 25th, 2002 | Pascal Poupart | Introduction to POMDPs | slides (.ppt) |
| October 2nd, 2002 | Craig Boutilier | Brain Dump | slides (.pdf) |
| October 9th, 2002 | Pascal Poupart | POMDPs: Part II | (updated) slides (.ppt) |
| October 16th, 2002 | Darius Braziunas | Introduction to TAC | slides (.html) |
| October 23th, 2002 | Bowen Hui | Defining and Formalizing Software Customization | slides (.ppt) |
| October 30th, 2002 | Tianhan Wang | Elicitation with the Minimax Regret Model | slides (.pdf) |
| November 6th, 2002 | Scott Sanner | The Joy of Description Logics | slides (redirect) |
| November 13th, 2002 | Georgios Chalkiadakis | Coordination in Multiagent Reinforcement Learning: A Bayesian Approach | abstract, slides (.ppt) |
| November 20th, 2002 | John Boadway | RL in Continuous Double Auctions | abstract |
| November 27th, 2002 | Yilan Gu | Handling Uncertainty Systems in the Situation Calculus with Macro-actions | abstract |
| December 4th, 2002 | Nathanael Hyafil | Conformant Probabilistic Planning via CSPs | abstract, paper (.ps) |
| December 7th, 2002 | n/a | Holiday *Eggnog* Party~~ | n/a |
| December 11th, 2002 | n/a | Computer purchases | n/a |
CoGS Group
[Talks]
[Conferences]
[People]
[Individual Meetings]