Meta-reasoning:
Thinking about thinking.

David Duvenaud

Figures thanks to Nick Hay
Slide software thanks to Christian!







Common problems




All require estimating the relative value of computation.

Meta-reasoning


One formalism for Meta-reasoning:


Principles of Meta-reasoning (1991)
(Russell, Wefald, Karnaugh, Karp, Mcallester, Subramanian, Wellman)


First insight:

Expected Utility of a Computation

Expected Utility of a Computation


Is just difference between
Computing b) is hard, because you don't know which action you will choose!

Expected Utility of a Computation


Notation gets hairy:
Expect expected utilities of actions not to change after computation,

Prior Work: Old

(Courtesy of Nick Hay)


Newer Related Work



Selecting Computations: Theory and Applications
(Hay, Russell, Tolpin, Shimony 2012)


Selecting Computations: Theory and Applications
(Hay, Russell, Tolpin, Shimony 2012)


Number of computations of optimal policy is bounded:

Also note that UCT and Monte Carlo Tree Search are bandit algorithms:

Application to Research (From Roger Grosse)

Research requires evaluating ideas with small chance of large payoff.
How to calibrate expectation of success, and choose worthwhile projects?

One strategy (that he actually tried):
Roger: "I noticed I was vastly overconfident about ideas outside my areas of expertise, and underconfident about ideas inside them."

Conclusion


Meta-reasoning formalizes a currently ad-hoc part of inference.

Early days for the field! Most existing work has been:
Might be a big opening for model-based meta-reasoning.