bluelogo.gif (1643 bytes)

home page
people
research
publications
seminars
travel
search
UCL

Coaching Variables for Regression and Classification

Robert Tibshirani
Department of Public Health Sciences and Department of Statistics
University of Toronto

Geoffrey E. Hinton
Department of Computer Science
University of Toronto

Abstract

In a regression or classification setting where we wish to predict Y from x1,x2,…, xp, we suppose that an additional set of ’coaching‘ variables z1,z2,…, zm are available in our training sample. These might be variables that are difficult to measure, and they will not be available when we predict Y from x1,x2,…, xp in the future. We consider two methods of making use of the coaching variables in order to improve the prediction of Y from x1,x2,…, xp. The relative merits of these approaches are discussed and compared in a
number of examples.

Download:  [ps] [pdf]

Statistics and Computing, 8, 25-33

[home page]  [publications]