2:00pm, Thursday, 20 September 2007:

Regressing without supervision

Martin Raphan

There are two standard frameworks for describing optimal estimation of a random quantity from corrupted measurements. The first technique uses explicit models of both the corruption process and the prior distribution of the quantity to be estimated in order to formulate an optimal estimator via Bayes' rule. The second technique uses supervised training on a data set, which has clean samples paired with corrupted versions of those samples, to choose an optimal estimator from some family. In many applications, however, one has available neither a model of the prior distribution, nor uncorrupted measurements of the variable being estimated. We will describe a linear algebraic framework for expressing the least squares estimator (regression function) entirely in terms of a model of the corruption process and the density of the corrupted measurements. We show practical implementations of these nonparametric estimators for various corruption models, and demonstrate the use of this procedure for denoising photographic images corrupted by additive Gaussian noise. We also describe a dual, prior-free formulation of the Mean Square Error (MSE) and show how this may be used for the selection of optimal estimators from a parametric family. We then demonstrate the use of this dual formulation in image denoising.