Stochastic Algorithms for One Pass Learning

This talk was first given in the Sixth Annual Machine Learning Symposium of the New York Academy of Sciences and in the NIPS 2011 Workshop on Computational Trade-offs in Statistical Learning.

Summary

The goal of the presentation is to describe practical stochastic gradient algorithms that process each training example only once, yet asymptotically match the performance of the true empirical optimum. This statement needs, of course, to be made more precise. To achieve this, we'll review the works of Nevel'son and Has'minskij (1972), Fabian (1973, 1978), Murata & Amari (1998), Bottou & LeCun (2004), Polyak & Juditsky (1992), Wei Xu (2010), and Bach & Moulines (2011). We will then show how these ideas lead to practical algorithms and new challenges.

Links

Papers

Léon Bottou and Yann LeCun: Large Scale Online Learning, Advances in Neural Information Processing Systems 16 (NIPS 2003), Edited by Sebastian Thrun, Lawrence Saul and Bernhard Schölkopf, MIT Press, Cambridge, MA, 2004.

more...

Léon Bottou and Yann LeCun: On-line Learning for Very Large Datasets, Applied Stochastic Models in Business and Industry, 21(2):137-151, 2005.

more...

leon.bottou.org

Table of Contents

Stochastic Algorithms for One Pass Learning

Summary

Links

Papers