Nicolò Cesa-Bianchi and Gábor Lugosi
Cambridge University Press, 2006 ISBN 0521841089 Published March 2006, 406 pages ...beware of mathematicians, and all those who make empty prophecies. The danger already exists that the mathematicians have made a covenant with the devil to darken the spirit and to confine man in the bonds of Hell. St. Augustine, De Genesi ad Litteram libri duodecim. Liber Secundus, 17, 37. Errata (as of September 8, 2006) A review (in French) appeared on the Gazette of Mathématiciens, 110, October 2006. |
This book offers the first comprehensive treatment of the problem
of predicting "individual sequences". Unlike standard statistical
approaches to forecasting, prediction of individual sequences does
not impose any probabilistic assumption on the data-generating
mechanism. Yet, prediction algorithms can be constructed that work
well for all possible sequences, in the sense that their performance
is always nearly as good as the best forecasting strategy in a given
reference class.
The central theme is the model of "prediction using expert advice",
a general framework within which many related problems can be cast
and discussed. Repeated game playing, adaptive data compression,
sequential investment in the stock market, sequential pattern analysis,
and several other problems are viewed as instances of the experts'
framework and analyzed from a common nonstochastic standpoint that
often reveals new and intriguing connections. Old and new forecasting
methods are described in a mathematically precise way with the purpose
of characterizing their theoretical limitations and possibilities.
Preface 1 Introduction 2 Prediction with expert advice 2.1 Weighted average prediction 2.2 An optimal bound 2.3 Bounds that hold uniformly over time 2.4 An improvement for small losses 2.5 Forecasters using the gradient of the loss 2.6 Scaled losses and signed games 2.7 The multilinear forecaster 2.8 The exponential forecaster for signed games 2.9 Simulatable experts 2.10 Minimax regret 2.11 Discounted regret 2.12 Bibliographic remarks 2.13 Exercises 3 Tight bounds for specific losses 3.1 Introduction 3.2 Follow the best expert 3.3 Exp-concave loss functions 3.4 The greedy forecaster 3.5 The aggregating forecaster 3.6 Mixability for certain losses 3.7 General lower bounds 3.8 Bibliographic remarks 3.9 Exercises 4 Randomized prediction 4.1 Introduction 4.2 Weighted average forecasters 4.3 Follow the perturbed leader 4.4 Internal regret 4.5 Calibration 4.6 Generalized regret 4.7 Calibration with checking rules 4.8 Bibliographic remarks 4.9 Exercises 5 Efficient forecasters for large classes of experts 5.1 Introduction 5.2 Tracking the best expert 5.3 Tree experts 5.4 The shortest path problem 5.5 Tracking the best of many actions 5.6 Bibliographic remarks 5.7 Exercises 6 Prediction with limited feedback 6.1 Introduction 6.2 Label efficient prediction 6.3 Lower bounds 6.4 Partial monitoring 6.5 A general forecaster for partial monitoring 6.6 Hannan consistency and partial monitoring 6.7 Multi-armed bandit problems 6.8 An improved bandit strategy 6.9 Lower bounds for the bandit problem 6.10 How to select the best action 6.11 Bibliographic remarks 6.12 Exercises 7 Prediction and playing games 7.1 Games and equilibria 7.2 Minimax theorems 7.3 Repeated two-player zero-sum games 7.4 Correlated equilibrium and internal regret 7.5 Unknown games: game-theoretic bandits 7.6 Calibration and correlated equilibrium 7.7 Blackwell's approachability theorem 7.8 Potential-based approachability 7.9 Convergence to Nash equilibria 7.10 Convergence in unknown games 7.11 Playing against opponents that react 7.12 Bibliographic remarks 7.13 Exercises 8 Absolute loss 8.1 Simulatable experts 8.2 Optimal algorithm for simulatable experts 8.3 Static experts 8.4 A simple example 8.5 Bounds for classes of static experts 8.6 Bounds for general classes 8.7 Bibliographic remarks 8.8 Exercises 9 Logarithmic loss 9.1 Sequential probability assignment 9.2 Mixture forecasters 9.3 Gambling and data compression 9.4 The minimax optimal forecaster 9.5 Examples 9.6 The Laplace mixture 9.7 A refined mixture forecaster 9.8 Lower bounds for most sequences 9.9 Prediction with side information 9.10 A general upper bound 9.11 Further examples 9.12 Bibliographic remarks 9.13 Exercises 10 Sequential investment 10.1 Portfolio selection 10.2 The minimax wealth ratio 10.3 Prediction and investment 10.4 Universal portfolios 10.5 The EG investment strategy 10.6 Investment with side information 10.7 Bibliographic remarks 10.8 Exercises 11 Linear pattern recognition 11.1 Prediction with side information 11.2 Bregman divergences 11.3 Potential-based gradient descent 11.4 The transfer function 11.5 Forecasters using Bregman projections 11.6 Time-varying potentials 11.7 The elliptic potential 11.8 A nonlinear forecaster 11.9 Lower bounds 11.10 Mixture forecasters 11.11 Bibliographic remarks 11.12 Exercises 12 Linear classification 12.1 The zero-one loss 12.2 The hinge loss 12.3 Maximum margin classifiers 12.4 Label efficient classifiers 12.5 Kernel-based classifiers 12.6 Bibliographic remarks 12.7 Exercises 13 Appendix 13.1 Inequalities from probability theory 13.1.1 Hoeffding's inequality 13.1.2 Bernstein's inequality 13.1.3 Hoeffding-Azuma inequality and related results 13.1.4 Khinchine's inequality 13.1.5 Slud's inequality 13.1.6 A simple limit theorem 13.1.7 Proof of Theorem 8.3 13.1.8 Rademacher averages 13.1.9 The Beta distribution 13.2 Basic information theory 13.3 Basics of classification Bibliography Author index