Accelerated gradient methods for stochastic optimization and online learning
Contributor(s)
The Pennsylvania State University CiteSeerX Archives
Online Access
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.212.4115
http://www.cs.ust.hk/~jamesk/papers/nips09.pdf
Abstract
Regularized risk minimization often involves non-smooth optimization, either because of the loss function (e.g., hinge loss) or the regularizer (e.g., ℓ1-regularizer). Gradient methods, though highly scalable and easy to implement, are known to converge slowly. In this paper, we develop a novel accelerated gradient method for stochastic optimization that preserves the computational simplicity and scalability of gradient methods. The proposed algorithm, called SAGE (Stochastic Accelerated GradiEnt), exhibits fast convergence rates on stochastic composite optimization with convex or strongly convex objectives. Experimental results show that SAGE is faster than recent (sub)gradient methods including FOLOS, SMIDAS and SCD. Moreover, SAGE can also be extended to online learning, resulting in a simple algorithm with the best regret bounds currently known for these problems.
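The abstract only names the setting (a smooth stochastic loss plus a non-smooth regularizer such as ℓ1), so the following is a minimal Python sketch of the general accelerated proximal stochastic gradient template for such composite problems. It is not the paper's exact SAGE update, whose interpolation and step-size rules are specified in the paper; the objective, step size eta, and momentum schedule below are illustrative assumptions.

```python
# Sketch: accelerated proximal stochastic gradient on an l1-regularized
# least-squares objective (1/2n)||Ax - b||^2 + lam*||x||_1.
# The non-smooth l1 term is handled by its proximal operator (soft-thresholding),
# while the smooth part is approximated by a single-sample stochastic gradient.
import numpy as np

def soft_threshold(v, tau):
    """Proximal operator of tau * ||.||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def accelerated_prox_sgd(A, b, lam=0.1, eta=0.01, epochs=20, seed=0):
    rng = np.random.default_rng(seed)
    n, d = A.shape
    x = np.zeros(d)        # current iterate
    x_prev = np.zeros(d)   # previous iterate, used for the momentum term
    for t in range(1, epochs * n + 1):
        beta = (t - 1) / (t + 2)          # Nesterov-style momentum weight (assumed schedule)
        y = x + beta * (x - x_prev)       # extrapolated point
        i = rng.integers(n)               # sample one data point (stochastic gradient)
        grad = A[i] * (A[i] @ y - b[i])   # gradient of the smooth part at y
        x_prev = x
        x = soft_threshold(y - eta * grad, eta * lam)  # prox step on the l1 term
    return x

if __name__ == "__main__":
    # Tiny synthetic usage example: recover a sparse vector from noisy measurements.
    rng = np.random.default_rng(1)
    A = rng.normal(size=(200, 10))
    x_true = np.zeros(10)
    x_true[:3] = [2.0, -1.0, 0.5]
    b = A @ x_true + 0.01 * rng.normal(size=200)
    print(np.round(accelerated_prox_sgd(A, b), 2))
```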
Date
2012-02-21
Type
text
Identifier
oai:CiteSeerX.psu:10.1.1.212.4115
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.212.4115