SIAM Review, volume 60, issue 2, pages 223-311

Optimization Methods for Large-Scale Machine Learning

Léon Bottou ¹, Frank E. Curtis ², Jorge Nocedal ³
¹ Facebook, Inc.
³ Industrial Engineering and Management Sciences
Publication type: Journal Article
Publication date: 2018-05-03
Journal: SIAM Review
Scimago quartile: Q1
SJR: 2.900
CiteScore: 16.9
Impact factor: 10.8
ISSN: 0036-1445, 1095-7200
Subject areas: Computational Mathematics, Applied Mathematics, Theoretical Computer Science
Abstract
This paper provides a review and commentary on the past, present, and future of numerical optimization algorithms in the context of machine learning applications. Through case studies on text classification and the training of deep neural networks, we discuss how optimization problems arise in machine learning and what makes them challenging. A major theme of our study is that large-scale machine learning represents a distinctive setting in which the stochastic gradient (SG) method has traditionally played a central role while conventional gradient-based nonlinear optimization techniques typically falter. Based on this viewpoint, we present a comprehensive theory of a straightforward, yet versatile SG algorithm, discuss its practical behavior, and highlight opportunities for designing algorithms with improved performance. This leads to a discussion about the next generation of optimization methods for large-scale machine learning, including an investigation of two main streams of research on techniques that diminish noise in the stochastic directions and methods that make use of second-order derivative approximations.
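As an illustration of the SG method discussed in the abstract (a minimal sketch, not the paper's own pseudocode), the basic update samples one data point per iteration and steps along its negative gradient. The toy least-squares problem and all names below are hypothetical; a constant stepsize is used because the toy data are noiseless at the minimizer, whereas the noisy setting analyzed in the paper calls for diminishing stepsizes.

```python
import random

def sgd(grad, w0, n, alpha=0.1, steps=500, seed=0):
    """Basic SG iteration: one randomly sampled gradient per step."""
    rng = random.Random(seed)
    w = w0
    for _ in range(steps):
        i = rng.randrange(n)        # pick one sample uniformly at random
        w -= alpha * grad(w, i)     # step along its negative gradient
    return w

# Toy least-squares data with y_i = 2 * x_i, so the minimizer is w* = 2.
xs = [0.5, 1.0, 1.5, 2.0]
ys = [2.0 * x for x in xs]

# Gradient of the single-sample loss (w * x_i - y_i)^2 with respect to w.
per_sample_grad = lambda w, i: 2.0 * xs[i] * (w * xs[i] - ys[i])

w_hat = sgd(per_sample_grad, w0=0.0, n=len(xs))  # converges to about 2.0
```

Because each iteration touches only one sample, the per-step cost is independent of the dataset size, which is the property that makes SG attractive in the large-scale regime the paper studies.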

[Charts: Top-30 citing journals and top-30 citing publishers]
  • We do not take into account publications without a DOI.
  • Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
  • Statistics recalculated weekly.
