SIAM Review, volume 60, issue 2, pages 223-311
Optimization Methods for Large-Scale Machine Learning
Léon Bottou
1
,
Frank E. Curtis
2
,
Jorge Nocedal
3
Publication type: Journal Article
Publication date: 2018-05-03
Journal:
SIAM Review
scimago Q1
SJR: 2.900
CiteScore: 16.9
Impact factor: 10.8
ISSN: 00361445, 10957200
Computational Mathematics
Applied Mathematics
Theoretical Computer Science
Abstract
This paper provides a review and commentary on the past, present, and future of numerical optimization algorithms in the context of machine learning applications. Through case studies on text classification and the training of deep neural networks, we discuss how optimization problems arise in machine learning and what makes them challenging. A major theme of our study is that large-scale machine learning represents a distinctive setting in which the stochastic gradient (SG) method has traditionally played a central role while conventional gradient-based nonlinear optimization techniques typically falter. Based on this viewpoint, we present a comprehensive theory of a straightforward, yet versatile SG algorithm, discuss its practical behavior, and highlight opportunities for designing algorithms with improved performance. This leads to a discussion about the next generation of optimization methods for large-scale machine learning, including an investigation of two main streams of research on techniques th...
Found
Found
Top-30
Journals
5
10
15
20
25
30
35
40
|
|
Lecture Notes in Computer Science
37 publications, 2.31%
|
|
Computational Optimization and Applications
32 publications, 2%
|
|
IEEE Access
30 publications, 1.88%
|
|
IEEE Transactions on Signal Processing
28 publications, 1.75%
|
|
SIAM Journal on Optimization
25 publications, 1.56%
|
|
IEEE Transactions on Neural Networks and Learning Systems
24 publications, 1.5%
|
|
IEEE Transactions on Automatic Control
21 publications, 1.31%
|
|
IEEE Transactions on Wireless Communications
20 publications, 1.25%
|
|
Inverse Problems
19 publications, 1.19%
|
|
IEEE Transactions on Mobile Computing
19 publications, 1.19%
|
|
Journal of Optimization Theory and Applications
18 publications, 1.13%
|
|
Optimization Methods and Software
17 publications, 1.06%
|
|
IEEE Journal on Selected Areas in Communications
16 publications, 1%
|
|
Mathematical Programming
16 publications, 1%
|
|
Neurocomputing
16 publications, 1%
|
|
IEEE Transactions on Pattern Analysis and Machine Intelligence
15 publications, 0.94%
|
|
IEEE Internet of Things Journal
13 publications, 0.81%
|
|
Journal of Computational and Applied Mathematics
10 publications, 0.63%
|
|
Journal of the Operations Research Society of China
9 publications, 0.56%
|
|
Optimization Letters
9 publications, 0.56%
|
|
Information Sciences
9 publications, 0.56%
|
|
Neural Networks
9 publications, 0.56%
|
|
Mathematics of Operations Research
8 publications, 0.5%
|
|
Operations Research
8 publications, 0.5%
|
|
Journal of Computational Physics
8 publications, 0.5%
|
|
ACM Computing Surveys
7 publications, 0.44%
|
|
SIAM Journal of Scientific Computing
7 publications, 0.44%
|
|
Applied Soft Computing Journal
7 publications, 0.44%
|
|
IEEE Transactions on Communications
7 publications, 0.44%
|
|
5
10
15
20
25
30
35
40
|
Publishers
100
200
300
400
500
600
|
|
Institute of Electrical and Electronics Engineers (IEEE)
516 publications, 32.25%
|
|
Springer Nature
323 publications, 20.19%
|
|
Elsevier
212 publications, 13.25%
|
|
MDPI
60 publications, 3.75%
|
|
Association for Computing Machinery (ACM)
59 publications, 3.69%
|
|
Society for Industrial and Applied Mathematics (SIAM)
54 publications, 3.38%
|
|
Taylor & Francis
46 publications, 2.88%
|
|
Cambridge University Press
40 publications, 2.5%
|
|
Wiley
33 publications, 2.06%
|
|
IOP Publishing
30 publications, 1.88%
|
|
Institute for Operations Research and the Management Sciences (INFORMS)
24 publications, 1.5%
|
|
Oxford University Press
13 publications, 0.81%
|
|
Frontiers Media S.A.
13 publications, 0.81%
|
|
Cold Spring Harbor Laboratory
13 publications, 0.81%
|
|
American Institute of Mathematical Sciences (AIMS)
11 publications, 0.69%
|
|
American Physical Society (APS)
10 publications, 0.63%
|
|
World Scientific
8 publications, 0.5%
|
|
Public Library of Science (PLoS)
7 publications, 0.44%
|
|
AIP Publishing
6 publications, 0.38%
|
|
Hindawi Limited
6 publications, 0.38%
|
|
SAGE
5 publications, 0.31%
|
|
Institute of Mathematical Statistics
5 publications, 0.31%
|
|
EDP Sciences
4 publications, 0.25%
|
|
Tech Science Press
4 publications, 0.25%
|
|
American Mathematical Society
3 publications, 0.19%
|
|
Copernicus
3 publications, 0.19%
|
|
Emerald
3 publications, 0.19%
|
|
Science in China Press
3 publications, 0.19%
|
|
American Chemical Society (ACS)
3 publications, 0.19%
|
|
100
200
300
400
500
600
|
- We do not take into account publications without a DOI.
- Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Cite this
GOST |
RIS |
BibTex |
MLA
Cite this
GOST
Copy
Bottou L., Curtis F. E., Nocedal J. Optimization Methods for Large-Scale Machine Learning // SIAM Review. 2018. Vol. 60. No. 2. pp. 223-311.
GOST all authors (up to 50)
Copy
Bottou L., Curtis F. E., Nocedal J. Optimization Methods for Large-Scale Machine Learning // SIAM Review. 2018. Vol. 60. No. 2. pp. 223-311.
Cite this
RIS
Copy
TY - JOUR
DO - 10.1137/16M1080173
UR - https://doi.org/10.1137/16M1080173
TI - Optimization Methods for Large-Scale Machine Learning
T2 - SIAM Review
AU - Bottou, Léon
AU - Curtis, Frank E.
AU - Nocedal, Jorge
PY - 2018
DA - 2018/05/03
PB - Society for Industrial and Applied Mathematics (SIAM)
SP - 223-311
IS - 2
VL - 60
SN - 0036-1445
SN - 1095-7200
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2018_Bottou,
author = {Léon Bottou and Frank E. Curtis and Jorge Nocedal},
title = {Optimization Methods for Large-Scale Machine Learning},
journal = {SIAM Review},
year = {2018},
volume = {60},
publisher = {Society for Industrial and Applied Mathematics (SIAM)},
month = {may},
url = {https://doi.org/10.1137/16M1080173},
number = {2},
pages = {223--311},
doi = {10.1137/16M1080173}
}
Cite this
MLA
Copy
Bottou, Léon, et al. “Optimization Methods for Large-Scale Machine Learning.” SIAM Review, vol. 60, no. 2, May. 2018, pp. 223-311. https://doi.org/10.1137/16M1080173.
Found error?
Found error?
Journal
scimago Q1
SJR
2.900
CiteScore
16.9
Impact factor
10.8
ISSN
00361445
(Print)
10957200
(Electronic)