An overview of large AI models and their applications
In recent years, large-scale artificial intelligence (AI) models have become a focal point in technology, attracting widespread attention and acclaim. Notable examples include Google’s BERT and OpenAI’s GPT, which have scaled their parameter sizes to hundreds of billions or even tens of trillions. This growth has been accompanied by a significant increase in the amount of training data, significantly improving the capabilities and performance of these models. Unlike previous reviews, this paper provides a comprehensive discussion of the algorithmic principles of large-scale AI models and their industrial applications from multiple perspectives. We first outline the evolutionary history of these models, highlighting milestone algorithms while exploring their underlying principles and core technologies. We then evaluate the challenges and limitations of large-scale AI models, including computational resource requirements, model parameter inflation, data privacy concerns, and specific issues related to multi-modal AI models, such as reliance on text-image pairs, inconsistencies in understanding and generation capabilities, and the lack of true “multi-modality”. Various industrial applications of these models are also presented. Finally, we discuss future trends, predicting further expansion of model scale and the development of cross-modal fusion. This study provides valuable insights to inform and inspire future future research and practice.
Top-30
Journals
|
1
2
|
|
|
Advances in Computational Intelligence and Robotics
2 publications, 7.14%
|
|
|
FlatChem
1 publication, 3.57%
|
|
|
Agriculture (Switzerland)
1 publication, 3.57%
|
|
|
Automation in Construction
1 publication, 3.57%
|
|
|
Transactions on Emerging Telecommunications Technologies
1 publication, 3.57%
|
|
|
Diagnosis
1 publication, 3.57%
|
|
|
Electronics (Switzerland)
1 publication, 3.57%
|
|
|
Machine Learning with Applications
1 publication, 3.57%
|
|
|
Lecture Notes in Computer Science
1 publication, 3.57%
|
|
|
International Journal of Geographical Information Science
1 publication, 3.57%
|
|
|
International Journal of Advanced Manufacturing Technology
1 publication, 3.57%
|
|
|
Array
1 publication, 3.57%
|
|
|
Ad Hoc Networks
1 publication, 3.57%
|
|
|
Journal of Physics and Chemistry of Solids
1 publication, 3.57%
|
|
|
Discover Oncology
1 publication, 3.57%
|
|
|
Frontiers in Public Health
1 publication, 3.57%
|
|
|
Journal of the World Federation of Orthodontists
1 publication, 3.57%
|
|
|
Pattern Recognition
1 publication, 3.57%
|
|
|
Visual Intelligence
1 publication, 3.57%
|
|
|
Journal of Supercomputing
1 publication, 3.57%
|
|
|
Earth Systems and Environment
1 publication, 3.57%
|
|
|
1
2
|
Publishers
|
1
2
3
4
5
6
7
8
|
|
|
Elsevier
8 publications, 28.57%
|
|
|
Springer Nature
6 publications, 21.43%
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
5 publications, 17.86%
|
|
|
MDPI
2 publications, 7.14%
|
|
|
IGI Global
2 publications, 7.14%
|
|
|
Wiley
1 publication, 3.57%
|
|
|
Walter de Gruyter
1 publication, 3.57%
|
|
|
Taylor & Francis
1 publication, 3.57%
|
|
|
Frontiers Media S.A.
1 publication, 3.57%
|
|
|
Association for Computing Machinery (ACM)
1 publication, 3.57%
|
|
|
1
2
3
4
5
6
7
8
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.