Validating neural networks for spectroscopic classification on a universal synthetic dataset
To aid the development of machine learning models for automated spectroscopic data classification, we created a universal synthetic dataset for validating their performance. The dataset mimics the characteristic appearance of experimental measurements from techniques such as X-ray diffraction, nuclear magnetic resonance, and Raman spectroscopy, among others. We applied eight neural network architectures to classify the artificial spectra and evaluated their ability to handle common experimental artifacts. While all models achieved over 98% accuracy on the synthetic dataset, misclassifications occurred when spectra had overlapping peaks or intensities. We found that non-linear activation functions, specifically ReLU in the fully-connected layers, were crucial for distinguishing between these classes, whereas more sophisticated components, such as residual blocks or normalization layers, provided no performance benefit. Based on these findings, we summarize key design principles for neural networks in spectroscopic data classification and publicly share all scripts used in this study.
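To make the idea of a synthetic spectroscopic dataset concrete, the following is a minimal sketch and not the authors' generator. It assumes that each class is defined by a fixed set of Gaussian peaks, and that individual samples differ by small random shifts in peak position, height, and width plus additive noise. The function names (`make_class_templates`, `sample_spectrum`, `make_dataset`), the Gaussian peak shape, and all parameter ranges are illustrative assumptions.

```python
import numpy as np


def make_class_templates(n_classes, n_peaks, rng):
    """Draw one set of peak positions, heights, and widths per class (assumed ranges)."""
    positions = rng.uniform(0.1, 0.9, size=(n_classes, n_peaks))
    heights = rng.uniform(0.2, 1.0, size=(n_classes, n_peaks))
    widths = rng.uniform(0.005, 0.02, size=(n_classes, n_peaks))
    return positions, heights, widths


def sample_spectrum(positions, heights, widths, x, rng, shift=0.01, noise=0.01):
    """Generate one noisy spectrum from a class template."""
    # Perturb the template to imitate sample-to-sample experimental variation.
    pos = positions + rng.normal(0.0, shift, size=positions.shape)
    hts = heights * rng.uniform(0.8, 1.2, size=heights.shape)
    wid = widths * rng.uniform(0.8, 1.2, size=widths.shape)
    # Sum of Gaussian peaks evaluated on the common x grid.
    peaks = hts[:, None] * np.exp(-0.5 * ((x[None, :] - pos[:, None]) / wid[:, None]) ** 2)
    y = peaks.sum(axis=0) + rng.normal(0.0, noise, size=x.shape)
    # Scale so that the strongest point has intensity 1.
    return y / y.max()


def make_dataset(n_classes=5, n_per_class=10, n_peaks=4, n_points=500, seed=0):
    """Return (spectra, labels) with shapes (n_classes * n_per_class, n_points) and (n_classes * n_per_class,)."""
    rng = np.random.default_rng(seed)
    x = np.linspace(0.0, 1.0, n_points)
    positions, heights, widths = make_class_templates(n_classes, n_peaks, rng)
    spectra, labels = [], []
    for c in range(n_classes):
        for _ in range(n_per_class):
            spectra.append(sample_spectrum(positions[c], heights[c], widths[c], x, rng))
            labels.append(c)
    return np.asarray(spectra), np.asarray(labels)


if __name__ == "__main__":
    X, y = make_dataset()
    print(X.shape, y.shape)
```

Because the class templates in this sketch are drawn independently, two classes can end up with nearly coinciding peak positions or heights, which is the kind of overlap the abstract associates with misclassifications.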
Top-30
Journals
- npj Computational Materials: 2 publications, 8.33%
- Scientific Reports: 2 publications, 8.33%
- Advanced Intelligent Systems: 1 publication, 4.17%
- Machine Learning: Science and Technology: 1 publication, 4.17%
- Journal of the American Chemical Society: 1 publication, 4.17%
- Food Bioscience: 1 publication, 4.17%
- Journal of the Franklin Institute: 1 publication, 4.17%
- Physical Review A: 1 publication, 4.17%
- Lecture Notes in Computer Science: 1 publication, 4.17%
- Environmental Earth Sciences: 1 publication, 4.17%
- Mendeleev Communications: 1 publication, 4.17%
- Journal of Food Measurement and Characterization: 1 publication, 4.17%
- Advanced Functional Materials: 1 publication, 4.17%
- Chemical Society Reviews: 1 publication, 4.17%
- Journal of Chemical Information and Modeling: 1 publication, 4.17%
- Agriculture (Switzerland): 1 publication, 4.17%
- Digital Discovery: 1 publication, 4.17%
- ACS Measurement Science Au: 1 publication, 4.17%
- Sensors & Diagnostics: 1 publication, 4.17%
- Measurement: Journal of the International Measurement Confederation: 1 publication, 4.17%
- Journal of Cleaner Production: 1 publication, 4.17%
- Chemie-Ingenieur-Technik: 1 publication, 4.17%
Publishers
- Springer Nature: 7 publications, 29.17%
- Elsevier: 4 publications, 16.67%
- Wiley: 3 publications, 12.5%
- American Chemical Society (ACS): 3 publications, 12.5%
- Royal Society of Chemistry (RSC): 3 publications, 12.5%
- IOP Publishing: 1 publication, 4.17%
- American Physical Society (APS): 1 publication, 4.17%
- OOO Zhurnal "Mendeleevskie Soobshcheniya": 1 publication, 4.17%
- MDPI: 1 publication, 4.17%
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.