The effect of feature normalization methods in radiomics
Objectives
In radiomics, different feature normalization methods, such as z-Score or Min–Max, are currently utilized, but their specific impact on the model is unclear. We aimed to measure their effect on the predictive performance and the feature selection.
Methods
We employed fifteen publicly available radiomics datasets to compare seven normalization methods. Using four feature selection and classifier methods, we used cross-validation to measure the area under the curve (AUC) of the resulting models, the agreement of selected features, and the model calibration. In addition, we assessed whether normalization before cross-validation introduces bias.
Results
On average, the difference between the normalization methods was relatively small, with a gain of at most + 0.012 in AUC when comparing the z-Score (mean AUC: 0.707 ± 0.102) to no normalization (mean AUC: 0.719 ± 0.107). However, on some datasets, the difference reached + 0.051. The z-Score performed best, while the tanh transformation showed the worst performance and even decreased the overall predictive performance. While quantile transformation performed, on average, slightly worse than the z-Score, it outperformed all other methods on one out of three datasets. The agreement between the features selected by different normalization methods was only mild, reaching at most 62%. Applying the normalization before cross-validation did not introduce significant bias.
Conclusion
The choice of the feature normalization method influenced the predictive performance but depended strongly on the dataset. It strongly impacted the set of selected features.
Critical relevance statement
Feature normalization plays a crucial role in the preprocessing and influences the predictive performance and the selected features, complicating feature interpretation.
Key points
• The impact of feature normalization methods on radiomic models was measured.
• Normalization methods performed similarly on average, but differed more strongly on some datasets.
• Different methods led to different sets of selected features, impeding feature interpretation.
• Model calibration was not largely affected by the normalization method.
Graphical Abstract
Top-30
Journals
|
1
2
3
|
|
|
European Radiology
3 publications, 5.88%
|
|
|
Scientific Reports
2 publications, 3.92%
|
|
|
Respiratory Research
1 publication, 1.96%
|
|
|
Bioengineering
1 publication, 1.96%
|
|
|
Oral Oncology Reports
1 publication, 1.96%
|
|
|
Computers in Biology and Medicine
1 publication, 1.96%
|
|
|
Electronics (Switzerland)
1 publication, 1.96%
|
|
|
Journal of Neuromuscular Diseases
1 publication, 1.96%
|
|
|
European Journal of Radiology
1 publication, 1.96%
|
|
|
Ecology and Evolution
1 publication, 1.96%
|
|
|
International Journal of Imaging Systems and Technology
1 publication, 1.96%
|
|
|
BMC Cancer
1 publication, 1.96%
|
|
|
Frontiers in Neurology
1 publication, 1.96%
|
|
|
Journal of Orthopaedic Surgery and Research
1 publication, 1.96%
|
|
|
Computer Methods and Programs in Biomedicine
1 publication, 1.96%
|
|
|
NeuroImage
1 publication, 1.96%
|
|
|
Astrophysical Journal, Supplement Series
1 publication, 1.96%
|
|
|
Environmental Research
1 publication, 1.96%
|
|
|
Cogent Social Sciences
1 publication, 1.96%
|
|
|
Biomedical Signal Processing and Control
1 publication, 1.96%
|
|
|
Strahlentherapie und Onkologie
1 publication, 1.96%
|
|
|
BMC Medical Imaging
1 publication, 1.96%
|
|
|
Neurocomputing
1 publication, 1.96%
|
|
|
Reliability Engineering and System Safety
1 publication, 1.96%
|
|
|
TrAC - Trends in Analytical Chemistry
1 publication, 1.96%
|
|
|
Zeitschrift fur Medizinische Physik
1 publication, 1.96%
|
|
|
European radiology experimental
1 publication, 1.96%
|
|
|
Clinical Radiology
1 publication, 1.96%
|
|
|
Kocatepe Tıp Dergisi
1 publication, 1.96%
|
|
|
1
2
3
|
Publishers
|
2
4
6
8
10
12
14
16
18
|
|
|
Elsevier
17 publications, 33.33%
|
|
|
Springer Nature
14 publications, 27.45%
|
|
|
MDPI
6 publications, 11.76%
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
5 publications, 9.8%
|
|
|
Wiley
2 publications, 3.92%
|
|
|
SAGE
1 publication, 1.96%
|
|
|
Frontiers Media S.A.
1 publication, 1.96%
|
|
|
Cold Spring Harbor Laboratory
1 publication, 1.96%
|
|
|
American Astronomical Society
1 publication, 1.96%
|
|
|
Taylor & Francis
1 publication, 1.96%
|
|
|
Afyon Kocatepe University
1 publication, 1.96%
|
|
|
AME Publishing Company
1 publication, 1.96%
|
|
|
2
4
6
8
10
12
14
16
18
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.