том 26 издание 4 страницы 253-267

AI-driven extraction and intelligent retrieval of missionary archives in Malabar: advancing preservation and accessibility with machine learning

Тип публикацииJournal Article
Дата публикации2025-09-08
SCImago Q1
Tоп 10% SCImago
WOS Q2
БС2
SJR0.326
CiteScore1.9
Impact factor1.9
ISSN14678047, 17586925
Краткое описание
Purpose

This study shows how AI improves the transcription, indexing and searchability of historical documents by utilizing AI-driven Optical Character Recognition (OCR), Handwritten Text Recognition (HTR), Named Entity Recognition (NER), machine learning-based classification and transformer-based retrieval models.

Design/methodology/approach

This study uses a computational archival science approach to analyze missionary records in Malabar by combining machine learning-based text recognition, natural language processing (NLP), document classification and AI-powered retrieval models.

Findings

The findings show that AI and ML significantly improve the speed, performance and efficiency of archival digitization. OCR achieves up to 97.5% performance for modern printed texts, while HTR models exceed 92.5% for structured handwriting, demonstrating the efficacy of deep learning in text recognition. NER models successfully extract missionary names (91.3% F1-score) and locations (90.0% F1-score), whereas classification models such as Random Forest achieve the performance of 89.3% when categorizing archival documents, and bidirectional encoder representations from transformers (BERT)-based search engines scoring 93.5% Precision@10 and 91.2% Recall@10, demonstrating their superior ability to retrieve relevant archival records. Precision@10 means that out of the top ten retrieved results, 93.5% are relevant, while Recall@10 indicates that 91.2% of all relevant results were found within the top ten retrieved results.

Originality/value

This study presents a novel integration of AI and machine learning for the systematic extraction, classification and retrieval of historical missionary records, bridging the gap between historical preservation and computational intelligence.

Вы ученый?

Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.
 Войти с ORCID
Метрики
0
Поделиться
Цитировать
ГОСТ |
Цитировать
Kaluvilla B. B., Kalarikkal S. A., Thamilvanan G. AI-driven extraction and intelligent retrieval of missionary archives in Malabar: advancing preservation and accessibility with machine learning // Performance Measurement and Metrics. 2025. Vol. 26. No. 4. pp. 253-267.
ГОСТ со всеми авторами (до 50) Скопировать
Kaluvilla B. B., Kalarikkal S. A., Thamilvanan G. AI-driven extraction and intelligent retrieval of missionary archives in Malabar: advancing preservation and accessibility with machine learning // Performance Measurement and Metrics. 2025. Vol. 26. No. 4. pp. 253-267.
RIS |
Цитировать
TY - JOUR
DO - 10.1108/pmm-02-2025-0008
UR - https://www.emerald.com/pmm/article/doi/10.1108/PMM-02-2025-0008/1278374/AI-driven-extraction-and-intelligent-retrieval-of
TI - AI-driven extraction and intelligent retrieval of missionary archives in Malabar: advancing preservation and accessibility with machine learning
T2 - Performance Measurement and Metrics
AU - Kaluvilla, Bincy Baburaj
AU - Kalarikkal, Subhash Abel
AU - Thamilvanan, G.
PY - 2025
DA - 2025/09/08
PB - Emerald
SP - 253-267
IS - 4
VL - 26
SN - 1467-8047
SN - 1758-6925
ER -
BibTex |
Цитировать
BibTex (до 50 авторов) Скопировать
@article{2025_Kaluvilla,
author = {Bincy Baburaj Kaluvilla and Subhash Abel Kalarikkal and G. Thamilvanan},
title = {AI-driven extraction and intelligent retrieval of missionary archives in Malabar: advancing preservation and accessibility with machine learning},
journal = {Performance Measurement and Metrics},
year = {2025},
volume = {26},
publisher = {Emerald},
month = {sep},
url = {https://www.emerald.com/pmm/article/doi/10.1108/PMM-02-2025-0008/1278374/AI-driven-extraction-and-intelligent-retrieval-of},
number = {4},
pages = {253--267},
doi = {10.1108/pmm-02-2025-0008}
}
MLA
Цитировать
Kaluvilla, Bincy Baburaj, et al. “AI-driven extraction and intelligent retrieval of missionary archives in Malabar: advancing preservation and accessibility with machine learning.” Performance Measurement and Metrics, vol. 26, no. 4, Sep. 2025, pp. 253-267. https://www.emerald.com/pmm/article/doi/10.1108/PMM-02-2025-0008/1278374/AI-driven-extraction-and-intelligent-retrieval-of.
Ошибка в публикации?