Open Access
Lecture Notes in Computer Science, pages 98-110
Recognition of Hand-Drawn Hydrocarbon Structure Formulas Using Anchor-Free Detector
Jia-Jun Tao
1
,
Wei Liu
1
,
Xiaowang Peng
1
,
Xianyu He
1
,
Yanghong Luo
1
Publication type: Book Chapter
Publication date: 2024-11-12
Journal:
Lecture Notes in Computer Science
scimago Q2
SJR: 0.606
CiteScore: 2.6
Impact factor: —
ISSN: 03029743, 16113349, 18612075, 18612083
Abstract
The recognition of hand-drawn chemical molecular formulas is crucial for applications such as electronic note-taking and automated grading. Despite the challenges posed by stylistic variations in hand-drawn chemical structure diagrams, we introduce a novel recognition algorithm for hand-drawn hydrocarbon molecular formulas using anchor-free object detection methods. First, we employ an anchor-free detector based on irregular quadrilaterals to identify all potential chemical bonds in input images. By analyzing the collision relationships between these bonds, we then reconstruct all unspecified carbon atoms and assemble them into an adjacency matrix. Finally, we use the RDKit to convert the adjacency matrix into a SMILES string. Notably, our method does not rely on the SMILES string used during training, thereby enabling it to recognize previously unseen hydrocarbons. To verify the effectiveness of the algorithm, we collected a dataset containing 4,217 hand-drawn hydrocarbon molecular structures. Using RepVGG-A0 at a
$$512\,\times \,512$$
resolution, our algorithm achieved a recognition accuracy of 85.86%.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.