Domain classification of technical terms using the Web
Publication type: Journal Article
Publication date: 2007-09-24
Hardware and Architecture
Computational Theory and Mathematics
Information Systems
Theoretical Computer Science
Abstract
This paper proposes a method of domain classification of technical terms using the Web. In the proposed method, it is assumed that, for a certain technical domain, a list of known technical terms of the domain is given. Technical documents of the domain are collected through the Web search engine, which are then used for generating a vector space model for the domain. The domain specificity of a target term is estimated according to the distribution of the domain of the sample pages of the target term. Experimental evaluation results show that the proposed method of domain classification of a technical term achieved mostly 90% precision/recall. We then apply this technique of estimating domain specificity of a term to the task of discovering novel technical terms that are not included in any existing lexicons of technical terms of the domain. Out of 1000 randomly selected candidates of technical terms per domain, we discovered about 100 to 200 novel technical terms. © 2007 Wiley Periodicals, Inc. Syst Comp Jpn, 38(14): 11–19, 2007; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/scj.20852
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Top-30
Journals
|
1
2
|
|
|
Terminology
2 publications, 28.57%
|
|
|
Online Information Review
1 publication, 14.29%
|
|
|
Language Resources and Evaluation
1 publication, 14.29%
|
|
|
IEICE Transactions on Information and Systems
1 publication, 14.29%
|
|
|
1
2
|
Publishers
|
1
2
|
|
|
John Benjamins Publishing Company
2 publications, 28.57%
|
|
|
Emerald
1 publication, 14.29%
|
|
|
Springer Nature
1 publication, 14.29%
|
|
|
1 publication, 14.29%
|
|
|
1
2
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
7
Total citations:
7
Citations from 2024:
0
Cite this
GOST |
RIS |
BibTex |
MLA
Cite this
GOST
Copy
Kida M. et al. Domain classification of technical terms using the Web // Systems and Computers in Japan. 2007. Vol. 38. No. 14. pp. 11-19.
GOST all authors (up to 50)
Copy
Kida M., Tonoike M., Utsuro T., Sato S. Domain classification of technical terms using the Web // Systems and Computers in Japan. 2007. Vol. 38. No. 14. pp. 11-19.
Cite this
RIS
Copy
TY - JOUR
DO - 10.1002/scj.20852
UR - https://doi.org/10.1002/scj.20852
TI - Domain classification of technical terms using the Web
T2 - Systems and Computers in Japan
AU - Kida, Mitsuhiro
AU - Tonoike, Masatsugu
AU - Utsuro, Takehito
AU - Sato, Satoshi
PY - 2007
DA - 2007/09/24
PB - Wiley
SP - 11-19
IS - 14
VL - 38
SN - 0882-1666
SN - 1520-684X
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2007_Kida,
author = {Mitsuhiro Kida and Masatsugu Tonoike and Takehito Utsuro and Satoshi Sato},
title = {Domain classification of technical terms using the Web},
journal = {Systems and Computers in Japan},
year = {2007},
volume = {38},
publisher = {Wiley},
month = {sep},
url = {https://doi.org/10.1002/scj.20852},
number = {14},
pages = {11--19},
doi = {10.1002/scj.20852}
}
Cite this
MLA
Copy
Kida, Mitsuhiro, et al. “Domain classification of technical terms using the Web.” Systems and Computers in Japan, vol. 38, no. 14, Sep. 2007, pp. 11-19. https://doi.org/10.1002/scj.20852.