Issue |
EPL
Volume 98, Number 5, June 2012
|
|
---|---|---|
Article Number | 58001 | |
Number of page(s) | 6 | |
Section | Interdisciplinary Physics and Related Areas of Science and Technology | |
DOI | https://doi.org/10.1209/0295-5075/98/58001 | |
Published online | 28 May 2012 |
Word sense disambiguation via high order of learning in complex networks
1
Institute of Mathematics and Computer Science, University of São Paulo - P. O. Box 369, Postal Code 13560-970, São Carlos, São Paulo, Brazil
2
Institute of Physics of São Carlos, University of São Paulo - P. O. Box 369, Postal Code 13560-970, São Carlos, São Paulo, Brazil
a
thiagoch@icmc.usp.br
b
thiagochris@gmail.com
c
diego.amancio@usp.br
d
diegoraphael@gmail.com
Received:
11
April
2012
Accepted:
7
May
2012
Complex networks have been employed to model many real systems and as a modeling tool in a myriad of applications. In this paper, we use the framework of complex networks to the problem of supervised classification in the word disambiguation task, which consists in deriving a function from the supervised (or labeled) training data of ambiguous words. Traditional supervised data classification takes into account only topological or physical features of the input data. On the other hand, the human (animal) brain performs both low- and high-level orders of learning and it has facility to identify patterns according to the semantic meaning of the input data. In this paper, we apply a hybrid technique which encompasses both types of learning in the field of word sense disambiguation and show that the high-level order of learning can really improve the accuracy rate of the model. This evidence serves to demonstrate that the internal structures formed by the words do present patterns that, generally, cannot be correctly unveiled by only traditional techniques. Finally, we exhibit the behavior of the model for different weights of the low- and high-level classifiers by plotting decision boundaries. This study helps one to better understand the effectiveness of the model.
PACS: 89.75.Hc – Networks and genealogical trees / 89.20.Ff – Computer science and technology / 02.50.Sk – Multivariate analysis
© EPLA, 2012
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.