Servicios Personalizados
Revista
Articulo
Indicadores
- Citado por SciELO
- Accesos
Links relacionados
- Similares en SciELO
Compartir
Polibits
versión On-line ISSN 1870-9044
Polibits no.40 México jul./dic. 2009
Special section: Information Retrieval and Natural Language Processing
Using Sense Clustering for the Disambiguation of Words
Henry AnayaSánchez1, Aurora PonsPorrata1, and Rafael BerlangaLlavori2
1 Center for Pattern Recognition and Data Mining, Universidad de Oriente, Santiago de Cuba, Cuba. (henry@cepramid.co.cu, aurora@cepramid.co.cu).
2 Department of Langauges and Computer Systems, Universitat Jaume I, Castello, Spain. (berlanga@lsi.uji.es).
Manuscript received November 4, 2008.
Manuscript accepted for publication August 28, 2009.
Abstract
Clustering methods have been extensively used in the solution of many Information Processing tasks in order to capture unknown object categories. This paper presents an approach to Word Sense Disambiguation based on clustering. The underlying idea is that the clustering of word senses provides a useful way to discover semantically related senses. We evaluate our proposal regarding both fine and coarsegrained disambiguation. Experimental results over Senseval3 allwords, SemCor 2.0 and SemEval2007 corpora are presented. Promising values of precision and recall are obtained.
Key words: Word sense disambiguation, clustering.
DESCARGAR ARTÍCULO EN FORMATO PDF
REFERENCES
[1] E. Agirre and O. López, "Clustering wordnet word senses," in Proceedings of the Conference on Recent Advances on Natural Language Processing, Bulgary, 2003, pp. 121130. [ Links ]
[2] E. Agirre and G. Rigau, "Word Sense Disambiguation Using Conceptual Density," in Proceedings of the 16th Conference on Computational Linguistic, Vol. 1, Denmark, 1996, pp. 1622. [ Links ]
[3] S. Bordag, "Word Sense Induction: TripletBased Clustering and Automatic Evaluation," in 11st Conference of the European Chapter ofthe Association for Computational Linguistic, Italy, 2006. [ Links ]
[4] D. Buscaldi and P. Rosso, "UPVWSD: Combining different WSD Methods by means of Fuzzy Borda Voting," in Proceedings of the 4th International Workshop on Semantic Evaluations, Association for Computational Linguistic, Prage, 2007, pp. 434437. [ Links ]
[5] Y. Chali and S. R. Joty, "UofL: Word Sense Disambiguation Using Lexical Cohesion," in Proceedings ofthe 4th International Workshop on Semantic Evaluations, Association for Computational Linguistic, Prage, 2007, pp. 476479. [ Links ]
[6] D. FernándezAmorós, J. Gonzalo and F. Verdejo, "The Role of Conceptual Relations in Word Sense Disambiguation," in Proceedings ofthe 6th International Workshop on Applications of Natural Language for Information Systems, Spain, 2001, pp. 8798. [ Links ]
[7] R. GilGarcía, J. M. BadiaContelles and A. PonsPorrata, "Extended Star Clustering Algorithm," Progress in Pattern Recognition, Speech and Image Analysis, Lecture Notes on Computer Sciences, Vol. 2905, SpringerVerlag, 2003, pp. 480487. [ Links ]
[8] N. Ide and J. Veronis, "Word Sense Disambiguation: The State of the Art," Computational Linguistics 24:1, 1998, pp. 140. [ Links ]
[9] R. Ion and D. Tufis, "RACAI: Meaning Affinity Models," in Proceedings of the 4th International Workshop on Semantic Evaluations, Association for Computational Linguistic, Prage, 2007, pp. 277281. [ Links ]
[10] R. Koeling and D. McCarthy, "Sussx: WSD using Automatically Acquired Predominant Senses," in Proceedings of the 4th International Workshop on Semantic Evaluations, Association for Computational Linguistic, Prage, 2007, pp. 314317. [ Links ]
[11] M. Lesk, "Automatic Sense Disambiguation Using Machine Readable Dictionaries: How to Tell a Pine Cone from an Ice Cream Cone," in Proceedings of the 5th Annual International Conference on Systems Documentation, Canada, 1986, pp. 2426. [ Links ]
[12] C.Y. Lin and E. Hovy, "The Automated Acquisition of Topic Signatures for Text Summarization," in Proceedings of the COLING Conference, France, 2000, pp. 495501. [ Links ]
[13] R. Mihalcea and D.I. Moldovan, "EZ. WordNet: Principles for Automatic Generation of a Coarse Grained WordNet," in Proceedings of the FLAIRS Conference, Florida, 2001, pp. 454458. [ Links ]
[14] G. Miller, "WordNet: A Lexical Database for English," Communications of the ACM 38:11, 1995, pp. 3941. [ Links ]
[15] A. Montoyo, A. Suárez, G. Rigau and M. Palomar, "Combining Knowledge and Corpusbased WordSenseDisambiguation Methods," Journal of Artificial Intelligence Research 23, 2005, pp. 299330. [ Links ]
[16] R. Navigli, K.C. Litkowski and O. Hargraves, "SemEval2007 Task 07: CoarseGrained English AllWords Task," in Proceedings of the 4th International Workshop on Semantic Evaluations, Association for Computational Linguistic, Prage, 2007, pp. 3035. [ Links ]
[17] C. Niu, W. Li, R. K. Srihari, H. Li and L. Crist, "Context Clustering for Word Sense Disambiguation Based on Modeling Pairwise Context Similarities," in SENSEVAL3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Spain, 2004, pp. 187190. [ Links ]
[18] T. Pedersen, A. Purandare and A. Kulkarni, "Name Discrimination by Clustering Similar Contexts," in Proceedings of the 6th International Conference on Computational Linguistics and Intelligent Text Processing, Mexico, 2005, pp. 226237. [ Links ]
[19] G. Salton, A. Wong and C. S. Yang, "A Vector Space Model for Information Retrieval," Journal ofthe American Society for Information Science 18:11, 1975, pp. 613620. [ Links ]
[20] B. Snyder and M. Palmer, "The English allwords task," in Proceedings of the third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Spain, 2004, pp. 4143. [ Links ]
[21] G. Udani, S. Dave, A. Davis and T. Sibley, "Noun Sense Induction Using Web Search Results," in Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Brazil, 2005, pp. 657658. [ Links ]