SciELO - Scientific Electronic Library Online

 
 número40English-to-Japanese Cross-Language Question-Answering System using Weighted Adding with Multiple AnswersImproving Named Entity Extraction Accuracy using Unlabeled Data and Several Extractors índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay artículos similaresSimilares en SciELO

Compartir


Polibits

versión On-line ISSN 1870-9044

Polibits  no.40 México jul./dic. 2009

 

Special section: Information Retrieval and Natural Language Processing

 

Using Sense Clustering for the Disambiguation of Words

 

Henry Anaya–Sánchez1, Aurora Pons–Porrata1, and Rafael Berlanga–Llavori2

 

1 Center for Pattern Recognition and Data Mining, Universidad de Oriente, Santiago de Cuba, Cuba. (henry@cepramid.co.cu, aurora@cepramid.co.cu).

2 Department of Langauges and Computer Systems, Universitat Jaume I, Castello, Spain. (berlanga@lsi.uji.es).

 

Manuscript received November 4, 2008.
Manuscript accepted for publication August 28, 2009.

 

Abstract

Clustering methods have been extensively used in the solution of many Information Processing tasks in order to capture unknown object categories. This paper presents an approach to Word Sense Disambiguation based on clustering. The underlying idea is that the clustering of word senses provides a useful way to discover semantically related senses. We evaluate our proposal regarding both fine– and coarse–grained disambiguation. Experimental results over Senseval–3 all–words, SemCor 2.0 and SemEval–2007 corpora are presented. Promising values of precision and recall are obtained.

Key words: Word sense disambiguation, clustering.

 

DESCARGAR ARTÍCULO EN FORMATO PDF

 

REFERENCES

[1] E. Agirre and O. López, "Clustering wordnet word senses," in Proceedings of the Conference on Recent Advances on Natural Language Processing, Bulgary, 2003, pp. 121–130.         [ Links ]

[2] E. Agirre and G. Rigau, "Word Sense Disambiguation Using Conceptual Density," in Proceedings of the 16th Conference on Computational Linguistic, Vol. 1, Denmark, 1996, pp. 16–22.         [ Links ]

[3] S. Bordag, "Word Sense Induction: Triplet–Based Clustering and Automatic Evaluation," in 11st Conference of the European Chapter ofthe Association for Computational Linguistic, Italy, 2006.         [ Links ]

[4] D. Buscaldi and P. Rosso, "UPV–WSD: Combining different WSD Methods by means of Fuzzy Borda Voting," in Proceedings of the 4th International Workshop on Semantic Evaluations, Association for Computational Linguistic, Prage, 2007, pp. 434–437.         [ Links ]

[5] Y. Chali and S. R. Joty, "UofL: Word Sense Disambiguation Using Lexical Cohesion," in Proceedings ofthe 4th International Workshop on Semantic Evaluations, Association for Computational Linguistic, Prage, 2007, pp. 476–479.         [ Links ]

[6] D. Fernández–Amorós, J. Gonzalo and F. Verdejo, "The Role of Conceptual Relations in Word Sense Disambiguation," in Proceedings ofthe 6th International Workshop on Applications of Natural Language for Information Systems, Spain, 2001, pp. 87–98.         [ Links ]

[7] R. Gil–García, J. M. Badia–Contelles and A. Pons–Porrata, "Extended Star Clustering Algorithm," Progress in Pattern Recognition, Speech and Image Analysis, Lecture Notes on Computer Sciences, Vol. 2905, Springer–Verlag, 2003, pp. 480–487.         [ Links ]

[8] N. Ide and J. Veronis, "Word Sense Disambiguation: The State of the Art," Computational Linguistics 24:1, 1998, pp. 1–40.         [ Links ]

[9] R. Ion and D. Tufis, "RACAI: Meaning Affinity Models," in Proceedings of the 4th International Workshop on Semantic Evaluations, Association for Computational Linguistic, Prage, 2007, pp. 277–281.         [ Links ]

[10] R. Koeling and D. McCarthy, "Sussx: WSD using Automatically Acquired Predominant Senses," in Proceedings of the 4th International Workshop on Semantic Evaluations, Association for Computational Linguistic, Prage, 2007, pp. 314–317.         [ Links ]

[11] M. Lesk, "Automatic Sense Disambiguation Using Machine Readable Dictionaries: How to Tell a Pine Cone from an Ice Cream Cone," in Proceedings of the 5th Annual International Conference on Systems Documentation, Canada, 1986, pp. 24–26.         [ Links ]

[12] C.–Y. Lin and E. Hovy, "The Automated Acquisition of Topic Signatures for Text Summarization," in Proceedings of the COLING Conference, France, 2000, pp. 495–501.         [ Links ]

[13] R. Mihalcea and D.I. Moldovan, "EZ. WordNet: Principles for Automatic Generation of a Coarse Grained WordNet," in Proceedings of the FLAIRS Conference, Florida, 2001, pp. 454–458.         [ Links ]

[14] G. Miller, "WordNet: A Lexical Database for English," Communications of the ACM 38:11, 1995, pp. 39–41.         [ Links ]

[15] A. Montoyo, A. Suárez, G. Rigau and M. Palomar, "Combining Knowledge– and Corpus–based Word–Sense–Disambiguation Methods," Journal of Artificial Intelligence Research 23, 2005, pp. 299–330.         [ Links ]

[16] R. Navigli, K.C. Litkowski and O. Hargraves, "SemEval–2007 Task 07: Coarse–Grained English All–Words Task," in Proceedings of the 4th International Workshop on Semantic Evaluations, Association for Computational Linguistic, Prage, 2007, pp. 30–35.         [ Links ]

[17] C. Niu, W. Li, R. K. Srihari, H. Li and L. Crist, "Context Clustering for Word Sense Disambiguation Based on Modeling Pairwise Context Similarities," in SENSEVAL–3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Spain, 2004, pp. 187–190.         [ Links ]

[18] T. Pedersen, A. Purandare and A. Kulkarni, "Name Discrimination by Clustering Similar Contexts," in Proceedings of the 6th International Conference on Computational Linguistics and Intelligent Text Processing, Mexico, 2005, pp. 226–237.         [ Links ]

[19] G. Salton, A. Wong and C. S. Yang, "A Vector Space Model for Information Retrieval," Journal ofthe American Society for Information Science 18:11, 1975, pp. 613–620.         [ Links ]

[20] B. Snyder and M. Palmer, "The English all–words task," in Proceedings of the third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Spain, 2004, pp. 41–43.         [ Links ]

[21] G. Udani, S. Dave, A. Davis and T. Sibley, "Noun Sense Induction Using Web Search Results," in Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Brazil, 2005, pp. 657–658.         [ Links ]

Creative Commons License Todo el contenido de esta revista, excepto dónde está identificado, está bajo una Licencia Creative Commons