SciELO - Scientific Electronic Library Online

 
 número39Mining Reviews for Product Comparison and RecommendationCLAU - A Service-Oriented System for Complex Language Alignment: Architectural Aspects índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Não possue artigos similaresSimilares em SciELO

Compartilhar


Polibits

versão On-line ISSN 1870-9044

Polibits  no.39 México Jan./Jun. 2009

 

Articles

 

SMM: Detailed, Structured Morphological Analysis for Spanish

 

Cerstin Mahlow and Michael Piotrowski

 

Institute of Computational Linguistics, University of Zurich, Binzmühlestrasse 14, 8050 Zurich, Switzerland; e–mail: mahlow@cl.uzh.ch; mxp@cl.uzh.ch.

 

Manuscript received January 30, 2009.
Manuscript accepted for publication March 16, 2009.

 

Abstract

We present a morphological analyzer for Spanish called SMM. SMM is implemented in the grammar development framework Malaga, which is based on the formalism of Left–Associative Grammar. We briefly present the Malaga framework, describe the implementation decisions for some interesting morphological phenomena of Spanish, and report on the evaluation results from the analysis of corpora. SMM was originally only designed for analyzing word forms; in this article we outline two approaches for using SMM and the facilities provided by Malaga to also generate verbal paradigms. SMM can also be embedded into applications by making use of the Malaga programming interface; we briefly discuss some application scenarios.

Key words: Natural language processing, morphology, Malaga, Spanish.

 

DESCARGAR ARTÍCULO EN FORMATO PDF

 

REFERENCES

[1] R. Hausser, Foundations of Computational Linguistics: Human–Computer Communication in Natural Language, 2nd ed. Berlin/Heidelberg: Springer, 2001.         [ Links ]

[2] ––––––––––, "Three principled methods of automatic word form recognition." in VEXTAL: Proceedings of the Conference, 22—24 November 1999, Venice, Italy. Padova: Unipress, 1999, pp. 91–100.         [ Links ]

[3] Real Academia Española Comisión de Gramática, Esbozo de una nueva gramática de la lengua española, 2nd ed. Madrid: Espasa Calpe, 1974.         [ Links ]

[4] J. Alcina Franch and J. Manuel Blecua, Gramatica española, 9th ed. Barcelona: Ariel, 1994.         [ Links ]

[5] I. Bosque and V. Demonte, Eds., Gramatica descriptiva de la lengua española. Madrid: Real Academia Española/Espasa Calpe, 1999.         [ Links ]

[6] J. M. Goñi Menoyo and J. C. González Cristóbal, "A framework for lexical representation," in AI95: Fifteenth International Conference. Language Engineering, June 1995, pp. 243–252.         [ Links ]

[7] A. Moreno–Sandoval and J. M. Goñi Menoyo, "Spanish inflectional morphology in DATR," Journal of Logic, Language and Information, vol. 11, no. 1, pp. 79–105, 2002.         [ Links ]

[8] S. Rodríguez and J. Carretero, "A formal approach to Spanish morphology: the COES tools," in XII Congreso de la Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN'96), 1996, pp. 118–126.         [ Links ]

[9] J. M. Goñi Menoyo, J. C. González Cristóbal, and A. Moreno, "ARIES: A lexical platform for engineering Spanish processing tools," Nat. Lang. Eng., vol. 3, no. 4, pp. 317–345, 1997.         [ Links ]

[10] A. Zielinski and C. Simon, "Morphisto: An open–source morphological analyzer for German," in Seventh International Workshop on Finite–State Methods and Natural Language Processing, 2008, pp. 177–184.         [ Links ]

[11] F. Sánchez León, "A Spanish tagset for the CRATER project," Jun 1994. [Online]. Available: http://arxiv.org/abs/cmp–lg/9406023v1        [ Links ]

[12] S. Sharoff, "Creating general–purpose corpora using automated search engine queries," in Wacky! Working Papers on the Web as Corpus, M. Baroni and S. Bernardini, Eds. Bologna: GEDIT, 2006.         [ Links ]

[13] S. Rodríguez and J. Carretero, "Formalización de reglas morfológicas para un nuevo corrector ortográfico en español," Revista Española de lingüística, vol. 26, no. 2, pp. 379–387, November 1996.         [ Links ]

[14] A. Gelbukh and G. Sidorov, "Approach to construction of automatic morphological analysis systems for inflective languages with little effort," in Computational Linguistics and Intelligent Text Processing, 4th International Conference, CICLing 2003, Mexico City, Mexico, February 16– 22, 2003. Berlin/Heidelberg: Springer, 2003, pp. 157–162.         [ Links ]

[15] J. Atserias, B. Casas, E. Comelles, M. González, L. Padró, and M. Padró, "FreeLing 1.3: Syntactic and semantic services in an open–source NLP library," in Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC'06), 2006, pp. 48–55.         [ Links ]

[16] C. Mahlow and M. Piotrowski, "Linguistic support for revising and editing," in Computational Linguistics and Intelligent Text Processing: 9th International Conference, CICLing 2008, Haifa, Israel, February 17– 23, 2008, A. Gelbukh, Ed. Berlin/Heidelberg: Springer, 2008, pp. 631–642.         [ Links ]

Creative Commons License Todo o conteúdo deste periódico, exceto onde está identificado, está licenciado sob uma Licença Creative Commons