Computación y Sistemas

On-line version ISSN 2007-9737
Print version ISSN 1405-5546

Comp. y Sist. vol.12 n.1 Ciudad de México Jul./Sep. 2008

 

Mexican Experience in Spanish Question Answering

 

Experiencia Mexicana en la Búsqueda de Respuestas en Español

 

Manuel Montes y Gómez, Luis Villaseñor Pineda and Aurelio López López

 

Laboratorio de Tecnologías del Lenguaje, Coordinación de Ciencias Computacionales, Instituto Nacional de Astrofísica, Óptica y Electrónica (INAOE). Luis Enrique Erro #1, Tonantzintla, Puebla, México. E-mails: mmontesg@inaoep.mx, villasen@inaoep.mx, allopez@inaoep.mx

 

Article received on April 14, 2008
Accepted on June 20, 2008

 

Abstract.

Nowadays, thanks to great advances in communication and storage media, more information is available than ever before. This information can satisfy almost any information need; without appropriate management facilities, however, it is practically useless. This has motivated the emergence of several text processing applications that help users access large document collections. Currently, there are three main approaches for this purpose: information retrieval, information extraction, and question answering. Question answering (QA) systems aim to identify the exact answer to a question from a given document collection. This paper surveys the Mexican experience in Spanish QA. In particular, it gives an overview of the participation of the Language Technologies Laboratory of INAOE (LabTL) in the Spanish QA evaluation task at CLEF from 2004 to 2007. Through this participation, the LabTL has mainly explored two approaches to QA: a language-independent approach based on statistical methods, and a language-dependent approach supported by sophisticated linguistic analysis of texts. As a result of this work, the LabTL has become one of the leading research groups in Spanish QA.

Keywords: Question Answering, Passage Retrieval, Answer Extraction, Machine Learning.

 

Resumen.

En la actualidad, debido a los grandes avances en los medios de comunicación y de almacenamiento, hay más información disponible como nunca antes se ha visto. Esta información puede satisfacer casi todas las necesidades de información, sin embargo, sin una adecuada gestión ésta es prácticamente inútil. Este hecho ha motivado la aparición de diferentes aplicaciones para el procesamiento de texto orientadas a facilitar el acceso a grandes colecciones de documentos. Hoy en día, existen tres enfoques principales para este propósito: la recuperación de información, la extracción de información, y los sistemas de búsqueda de respuestas. Los sistemas de búsqueda de respuestas (QA por sus siglas en inglés) tienen por objeto identificar la respuesta exacta a una pregunta dentro de una determinada colección de documentos. Este trabajo presenta un panorama general de la experiencia mexicana en QA en español. En particular, se presentan las participaciones del Laboratorio de Tecnologías del Lenguaje del INAOE (LabTL) en la tarea de QA en español dentro del foro de evaluación CLEF, desde 2004 a 2007. A través de estas participaciones, el LabTL ha explorado principalmente dos enfoques diferentes en QA: un enfoque independiente del lenguaje basado en métodos estadísticos, y un enfoque dependiente del lenguaje apoyado en un complejo análisis lingüístico del texto. Es importante señalar que, debido a estos trabajos, el LabTL se ha convertido en uno de los principales grupos de investigación de QA en español.

Palabras Claves: Búsqueda de Respuestas, Recuperación de Pasajes, Extracción de Respuestas, Aprendizaje Automático.

 


Acknowledgements

The authors acknowledge the direct contributions of doctoral students Alberto Téllez–Valero, Manuel Pérez–Coutiño, and Rita M. Aceves–Pérez, and master's students Antonio Juárez–González and Claudia Denicia–Carral, as well as the indirect efforts of doctoral students Thamar Solorio and René García–Hernández, and master's students Gustavo Hernández–Rubio, Esaú Villatoro–Tello, Aarón Pancardo–Rodríguez, and Ma. del Rosario Peralta–Calvo. This research was partially supported by Conacyt through research grants C01–39957, 43990A–1, and SNI.

 

Creative Commons License: All the contents of this journal, except where otherwise noted, are licensed under a Creative Commons Attribution License.