Improving Question Analysis for Arabic Question Answering in the Medical Domain

Dardour, Sondes; Fehri, Hela; Haddar, Kais; Dardour, Sondes; Fehri, Hela; Haddar, Kais

doi:10.13053/cys-26-3-4345

Servicios Personalizados

Revista

Articulo

Indicadores

Citado por SciELO
Accesos

Links relacionados

Similares en SciELO

Otros
Otros

Permalink

Computación y Sistemas

versión On-line ISSN 2007-9737versión impresa ISSN 1405-5546

Comp. y Sist. vol.26 no.3 Ciudad de México jul./sep. 2022 Epub 02-Dic-2022

https://doi.org/10.13053/cys-26-3-4345

Articles

Improving Question Analysis for Arabic Question Answering in the Medical Domain

Sondes Dardour¹^*

Hela Fehri¹

Kais Haddar¹

¹1 University of Sfax, MIRACL Laboratory, Tunisia. kais.haddar@yahoo.fr, dardour.sondes@yahoo.com.

Abstract:

Question analysis is a basic module in a question answering (QA) system, and its quality affects the performance of QA system. In this paper, we address the problem of Arabic question analysis in the medical domain where several specific challenges are met. The major challenging issue in processing Arabic medical question is the need for ambiguity resolution. Nevertheless, this issue has not been well studied in related works. Our question analysis uses dictionaries and transducers to analyze any medical question, factoid or complex. This module detects important elements of the question, including: different words in the question that identify what the user wants to ask for, and the nature of the expected answer. To identify well these elements a step of disambiguation is applied. Then, the words used in the question will be extended by adding new words that connect semantically to those in the question. Experimentation of the question analysis module of our Arabic medical question answering system show interesting results.

Keywords: Question answering; Arabic; disambiguation; medical domain; dictionary; transducer

1 Introduction

Nowadays, due to the continuous exponential growth of information produced in the medical domain, and due to the important impact of such information upon research and upon real world applications, there is a particularly great and growing demand for Question Answering (QA) systems that can effectively and efficiently aid users in their medical information search [¹].

QA system takes a question posted in natural language instead of a set of key-words, analyzes and understands the meaning of the question, and then provides the exact answer from a set of knowledge resources [²]. The QA system consists of three main processing modules, namely, question processing, passages retrieval processing, and answer processing. A question processing is the primary and basic source through which a search process is directed for answer. Therefore, an accurate and careful analysis to the question is required. Thus, question processing is the most fundamental module in any QA system, and the performance of its results significantly impacts on the following modules of information retrieval and answer extraction.

To our knowledge, proposed Arabic medical QA systems are so limited either in terms of their performance as well as in terms of the types of questions they are designed to answer. Moreover, the most attention in Arabic has been paid to answering factoid questions, in which the answer is a single word or a short phrase [³].

Ambiguity is a common phenomenon in human natural language. In QA, ambiguity is a critical challenge in extracting what the user looking for in his question. Therefore, ambiguity can cause confusion in interpretation of the question, and then impacts negatively the performance of the QA system.

In this paper, we propose a new approach to handle medical questions (factoid and complex questions) for the Arabic language. Moreover, our approach overcomes the ambiguity in the question processing module, an issue that has not been appropriately addressed in the field of Arabic QA.

The remainder of this paper is structured as follows. Section 2 presents the related works. Ambiguity problems are presented in section 3. Section 4 describes our approach. Section 5 deals with the experimentation carried out to evaluate the efficiency of our question analysis module. Finally, section 6 draws the main contributions and proposes further perspectives.

2 Related Works

The problem of answering questions formulated in natural language has been studied in the field of Information Retrieval (IR) since the mid-1990s [⁴]. However, unlike IR, the QA system returns simple and precise answer to a natural language question instead of a large number of documents [⁵, ⁶] . As we mentioned, the QA system is composed of three modules: question analysis, passage or document retrieval and answer extraction. Different QA systems may use different implementation for each module [⁷, ⁸]. In this section, we focus on some studies for the question analysis module.

Until now, very little effort was directed toward the development of QA system for the medical domain in the Arabic language, compared to other languages such as French and English. This is mainly attributed to the particularities of the medical domain and the language (see Section 3). The situation is further aggravated by the lack of linguistic resources and Natural Language Processing (NLP) tools that is available for Arabic [⁹, ¹⁰].

In an effort to achieve a better question analysis, [²] analyzed the question to extract type and category of desired answer whether it is a place, a quantity, a name or a date, which makes the answer extraction easier.

[¹⁰] analyzed Arabic questions by formulating the query, extracting the expected answer type, the question focus and the question keywords. The focus is the noun phrase of the question which the user wants to ask about. For instance, if the user’s question is “What is the capital of Canada?” then the question focus is “Canada” and the keyword is “capital” and the expected answer type is a named entity for a location.

[¹¹] analyzed the question by:

— Tokenization and normalization.
— Determining answer type by question words (When, What...)
— Named entity recognition.
— Focus determination by extracting the main named entity.
— Keywords extraction.
— Removing stop words using the Khoja stop list.
— Query expansion using the Arabic dictionary of synonyms. Named entities are not expanded to avoid ambiguity.
— Stemming by Khoja’s Stemmer and named entities are not stemmed.
— Query generation of keywords into a boolean formula.

[³] made six steps to process Why-questions. They tokenized the question, then normalized it, then removed stop words (optional step). After that, they applied khoja’s stemmer to obtain the root of each non-stop word in the question. Then, they used the extracted keywords to formulate and generate the query. Finally, they extended the list of keywords by including synonyms and words that share the same root.

The systems of [¹²] and [¹³] are developed for the medical domain in Arabic language. These systems analyze only factoid questions by extracting the topic and the focus of the question, and extracting named entities. The system of [¹²] classifies the questions into organization, location, person, viruses, diseases, treatment.

We can confirm, from literature reviews, that most Arabic QA systems ensure analysis of factoid questions. Nevertheless, there are few studies that have addressed the problem of answering complex questions. In addition, there are few works that have integrated semantic analysis and treated the medical field in the Arabic language, which makes the development of a new Arabic QA system is crucial.

3 Ambiguity

A study of different questions showed us the existence of several linguistic phenomena which can cause ambiguities in the question processing. Indeed, if we solve these problems, the errors will be so minimal and our system will be more relevant compared to existing Arabic QA systems.

3.1 Specific Arabic Difficulties

Arabic specific difficulties consist in its richness that needs special processing, which makes regular NLP systems, designed for other languages, unable to process it. One of the Arabic-specific difficulties is the lack of diacritics (i.e. kasra, fatha, damma), which leads to more ambiguous situations than any other language. This issue can be explained through the question (Who was killed in Uganda?).

The lack of diacritics in verb (to kill) presents at least two cases for the question processing:

— qutila which means that the question is “Who was killed in Uganda?”, so in this question means (was killed).
— qatala which means that the question is “Who did kill in Uganda?”, so in this question means (kill).

Arabic language morphology is challenging when compared to other languages. This is because Arabic is a highly agglutinative and derivational language where a word token can replace a whole sentence in other languages. For example, for the question (Do we can prevent the clot?), the sentence “Do we can” can be expressed in one Arabic word which includes the verb (can), the prefix (do) and the pronoun (we). Therefore, extracting keywords from an Arabic question will be more complex than any other language. Furthermore, in a question like (Who are the two scientists who won the Nobel Prize in medicine for cancer treatment?), the user looks for the name of two persons (i.e. James P. Allison and Tasuku Honjo). In English, the system catches this user require through the word “two”. In Arabic QA, this keyword is embedded in the word Alt abiybaAni (two scientists) thanks to the suffix Ani. Actually, the question above is just an example; the morphology of an Arabic word may contain multiple information (basic POS, number, gender, etc.) which are important for each module of Arabic QA. Unlike English and most Latin-based languages, Arabic does not have capital letters which makes Named Entity Recognition (NER) harder [¹⁴].

3.2 Specific Difficulties of Medical Domain

Apart from ambiguity in Arabic language, ambiguity also appears in medical terms. We observed that the more ambiguous terms are diseases names. For example, the term means both an insect and a dermatological disease. This issue can be explained through the question (What is a louse?); such system can extract the following answers:

(Louse is a kind of harmful insect that feeds on human blood)

Definition of an insect.

(Louse is a disease that affects the scalp)

Definition of a disease.

In fact, to extract the right answer, the system must understand the context. For example, in (2), the keyword (disease) indicate that it is a definition of the disease (Louse). Furthermore, in open-domain, the nature of the expected answer is known from the interrogative pronouns. For instance, in a When-question (When America discovered?), the nature of the expected answer is a time. Nevertheless, in medical-domain, a When-question can indicate an age, a condition or a time. Table 1 gives an example. To extract the correct answer, we must define a sequence of keywords which define the question and disambiguate it in the sense that it indicates what the question is looking for.

Table 1 Ambiguity of the question (When)

Question	Translation	Question type	Expected answer
	When is the fetus heart developed?	Wh-question	Age
	When should visit a psychiatrist?	Wh-question	Condition
	When did Alzheimer’s disease be	Wh-question	Time

4 Proposed Method

The challenges discussed in the previous section make clear the need for new method to deal with Arabic medical QA. In addition, the most of previous studies are based on a superficial analysis of factoid questions (i.e. where, when, how much/many, who and what).

The originality of our approach lies in the disambiguation and the semantic analysis of factoid and complex questions (i.e. why and how to). In our proposal, the question analysis module is based on five steps as illustrated in Fig 1: Corpus study, Named Entity Recognition (NER), Stop word removal, Disambiguation, and Question Expansion (QE). In the first step, questions are gathered and studied to define the disambiguation patterns.

Fig. 1 Proposed method for question analysis module

These patterns are transformed into transducers to process any type of medical question in Arabic language. Questions will be processed by the parallel steps (NER, Stop word removal, and disambiguation) using dictionaries, syntactic grammars, and morphological grammar in order to get some useful information. Finally, the last step will extend the extracted keywords.

4.1 Corpus Study

The need to have an Arabic corpus is a necessity for processing Arabic QA systems. Indeed, the questions are gathered from several sources, namely, discussion forums, frequently asked questions (FAQ) and some questions translated from Text REtrieval Conference (TREC). Currently, we collected 350 questions which contain seven categories (see Table 2). The questions are then subjected to an analysis step.

Table 2 Collected questions

Question type	What	When	Where	Who	How many/much	How	Why
Number	85	53	42	38	45	39	48

According to our study, we identify 158 question disambiguation patterns. Table 3 shows some patterns of the question (When). These patterns will be transformed into transducers to parse the questions.

Table 3 Some question disambiguation patterns of the question (When)

Question	Expected answer
When<Verb>Fetus_\|Child _\|Infant?	Age
When <Verb><Noun>Child_\|Fetus _\|Infant?	Age
When <Verb><Noun><Condition>	Condition
When <Verb><Trigger ><Virus>	Time
When <Verb><Virus>	Time

4.2 Named Entity Recognition (NER)

The previous studies emphasize that the NER is important for all the QA system components. Indeed, the integration of a NER step will definitely boost our system performance because the answer of a factoid question is a named entity.

In our case, we developed our own NER tool especially formulated for our proposal. This step is based on dictionaries and transducers. We have considered five categories:

— Organ: Names of medical organs.
— Location: Names of location.
— Disease: Names of diseases, sickness, illness.
— Virus: Names of medical viruses.
— Treatment: Names of Treatments.

4.3 Stop Words Removal

This step removes the conjunctions, prepositions and interrogative pronouns. After removing the stop words, the important terms in the question will be remaining. In our proposal, the stop words are eliminated from the outputs of the syntactic transducers (see Fig 3).

Fig. 2 Extract of dictionary

Fig. 3 Transducer of pattern When<Verb>Fetus| Child |Infant?

4.4 Disambiguation

Our system is based on dictionaries and transducers. These resources allow us to disambiguate ambiguous words and the nature of the expected answer (Problems mentioned in the previous section).

4.4.1 Word Sense Disambiguation (WSD)

WSD process is required in application such as a QA application [¹⁵]. Some ambiguous words which have a different sense influence negatively the extraction of the correct answer. Let’s take the following questions as an example:

(When does the brain generate electricity?)

(When a baby who has congenital anomalies is born?)

As shown above, the verb have the sense of “generate” in the question (1) and the sense of “born” in the question (2). To resolve this problem, as shown in our dictionary in Fig 2, each ambiguous word is associated with semantic feature to identify the sense of the entry (sens-generer, sens-naitre). This feature is used in the syntactic transducers (see Fig 3).

The WSD process allows also our system to define the correct stem. For instance, the stem of ywld in question (1) is wal ada and in the question (2) is >awolada.

4.4.2 Disambiguation of the Nature of the Expected Answer

For a reliable disambiguation, each defined pattern in the corpus study step is transformed into transducers. The identification of the nature of the expected answer is related to the focus of the question. For example, the transducer of Fig 3 describes the paths of the pattern ”When<Verb>Fetus| Child |Infant?” (see Table 3). This transducer can analyze a question like (When the child crawls?). The focus in this example is (child), so the nature of the expected answer is “Age”.

4.5 Question Expansion

Extraction only original question keyword is proved to have some limitations. To get rid of these limitations, we need to define the meaning the user looking for.

Therefore, in question expansion (QE), we extend the list of the exact words of the user’s question by adding new words that connect semantically to those in the question. Since the documents may not contain the terms that the user used in his question, expanding question will increase the chance of getting the answer [¹⁶].

In the previous works, QE is achieved using Arabic WordNet^{^fn}. In our dictionaries, the feature Syno (for synonyms) is used to expand questions. This feature is called in the QE transducer as shown in Fig 4.

Fig. 4 QE transducer to extract synonyms

After processing the question (What is the appropriate use of sunscreen for a baby?) with the previous steps, the transducer of Fig 4 can extract the synonym of <isotixdaAm which is <isotiEomaAl, the synonym of munaAsib which is mulaAim and the synonym of raDiyEo which is Tifl.

Expanding question can be applied also in order to overcome the situations where the Passage Retrieval (PR) module eliminates relevant passages containing other forms of the question keywords. The idea now is adding other forms of the keywords that share the same root (see Fig 5).

Fig. 5 QE transducer to extract different forms

Let’s continue with the same question .

Thanks to QE process, the PR module can extract not only the passages that contain the keyword raDiyEo but also its broken plural form ruDãEo. This process is applied also to extracted synonyms. Therefore, we consider each keyword with its synonym and its different forms since the QE would theoretically generate all these terms.

The expanded list of terms extracted from the question will be sent to the PR module to extract the passages that may contain the answer.

5 Experimentation and Evaluation

In our proposal, linguistic resources are built with the linguistic platform NooJ [¹⁷]. We conduct a set of experimentation to evaluate the performance of our question analysis module. Therefore, we exploit a test corpus which contains 399 questions. For each question type ( “What”, “When”, “Where”, “Who”, “How many/much”, “How”, “Why”) a set of 57 questions is used.

The results of applying the transducer that extracts the type of the expected answer and keywords are illustrated in Fig 6. This transducer allows the NER, stop words removal, and disambiguation. Then, the keywords are expended by the QE transducers.

Fig. 6 Extract of concordance table

After applying the analysis on the test corpus using our linguistic resources, we obtain the results illustrated in Table 4.

Table 4 Summarizing the measure values

Method	Without disambiguation	With disambiguation
Precision	66%	93%
Recall	58%	87%
F-Measure	61%	89%

Table 4 shows that the disambiguation process enhances the F-Measure by 28%. It is then concluded that by reducing ambiguity, especially when processing the medical domain in the Arabic language, the obtained results will be increased.

Errors are often due to the problem in writing some Arabic letters such as the letter “A” which can also be writing like > or | or <. For example, in some question, we can find the word “inflammation” written like AlotihaAbo or <ilotihaAbo. To resolve this problem, we need to rewrite the question by unifying all variants of a letter into a single form. Furthermore, the presented errors in the question analysis are due to dictionaries’ coverage that must be improved and the complexity of some questions that requires special handling techniques.

6 Conclusion

In the present paper, we have developed a question analysis module (QAM) for our system to analyze an Arabic medical question.

Our QAM is mainly concerned with the identification of four factors, namely, keywords extraction, disambiguation, question expansion, and nature of the expected answer extraction. This analysis of question allows extracting all the necessary information that will be used as inputs for the other QA components. Our proposed method achieves satisfactory results.

In the future work, we seek to add a pre-processing to normalize the question. We also seek to improve our linguistic resources by adding new terms in the dictionaries.

References

1. Athenikos, S. J., Han, H. (2010). Biomedical question answering: A survey. Computer methods and programs in biomedicine, Vol. 99, No. 1, pp. 1–24. DOI: 10.1016/j.cmpb.2009.10.003. [ Links ]

2. Hammo, B., Abuleil, S., Lytinen, S., Evens, M. (2004). Experimenting with a question answering system for the Arabic language. Computers and the Humanities, Vol. 38, No. 4, pp. 397—415. DOI: 10.1007/s10579-004-1917-3. [ Links ]

3. Azmi, A. M., Alshenaifi, N. A. (2017). Lemaza: An Arabic why-question answering system. Natural Language Engineering, Vol. 23, No. 6, pp. 877–903. DOI: 10.1017/S1351324917000304 [ Links ]

4. Verberne, S. (2010). In Search of the Why. PhD Thesis, University of Nijmegen, The Netherlands. [ Links ]

5. Kanaan, G., Hammouri, A., Al-Shalabi, R., Swalha, M. (2009). A new question answering system for the Arabic language. American Journal of Applied Sciences, Vol. 6, No. 4, pp. 797–825. [ Links ]

6. Trigui, O., Belguith, L. H., Rosso, P. (2010). DefArabicQA: Arabic definition question answering system. Workshop on Language Resources and Human Language Technologies for Semitic Languages, 7th LREC, Valletta, Malta, pp. 40–45. [ Links ]

7. Benajiba, Y., Rosso, P., Gómez-Soriano, J. M. (2007). Adapting the JIRS passage retrieval system to the Arabic language. Lecture Notes in Computer Science, vol 4394, Springer, Berlin, Heidelberg. pp. 530–541. DOI: 10.1007/978-3-540-70939-847. [ Links ]

8. Ezzeldin, A. M., Shaheen, M. (2012). A survey of Arabic question answering: challenges, tasks, approaches, tools, and future trends. Proceedings of the 13th International Arab Conference on Information Technology (ACIT´12), pp. 1–8. [ Links ]

9. Abouenour, L., Bouzoubaa, K., Rosso, P. (2008). Improving Q/A using Arabic Wordnet. Proceedings of the 2008 International Arab Conference on Information Technology (ACIT’2008), Tunisia, December. [ Links ]

10. Brini, W., Ellouze, M., Mesfar, S., Belguith, L. H. (2009). An Arabic question-answering system for factoid questions. IEEE International Conference on Natural Language Processing and Knowledge Engineering. NLP-KE´09, pp. 1–7. DOI: 10.1109/NLPKE.2009.5313730. [ Links ]

11. Abdelbaki, H., Shaheen, M., Badawy, O. (2011). ARQA high performance Arabic question answering system. Proceedings of Arabic Language Technology International Conference (ALTIC). [ Links ]

12. Bessaies, E., Mesfar, S., Ghzela, H. B. (2018). Processing medical binary questions in standard Arabic using nooJ. Lecture Notes in Computer Science, Vol. 10859, Springer, Cham. DOI: 10.1007/978-3-319-91947-8_19. [ Links ]

13. Ennasri, I., Dardour, S., Fehri, H., Haddar, K. (2017). Question-response system using the nooJ linguistic platform. In: Mbarki, S., Mourchid, M., Silberztein, M., eds. Formalizing Natural Languages with NooJ and Its Natural Language Processing Applications. NooJ´17. Communications in Computer and Information Science, vol 811. Springer, Cham. DOI: 10.1007/978-3-319-73420-016 [ Links ]

14. Shaheen, M., Ezzeldin, A. M. (2014) Arabic question answering: systems, resources, tools, and future trends. Arabian Journal for Science and Engineering, Vol. 39, No. 6, pp. 4541–4564. [ Links ]

15. Ferrández, S., Roger, S., Ferrández, A., Aguilar, A., López-Moreno, P. (2006). A new proposal of Word Sense Disambiguation for nouns on a Question Answering System. Advances in Natural Language Processing. Research in Computing Science, Vol. 18, pp. 83–92. [ Links ]

16. Al-Chalabi, H., Ray, S., Shaalan, K. (2015). Semantic based query expansion for Arabic question answering systems. Arabic Computational Linguistics (ACLing´15), First International Conference on IEEE, pp. 127–132. DOI: 10.1109/ACLing.2015.25. [ Links ]

17. Silberztein, M. (2018). Using linguistic resources to evaluate the quality of annotated corpora. Proceedings of the First Workshop on Linguistic Resources for Natural Language, pp. 2–11. [ Links ]

http://globalwordnet.org/arabic-wordnet/

Received: February 20, 2019; Accepted: January 09, 2021

^* Corresponding author: Sondes Dardour, e-mail: hela.fehri@yahoo.fr

This is an open-access article distributed under the terms of the Creative Commons Attribution License