<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462017000400647</article-id>
<article-id pub-id-type="doi">10.13053/cys-21-4-2867</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Parsing Arabic Nominal Sentences with Transducers to Annotate Corpora]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Hammouda]]></surname>
<given-names><![CDATA[Nadia Ghezaiel]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Haddar]]></surname>
<given-names><![CDATA[Kais]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,Higher Institute of Computer and Communication Technologies of Hammam Sousse  ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Tunisia</country>
</aff>
<aff id="Af2">
<institution><![CDATA[,University of Sfax Faculty of Sciences of Sfax ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Tunisia</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2017</year>
</pub-date>
<volume>21</volume>
<numero>4</numero>
<fpage>647</fpage>
<lpage>656</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462017000400647&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462017000400647&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462017000400647&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract: Studying Arabic nominal sentences is important to analyze and annotate successfully Arabic corpora. This type of sentences is frequent in Arabic text and speech. Transducers can be used to realize local grammars and treat several linguistic phenomena. In this paper, we propose a parsing approach for Arabic nominal sentences using transducers. To do this, we first study the typology of the Arabic nominal sentence indicating its different forms. Then, we develop a set of lexical and syntactic rules dealing with this type of sentences and respecting the specificity of the Arabic language. After that, we present our parsing approach based on the transducers and on certain principles. In fact, this approach allows the annotation of Arabic nominal sentences but also the Arabic verbal ones. Finally, we present an idea about the implementation and experimentation of our approach in NooJ platform. The obtained results are satisfactory.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Arabic nominal sentence]]></kwd>
<kwd lng="en"><![CDATA[disambiguation process]]></kwd>
<kwd lng="en"><![CDATA[topic]]></kwd>
<kwd lng="en"><![CDATA[transducer]]></kwd>
<kwd lng="en"><![CDATA[NooJ platform]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Al-Taani]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Al-Rub]]></surname>
<given-names><![CDATA[S. Ab.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A Rule-Based Approach for Tagging Non-Vocalized Arabic Words, International Arab]]></article-title>
<source><![CDATA[Journal of Information Technology (IAJIT)]]></source>
<year>2009</year>
<volume>6</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>320-8</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Attia]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<source><![CDATA[Handling Arabic Morphological and Syntactic Ambiguity within the LFG Framework with a View to Machine Translation]]></source>
<year>2008</year>
<volume>279</volume>
</nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Beesley]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
</person-group>
<source><![CDATA[Finite-State Morphological Analysis and Generation of Arabic at Xerox]]></source>
<year>2001</year>
<conf-name><![CDATA[ 16th conference on Computational linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>89-94</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dichy]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Alrahabi]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Levée d&#8217;ambiguité par la methode d&#8217;exploration contextuelle: la sequence &#8216;alif-nûn&#8217; en arabic]]></source>
<year>2009</year>
<conf-name><![CDATA[ 2International Conference SIIE]]></conf-name>
<conf-loc> </conf-loc>
<page-range>573-85</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Diab]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<source><![CDATA[Second Generation Tools (AMIRA 2.0): Fast and Robust Tokenization, POS tagging, and Base Phrase Chunking]]></source>
<year>2009</year>
<conf-name><![CDATA[ 2International Conference on Arabic Language Resources and Tools]]></conf-name>
<conf-loc> </conf-loc>
<page-range>285-8</page-range></nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Diab]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Hacioglu]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Jurafsky]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Automatic Tagging of Arabic Text: From Raw Text to Base Phrase Chunks]]></source>
<year>2004</year>
<conf-name><![CDATA[ HLT-NAACL]]></conf-name>
<conf-loc> </conf-loc>
<page-range>149-52</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ellouze]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Haddar]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Abdelhamid]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Etude et analyse du pluriel brisé arabe avec la plateforme NooJ]]></source>
<year>2009</year>
<conf-name><![CDATA[ NooJ Conference and Workshop]]></conf-name>
<conf-loc> </conf-loc>
<page-range>6-17</page-range><publisher-loc><![CDATA[Tunisia ]]></publisher-loc>
<publisher-name><![CDATA[Tozeur]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Haddar]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Abdelkader]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Ben]]></surname>
<given-names><![CDATA[H.A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Etude et analyse de la phrase nominale arabe en HPSG]]></article-title>
<source><![CDATA[Traitement Automatique des Langue Naturelles]]></source>
<year>2006</year>
<page-range>379-88</page-range><publisher-loc><![CDATA[Louvain ]]></publisher-loc>
<publisher-name><![CDATA[UCL Presses de Louvain]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hammouda]]></surname>
<given-names><![CDATA[G.N.]]></given-names>
</name>
<name>
<surname><![CDATA[Haddar]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Toward the Resolution of Arabic Lexical Ambiguities with Transduction on Text Automaton]]></source>
<year>2015</year>
<page-range>37-42</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hammouda]]></surname>
<given-names><![CDATA[G.N.]]></given-names>
</name>
<name>
<surname><![CDATA[Haddar]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Integration of a segmentation tool for Arabic corpora in NooJ platform to build an automatic annotation tool]]></article-title>
<source><![CDATA[Communications in Computer and Information Science]]></source>
<year>2016</year>
<volume>667</volume>
<page-range>89-100</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Habash]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Rambow]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[Roth]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[MADA+TOKAN: A toolkit for Arabic Tokenization, diacritization, morphological disambiguation, POS tagging, stemming and lemmatization]]></source>
<year>2009</year>
<conf-name><![CDATA[ Proceedings of the Second International Conference on Arabic Language Resources and Tools]]></conf-name>
<conf-loc>Cairo, Egypt </conf-loc>
<page-range>102-9</page-range></nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mesfar]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<source><![CDATA[Thesis: Analyse morpho-syntaxique automatique et reconnaissance des entités nommées en arabe strandard]]></source>
<year>2008</year>
<page-range>235</page-range><publisher-name><![CDATA[University of Franche Comté]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Silberztein]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<source><![CDATA[Disambiguation Tools for NooJ]]></source>
<year>2010</year>
<conf-name><![CDATA[ International NooJ Conference]]></conf-name>
<conf-loc> </conf-loc>
<page-range>158-71</page-range><publisher-loc><![CDATA[Newcastle ]]></publisher-loc>
<publisher-name><![CDATA[Cambridge Scholars Publishing]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Shaalan]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Othman]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Rafea]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Towards Resolving Ambiguity in Understanding Arabic Sentence]]></source>
<year>2004</year>
<conf-name><![CDATA[ Proceedings of the International Conference on Arabic Language Resources and Tools]]></conf-name>
<conf-loc>Cairo, Egypt </conf-loc>
<page-range>118-22</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zalila]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Haddar]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Construction of an HPSG Grammar for the Arabic Relative Sentences]]></article-title>
<source><![CDATA[Natural Language Processing, RANLP]]></source>
<year>2011</year>
<page-range>12-4</page-range></nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gross]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The construction of local grammar]]></article-title>
<source><![CDATA[Finite-State Language Processing]]></source>
<year>1997</year>
<page-range>329-54</page-range><publisher-loc><![CDATA[England ]]></publisher-loc>
<publisher-name><![CDATA[MIT Press]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
