SciELO - Scientific Electronic Library Online

 


Computación y Sistemas

On-line version ISSN 2007-9737
Print version ISSN 1405-5546

Abstract

TERBEH, Naim; MARAOUI, Mohsen  and  ZRIGUI, Mounir. Arabic Dialect Identification based on Probabilistic-Phonetic Modeling. Comp. y Sist. [online]. 2018, vol.22, n.3, pp.863-870. ISSN 2007-9737.  https://doi.org/10.13053/cys-22-3-3020.

The identification of Arabic dialects is considered to be the first pre-processing component for any natural language processing problem. This task is useful for automatic translation, information retrieval, opinion mining and sentiment analysis. To this end, we propose a statistical approach based on phonetic modeling to identify the corresponding Arabic dialect for each input acoustic signal. First, for each dialect, we compute a reference phonetic model. Second, for every input audio signal, we compute its phonetic model. Third, we compare the latter with all reference Arabic dialect models. Finally, we assign the input acoustic signal to the dialect whose reference phonetic model maximizes the cosine similarity. The results obtained are satisfactory: on 117 audio sequences, we attain a classification rate of 93%. Given these results and the coverage of most Arabic dialects, this study can serve as a reference for future work on dialectal speech processing applications.
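
The four steps in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes that a phonetic model is simply the relative frequency of each phoneme in a transcribed sequence, that phoneme sequences are already available (the acoustic front end is omitted), and that the input is assigned to the closest reference model, that is, the one with the highest cosine similarity. The names `phonetic_model`, `cosine_similarity`, `identify_dialect`, and the toy dialect labels are hypothetical.

```python
import math
from collections import Counter


def phonetic_model(phonemes):
    """Relative frequency of each phoneme (assumed form of the model)."""
    counts = Counter(phonemes)
    total = sum(counts.values())
    return {p: c / total for p, c in counts.items()}


def cosine_similarity(a, b):
    """Cosine similarity between two sparse frequency vectors (dicts)."""
    dot = sum(v * b.get(p, 0.0) for p, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty model is similar to nothing
    return dot / (norm_a * norm_b)


def identify_dialect(input_phonemes, reference_models):
    """Return the dialect whose reference model is closest to the input."""
    model = phonetic_model(input_phonemes)
    return max(
        reference_models,
        key=lambda d: cosine_similarity(model, reference_models[d]),
    )


# Toy data: single characters stand in for phonemes (illustrative only).
reference_models = {
    "dialect_A": phonetic_model(list("aabbbc")),
    "dialect_B": phonetic_model(list("ccddda")),
}
result = identify_dialect(list("abbbca"), reference_models)
print(result)
```

In this toy run the input has the same phoneme distribution as the first reference model, so it is assigned to `dialect_A`. A real system would build the reference models from a labeled speech corpus per dialect.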

Keywords: Arabic dialects; probabilistic-phonetic model; dialect identification; cosine similarity.
