SciELO - Scientific Electronic Library Online

 
vol.22 número3Using BiLSTM in Dependency Parsing for VietnameseWord Sense Disambiguation Features for Taxonomy Extraction índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Não possue artigos similaresSimilares em SciELO

Compartilhar


Computación y Sistemas

versão On-line ISSN 2007-9737versão impressa ISSN 1405-5546

Resumo

TERBEH, Naim; MARAOUI, Mohsen  e  ZRIGUI, Mounir. Arabic Dialect Identification based on Probabilistic-Phonetic Modeling. Comp. y Sist. [online]. 2018, vol.22, n.3, pp.863-870. ISSN 2007-9737.  https://doi.org/10.13053/cys-22-3-3020.

The identification of Arabic dialects is considered to be the first pre-processing component for any natural language processing problem. This task is useful for automatic translation, information retrieval, opinion mining and sentiment analysis. In this purpose, we propose a statistical approach based on the phonetic modeling to identify the correspondent Arabic dialect for each input acoustic signal. The main idea consists first, and for each dialect, in calculating a referenced phonetic model. Second, for every input audio signal, we calculate an appropriate phonetic model. Third, we compare this latter to all referenced Arabic dialect models. Finally, we associate the input acoustic signal to the dialect where the referenced phonetic model minimizes the cosine similarity. The obtained results are satisfactory. Indeed, based on 117 audio sequences, we attain a classification rate of 93%. Supporting the achieved results and the coverage of most of Arabic dialects, this study can be a reference for future work addressing dialectical speech processing applications.

Palavras-chave : Arabic dialects; probabilistic-phonetic model; dialect identification; cosine similarity.

        · texto em Inglês     · Inglês ( pdf )