SciELO - Scientific Electronic Library Online

 
vol.23 número3Sentence Generation Using Selective Text PredictionSimilarity Driven Unsupervised Learning for Materials Science Terminology Extraction índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Não possue artigos similaresSimilares em SciELO

Compartilhar


Computación y Sistemas

versão On-line ISSN 2007-9737versão impressa ISSN 1405-5546

Resumo

SHASHAVALI, D. et al. Sentence Similarity Techniques for Short vs Variable Length Text using Word Embeddings. Comp. y Sist. [online]. 2019, vol.23, n.3, pp.999-1004.  Epub 09-Ago-2021. ISSN 2007-9737.  https://doi.org/10.13053/cys-23-3-3273.

In goal-oriented conversational agents like Chatbots, finding the similarity between user input and representative text result is a big challenge. Generally, the conversational agent developers tend to provide a minimal number of utterances per intent, which makes the classification task difficult. The problem becomes more complex when the length of the representative text per action is short and the length of the user input is long. We propose a methodology that derives Sentence Similarity score based on N-gram and Sliding Window and uses the FastText Word Embeddings technique which outperforms the current state-of-the-art Sentence Similarity results. We are also publishing a dataset on the shopping domain, to build conversational agents. And the extensive experiments done on the dataset fetched better results in accuracy, precision and recall by 6%, 2% and 80% respectively. It also evinces that our solution generalizes well on the low corpus and requires no training.

Palavras-chave : Sentence similarity; word embeddings; natural language processing; sliding window; N-grams; text classification.

        · texto em Inglês     · Inglês ( pdf )