SciELO - Scientific Electronic Library Online

 
 número53Data Reduction and Regression Using Principal Component Analysis in Qualitative Spatial Reasoning and Health InformaticsImproving Corpus Annotation Quality Using Word Embedding Models índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Não possue artigos similaresSimilares em SciELO

Compartilhar


Polibits

versão On-line ISSN 1870-9044

Resumo

JEBARI, Chaker. A Segment-based Weighting Technique for URL-based Genre Classification of Web Pages. Polibits [online]. 2016, n.53, pp.43-47. ISSN 1870-9044.  https://doi.org/10.17562/PB-53-4.

We propose a segment-based weighting technique for genre classification of web pages. This technique exploits character n-grams extracted from the URL of the web page rather than its textual content. The main idea of our technique is to segment the URL and assigns a weight for each segment. Experiments conducted on three known genre datasets show that our method achieves encouraging results.

Palavras-chave : URL; genre classification; web page; segment weight.

        · texto em Inglês     · Inglês ( pdf )