SciELO - Scientific Electronic Library Online

 
 número53Data Reduction and Regression Using Principal Component Analysis in Qualitative Spatial Reasoning and Health InformaticsImproving Corpus Annotation Quality Using Word Embedding Models índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay artículos similaresSimilares en SciELO

Compartir


Polibits

versión On-line ISSN 1870-9044

Resumen

JEBARI, Chaker. A Segment-based Weighting Technique for URL-based Genre Classification of Web Pages. Polibits [online]. 2016, n.53, pp.43-47. ISSN 1870-9044.  https://doi.org/10.17562/PB-53-4.

We propose a segment-based weighting technique for genre classification of web pages. This technique exploits character n-grams extracted from the URL of the web page rather than its textual content. The main idea of our technique is to segment the URL and assigns a weight for each segment. Experiments conducted on three known genre datasets show that our method achieves encouraging results.

Palabras llave : URL; genre classification; web page; segment weight.

        · texto en Inglés     · Inglés ( pdf )