SciELO - Scientific Electronic Library Online




Online version ISSN 1870-9044


JEBARI, Chaker. A Segment-based Weighting Technique for URL-based Genre Classification of Web Pages. Polibits [online]. 2016, n.53, pp.43-47. ISSN 1870-9044.

We propose a segment-based weighting technique for genre classification of web pages. This technique exploits character n-grams extracted from the URL of the web page rather than from its textual content. The main idea of our technique is to segment the URL and assign a weight to each segment. Experiments conducted on three known genre datasets show that our method achieves encouraging results.
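
As a rough illustration of the idea described in the abstract, the following sketch extracts character n-grams from each URL segment and accumulates a per-segment weight for every n-gram. The abstract does not specify how the URL is segmented or which weights are used, so the three-way split (domain, path, query), the weight values in `SEGMENT_WEIGHTS`, and all function names here are assumptions made for illustration only, not the author's method.

```python
from collections import Counter
from urllib.parse import urlsplit

# Hypothetical segment weights: the abstract does not give actual values.
SEGMENT_WEIGHTS = {"domain": 1.0, "path": 2.0, "query": 0.5}


def char_ngrams(text, n):
    """Return all overlapping character n-grams of text."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def split_url(url):
    """Split a URL into named segments (an assumed segmentation)."""
    parts = urlsplit(url.lower())
    return {"domain": parts.netloc, "path": parts.path, "query": parts.query}


def weighted_ngram_features(url, n=3, weights=SEGMENT_WEIGHTS):
    """Add the segment's weight to each character n-gram found in it."""
    features = Counter()
    for name, segment in split_url(url).items():
        for gram in char_ngrams(segment, n):
            features[gram] += weights[name]
    return features


if __name__ == "__main__":
    feats = weighted_ngram_features("http://example.com/blog/post?id=12")
    # Show the highest-weighted n-grams for this URL.
    for gram, weight in feats.most_common(5):
        print(repr(gram), weight)
```

The resulting weighted n-gram counts could then be passed to any standard classifier; which classifier and which weighting scheme work best is a matter for the full paper, not this sketch.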

Keywords: URL; genre classification; web page; segment weight.
