Detail publikačního výsledku

Text-Based Web Page Classification with Use of Visual Information

BARTÍK, V.

Originální název

Text-Based Web Page Classification with Use of Visual Information

Anglický název

Text-Based Web Page Classification with Use of Visual Information

Druh

Stať ve sborníku mimo WoS a Scopus

Originální abstrakt

As the number of pages on the web is permanently increasing, there is a need to classify pages into categories to facilitate indexing or searching them. In the method proposed here, we use both textual and visual information to find a suitable representation of web page content. In this paper, several term weights, based on TF or TF-IDF weighting are proposed. Modification is based on visual areas, in which the text appears and their visual properties. Some results of experiments are included in the final part of the paper.

Anglický abstrakt

As the number of pages on the web is permanently increasing, there is a need to classify pages into categories to facilitate indexing or searching them. In the method proposed here, we use both textual and visual information to find a suitable representation of web page content. In this paper, several term weights, based on TF or TF-IDF weighting are proposed. Modification is based on visual areas, in which the text appears and their visual properties. Some results of experiments are included in the final part of the paper.

Klíčová slova

web page classification, term weights, text classification, TF-IDF weight, visual information, visual  blocks

Klíčová slova v angličtině

web page classification, term weights, text classification, TF-IDF weight, visual information, visual  blocks

Autoři

BARTÍK, V.

Rok RIV

2011

Vydáno

13.08.2010

Nakladatel

IEEE Computer Society

Místo

Odense

ISBN

978-0-7695-4138-9

Kniha

2010 International Conference on Advances in Social Network Analysis and Mining

Strany od

416

Strany do

420

Strany počet

5

BibTex

@inproceedings{BUT35625,
  author="Vladimír {Bartík}",
  title="Text-Based Web Page Classification with Use of Visual Information",
  booktitle="2010 International Conference on Advances in Social Network Analysis and Mining",
  year="2010",
  pages="416--420",
  publisher="IEEE Computer Society",
  address="Odense",
  isbn="978-0-7695-4138-9"
}