Publication result detail

Isomorphic mapping of DOM trees for Cluster-Based Page Segmentation

ZELENÝ, J.; BURGET, R.

Original title

Isomorphic mapping of DOM trees for Cluster-Based Page Segmentation

Type

Paper in proceedings not indexed in WoS or Scopus

Original abstract

In our previous work, we designed a method for fast and precise Web page segmentation. In this paper, we propose a complementary algorithm and data structures that extend the original design. The extension focuses on isomorphic mapping between two DOM trees. Our main objective is to improve the robustness of the original solution. We design and implement a solution that is more robust while retaining the efficiency of the original, simpler one. To demonstrate the qualities of the new design, we also present an experimental evaluation of the new implementation.

Keywords

vision-based page segmentation, cache, template detection, cluster-based page segmentation, DOM, tree mapping

Authors

ZELENÝ, J.; BURGET, R.

RIV year

2014

Published

5 November 2013

Publisher

The University of Technology Košice

Place

Spišská Nová Ves

ISBN

978-80-8143-127-2

Book

Proceedings of the Twelfth International Conference on Informatics INFORMATICS'2013

Pages from

256

Pages to

261

Page count

6

URL

https://www.fit.vut.cz/research/publication/10414/

BibTeX

@inproceedings{BUT103543,
  author="Jan {Zelený} and Radek {Burget}",
  title="Isomorphic mapping of DOM trees for Cluster-Based Page Segmentation",
  booktitle="Proceedings of the Twelfth International Conference on Informatics INFORMATICS'2013",
  year="2013",
  pages="256--261",
  publisher="The University of Technology Košice",
  address="Spišská Nová Ves",
  isbn="978-80-8143-127-2",
  url="https://www.fit.vut.cz/research/publication/10414/"
}
