Publication detail

Creating Searchable Web Page Snapshots using Semantic Technologies

BURGET, R.; SALEM, H.

Original title

Creating Searchable Web Page Snapshots using Semantic Technologies

Type

Paper in proceedings not indexed in WoS or Scopus

Language

English

Original abstract

For many applications, it is necessary to create snapshots of web pages that accurately describe how the page appeared in a browser at a given point in time. Storing the original code (even when including all referenced resources) and creating bitmap screenshots both have many drawbacks when it comes to searching, viewing, and manipulating such snapshots. In this paper, we demonstrate a different approach that uses a remotely controlled web browser to render web pages. We capture complete information about the rendered page and all pieces of its content, and transform it into an explicit RDF-based model stored in a repository. The stored page models can then be examined using interactive web-based tools, exported in different formats, linked with other data sources, and queried using SPARQL.

Keywords

Web page snapshot, Page rendering, Data extraction, RDF, SPARQL

Authors

BURGET, R.; SALEM, H.

Published

16 June 2023

Publisher

Springer Nature Switzerland AG

Place

Alicante

ISBN

978-3-031-34443-5

Book

Web Engineering - 23rd International Conference, ICWE 2023

Series

Lecture Notes in Computer Science

Pages from

355

Pages to

358

Number of pages

4

URL

https://link.springer.com/chapter/10.1007/978-3-031-34444-2_26

BibTeX

@inproceedings{BUT183805,
  author="Radek {Burget} and Hamza {Salem}",
  title="Creating Searchable Web Page Snapshots using Semantic Technologies",
  booktitle="Web Engineering - 23rd International Conference, ICWE 2023",
  year="2023",
  series="Lecture Notes in Computer Science",
  pages="355--358",
  publisher="Springer Nature Switzerland AG",
  address="Alicante",
  doi="10.1007/978-3-031-34444-2_26",
  isbn="978-3-031-34443-5",
  url="https://link.springer.com/chapter/10.1007/978-3-031-34444-2_26"
}