Detail aplikovaného výsledku

proof_platform: Platform for automated analysis and archiving of data from the web

KOCMAN, T.; POLČÁK, L.

Originální název

proof_platform: Platform for automated analysis and archiving of data from the web

Anglický název

proof_platform: Platform for automated analysis and archiving of data from the web

Druh

Software

Abstrakt

This platform enables scraping of web page content and storing the content in offline persistent database. The web crawl is performed using user-supplied regular expressions that may represent for example Torrent file names, Bitcoin wallets or keywords. Collected data may be used for law enforcement and other entitites, such as searching for information about a specific product. Archived data are stored in a database and available for later use without the possibility of modification due to web server updates.

Abstrakt aglicky

This platform enables scraping of web page content and storing the content in offline persistent database. The web crawl is performed using user-supplied regular expressions that may represent for example Torrent file names, Bitcoin wallets or keywords. Collected data may be used for law enforcement and other entitites, such as searching for information about a specific product. Archived data are stored in a database and available for later use without the possibility of modification due to web server updates.

Klíčová slova

Web crawling, web scrapping.

Klíčová slova anglicky

Web crawling, web scrapping.

Umístění

https://gitlab.com/tomaskocman/proof_platform

Licenční poplatek

K využití výsledku jiným subjektem je vždy nutné nabytí licence

www

Dokumenty