Applied result detail

Tool for Distributed Extraction of Timestamped Events from Files

RYCHLÝ, M.; BURGET, R.

Original Title

Tool for Distributed Extraction of Timestamped Events from Files

English Title

Tool for Distributed Extraction of Timestamped Events from Files

Type

Software

Abstract

A tool for distributed extraction of timestamps from various files using extractors adapted from the Plaso engine to Apache Spark infrastructure. The files to extract are uploaded to distributed file-system HDFS and the extraction process is controlled by a Web service via its REST API. The tool is able to utilise efficiently a large distributed clusters.

Abstract in English

A tool for distributed extraction of timestamps from various files using extractors adapted from the Plaso engine to Apache Spark infrastructure. The files to extract are uploaded to distributed file-system HDFS and the extraction process is controlled by a Web service via its REST API. The tool is able to utilise efficiently a large distributed clusters.

Keywords

files, events, timestamps, extraction, distributed system

Key words in English

files, events, timestamps, extraction, distributed system

Location

https://github.com/nesfit/pyspark-plaso

Licence fee

Use of the result by another entity is possible without acquiring a license in some cases

www

Documents