Publication detail

Big Data Network Flow Processing Using Apache Spark

JEŘÁBEK, K. RYŠAVÝ, O.

Original Title

Big Data Network Flow Processing Using Apache Spark

Type

conference paper

Language

English

Original Abstract

The increasing amount of traffic flows captured as a part of network monitoring activities makes the analysis more complicated. One of the goals for network traffic analysis is to identify malicious communication. In the paper, we present a new system for big data network flow classification and clustering. The proposed system is based on the popular big data engines such as Apache Spark and Apache Ignite. The conducted experiments demonstrate the feasibility of the proposed approach and show the possible scalability.

Keywords

Big Data, Network flows, Apache Spark, Cassandra, Apache Ignite

Authors

JEŘÁBEK, K.; RYŠAVÝ, O.

Released

30. 4. 2019

Publisher

Association for Computing Machinery

Location

Bukurešť

ISBN

978-1-4503-7636-5

Book

Proceedings of the 6th Conference on the Engineering of Computer Based Systems (ECBS 2019), 2019

Pages from

1

Pages to

9

Pages count

9

URL

BibTex

@inproceedings{BUT161450,
  author="Kamil {Jeřábek} and Ondřej {Ryšavý}",
  title="Big Data Network Flow Processing Using Apache Spark",
  booktitle="Proceedings of the 6th Conference on the Engineering of Computer Based Systems (ECBS 2019), 2019",
  year="2019",
  pages="1--9",
  publisher="Association for Computing Machinery",
  address="Bukurešť",
  doi="10.1145/3352700.3352709",
  isbn="978-1-4503-7636-5",
  url="https://www.fit.vut.cz/research/publication/11977/"
}