Přístupnostní navigace
E-application
Search Search Close
Bachelor's Thesis
Author of thesis: Ing. Peter Horečný
Acad. year: 2017/2018
Supervisor: Ing. Jan Mašek, Ph.D.
Reviewer: Ing. Martin Rajnoha, Ph.D.
The goal of this thesis was to create laboratory excercises for subject „Parallel data processing“, which will introduce options and capabilities of Apache Spark technology to the students. The excercises focus on work with basic operations and data preprocessing, work with concepts and algorithms of machine learning. By following the instructions, the students will solve real world situations problems by using algorithms for linear regression, classification, clustering and frequent patterns. This will show them the real usage and advantages of Spark. As an input data, there will be databases of czech and slovak companies with a lot of information provided, which need to be prepared, filtered and sorted for next processing in the first excercise. The students will also get known with functional programming, because the are not whole programs in excercises, but just the pieces of instructions, which are not repeated in the following excercises. They will get a comprehensive overview about possibilities of Spark by getting over all the excercices.
Apache Hadoop, Apache Spark, classification, linear regression, parallel data processing, frequent patterns, machine learning, big data, clustering
Date of defence
14.06.2018
Result of the defence
Defended (thesis was successfully defended)
Grading
B
Process of defence
Stručně popište koncept funkcionálního programování, porovnání s jinými přístupy, využití, výhody a nevýhody. –Student vysvětlil otázku.
Language of thesis
Slovak
Faculty
Fakulta elektrotechniky a komunikačních technologií
Department
Department of Telecommunications
Study programme
Electrical, Electronic, Communication and Control Technology (EECC Bc.)
Field of study
Teleinformatics (B-TLI)
Composition of Committee
prof. Ing. Dan Komosný, Ph.D. (předseda) prof. Mgr. Pavel Rajmic, Ph.D. (místopředseda) Ing. Vlastimil Člupek, Ph.D. (člen) Ing. Jan Mašek, Ph.D. (člen) Ing. Petr Kříž (člen) Ing. Jaroslav Vrána, Ph.D. (člen)
Supervisor’s reportIng. Jan Mašek, Ph.D.
Grade proposed by supervisor: A
Reviewer’s reportIng. Martin Rajnoha, Ph.D.
Grade proposed by reviewer: C
Responsibility: Mgr. et Mgr. Hana Odstrčilová