Course detail
Data Analysis and Visualization in Python
FIT-IZVAcad. year: 2021/2022
The aim of the course is to acquaint students with the problems of data acquisition, processing, analysis and visualization using the cross-platform scripting language Python. It has a sophisticated ecosystem offering a rich spectrum of extension libraries, either in the form of native code or in terms of performance of efficient extensions implemented in C / C ++.
During the lectures students will learn Python constructs, methods of data acquisition, storage and manipulation, possibilities of advanced computations in numerical and symbolic level and visualization of acquired data. In this course, students will also gain an overview of the properties of techniques for advanced analysis of data dependencies and their applications for various data. Finally, Python will be expanded to include custom designs and techniques to effectively overcome the disadvantages of the interpreted language for performance-oriented applications. In the practical part (project), students will go through all stages of large data processing - from the design stage, through processing to subsequent analysis and visualization.
Language of instruction
Number of ECTS credits
Mode of study
Guarantor
Department
Learning outcomes of the course unit
In addition to general knowledge of basic data processing techniques, the student will gain an overview of the effective execution of critical parts of the program, extension of the language with its own modules written in C / C ++ or the problematic of installing libraries in an isolated environment or containers.
At the end of the course students should understand how to effectively obtain, analyze and visualize data of various extent. The knowledge can then be used to solve non-trivial engineering and scientific tasks or to evaluate data for management and decision-making purposes.
Prerequisites
- IZP - Introduction to Programming Systems
- ILG - Linear algebra
- IPP - Principles of Programming Languages
- IPT - Probability and Statistics
Co-requisites
Planned learning activities and teaching methods
Assesment methods and criteria linked to learning outcomes
Course curriculum
Work placements
Aims
Specification of controlled education, way of implementation and compensation for absences
Recommended optional programme components
Prerequisites and corequisites
- recommended prerequisite
Principles of Programming Languages - recommended prerequisite
Introduction to Programming Systems - recommended prerequisite
Linear Algebra - recommended prerequisite
Probability and Statistics
Basic literature
Recommended reading
Mark Pilgrim: Ponořme se do Pythonu 3 (ISBN: 978-80-904248-2-1, dostupné online)
Robert Johansson: Numerical Python (2019, ISBN: 978-1-4842-4245-2)
Samir Madhavan: Mastering Python for Data Science (ISBN: 978-17-843901-5-0)
Elearning
Classification of course in study plans
Type of course unit
Lecture
Teacher / Lecturer
Syllabus
- Introduction to Language I
- Introduction to Language II
- Data acquisition and data persistence
- Effective implementation of operations over n-dimensional fields
- Tools for advanced data manipulation
- Basic approaches to data visualization
- Basic methods of data and data dependency analysis
- Advanced approaches to data visualization
- Advanced methods of data and data dependency analysis
- Work with image data and possibilities of data presentation
- Advanced operations over time series
- Symbolic domain calculations
- Code acceleration capabilities for HPC needs
Elearning