Course detail
Knowledge Discovery in Databases
FIT-ZZDAcad. year: 2019/2020
- The deepening of basics in KDD - basics of methods of data preprocessing (statistics quantities used in data summarization, approaches to data cleaning, transformation and reduction), basics of data warehousing, basic methods and algorithms of mining frequent items and patterns and association rules (Apriori algorithm, FP-tree, multi-level association rules, mining multidimensional association rules from relational databases), basic methods and algorithms of classification (decision tree, Bayesian classification, using neural networks, SVM) and prediction (linear and nonlinear regression), basic methods and algorithms of cluster analysis (distance of data, partitioning methods, hierarchical methods, CF-tree, density-based methods, grid- and model-based methods).
- Advanced data mining techniques - advanced techniques of data mining in 'classic' data sources, mining in data streams, time series and sequences, mining in biological data; mining in graphs, multirelational data mining, mining in object, spatial and multimedia data, mining in text, mining on the Web.
Language of instruction
Czech, English
Mode of study
Not applicable.
Guarantor
Department
Learning outcomes of the course unit
Students get a broad, yet in-depth overview of the field of data mining and knowledge discovery. They get a deeper view mainly in the field related to the topic of their thesis.
Prerequisites
Students should have basic knowledge in statistics, database systems, information theory, machine learning, neural networks. It is assumed that they have passed some subject on KDD.
Co-requisites
Not applicable.
Planned learning activities and teaching methods
Not applicable.
Assesment methods and criteria linked to learning outcomes
Control questions during consultations.
Course curriculum
Not applicable.
Work placements
Not applicable.
Aims
To deepen students' knowledge in the field of knowledge discovery in databases and other data sources (KDD) with special focus on theoretical foundations of the used techniques, algorithms and models.
Specification of controlled education, way of implementation and compensation for absences
Consultations, elaboration of a given topic, written report and presentation on the final seminar.
Recommended optional programme components
Not applicable.
Prerequisites and corequisites
Not applicable.
Basic literature
Not applicable.
Recommended reading
Aggarwal, Ch.C. (ed.): Data Streams: Models and Algorithms. Advances in Database Systems. Springer, 2006, 358 p. ISBN 0387287590.
Bishop, CH. M.: Pattern Recognition and Machine Learning. Springer, 2006, 738 p. ISBN 978-0-387-31073-2.
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Second Edition. Elsevier Inc., 2006, 770 p. ISBN 1-55860-901-3.
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Third Edition. Elsevier Inc., 2012, 703 p. ISBN 978-0-12-381479-1.
Papers in journals and conference proceedings (including those in ACM Digital library, IEEE Digital library and other electronic sources).
Bishop, CH. M.: Pattern Recognition and Machine Learning. Springer, 2006, 738 p. ISBN 978-0-387-31073-2.
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Second Edition. Elsevier Inc., 2006, 770 p. ISBN 1-55860-901-3.
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Third Edition. Elsevier Inc., 2012, 703 p. ISBN 978-0-12-381479-1.
Papers in journals and conference proceedings (including those in ACM Digital library, IEEE Digital library and other electronic sources).
Classification of course in study plans
Type of course unit
Lecture
39 hod., optionally
Teacher / Lecturer
Syllabus
- Data preprocessing.
- Data warehousing.
- Asociation analysis.
- Classification and prediction.
- Cluster analysis.
- Advanced data mining in 'classic' data sources.
- Mining in data streams.
- Data mining in time series and sequences.
- Mining in biological data.
- Data mining in graph structures.
- Mining in object, spatial and multimedia data.
- Text mining and Web mining.
- Mining moving object data.
Project
13 hod., compulsory
Teacher / Lecturer
Syllabus
- Reading up and treatment of a selected topic concerning knowledge discovery in a field related to the student's PhD thesis.
Guided consultation in combined form of studies
26 hod., optionally
Teacher / Lecturer