Přístupnostní navigace
E-application
Search Search Close
Master's Thesis
Author of thesis: Bc. Přemek Janda
Acad. year: 2025/2026
Supervisor: prof. Ing. Adam Herout, Ph.D.
Reviewer: Ing. Michal Hradiš, Ph.D.
While humans can intuitively estimate continuous values from visuals such as glyphs, for computer vision systems, this task presents a significant challenge, especially under limitations such as sparse or out-of-distribution input data. To tackle this problem, this thesis proposes a deep learning approach framing glyph analysis as a continuous regression task. The work introduces a generation pipeline for rasterized malleable glyphs and evaluates architectures based on Convolutional Neural Networks (CNN) and Vision Transformers (ViT). A core contribution of this work is the design of a VAE-assisted architecture utilizing a probabilistically regularized latent space, which decouples the geometric identity of a glyph from its magnitude. Through a series of experiments, the thesis evaluates the perception and model's capacity for interpolation and zero-shot transfer. The final results confirm that lightweight CNN backbones coupled with structured latent space division yield the highest stability and generalization performance.
Malleable glyphs, Computer vision, Deep learning, Information visualization, Value estimation, Regression, Synthetic data, CNN, Vision Transformer
Date of defence
24.06.2026
Result of the defence
Defended (thesis was successfully defended)
Grading
C
Process of defence
Student nejprve prezentoval výsledky, kterých dosáhl v rámci své práce. Komise se poté seznámila s hodnocením vedoucího a posudkem oponenta práce. Student následně odpověděl na otázky oponenta a na další otázky přítomných. Komise se na základě posudku oponenta, hodnocení vedoucího, přednesené prezentace a odpovědí studenta na položené otázky rozhodla práci hodnotit stupněm C.
Topics for thesis defence
Language of thesis
English
Faculty
Fakulta informačních technologií
Department
Department of Computer Graphics and Multimedia
Study programme
Information Technology and Artificial Intelligence (MITAI)
Specialization
Computer Vision (NVIZ)
Composition of Committee
prof. Ing. Adam Herout, Ph.D. (předseda) prof. Ing. Martin Čadík, Ph.D. (místopředseda) doc. RNDr. Milan Češka, Ph.D. (člen) prof. Dr. Ing. Pavel Zemčík, dr. h. c. (člen) Ing. David Bařina, Ph.D. (člen) Ing. Tomáš Milet, Ph.D. (člen)
Supervisor’s reportprof. Ing. Adam Herout, Ph.D.
Řešiteli se podařilo odvést množství práce a navrhnout několik postupů pro experimentální práci s počítačovým viděním na malleable glyphs. Řešení by možná prospělo intenzivnější konzultování s vedoucím.
Zadání bylo výzkumného charakteru; řešitel měl přijít s inovativními postupy k řešení nového problému. Řešiteli se podařilo navrhnout několik zajímavých přístupů a jeho výstupy jsou hodnotné.
Práce byla dokončena včas. I při dokončování pracoval řešitel samostatně a znění technické zprávy téměř nekonzultoval.
N/A
Řešitel dobře proniknul do problematiky, seznámil se s množstvím postupů strojového učení a počítačového vidění a osvojil si praktickou práci v této oblasti.
Řešitel pracoval hodně samostatně a na konzultace docházel ne často a až po urgencích.
Grade proposed by supervisor: C
Reviewer’s reportIng. Michal Hradiš, Ph.D.
The student explored a novel area and tested interesting approaches in large number of experiments. However, the general motivation is not clear, some experiments may be flawed and the presentation of results is rather confusing. The text is hard to follow and related work is missing for TTA and auxiliary reconstruction loss.
Evaluation level: zadání splněno
Evaluation level: přesahuje obvyklé rozmezí
The thesis is too long; it could have been shorter and more concise.
The text is often rather confusing, imprecise and hard to follow. Sometimes it is not clear if the text explains "general" ideas or some specific instances and it does not distinguish precisely between own and previous ideas. The structure is often confusing.
Selected specific comments (only some):
The thesis contains many high-quality figures and tables. It is also generally well formatted. On the other hand, it contains rather larger amount of typos and wrong or incomprehensible sentences. Typographic issues:
The thesis references 61 high quality sources. However, most sources cover topics of either glyphs and visual perception or various neural networks (image backbone architectures, their pre-training, generative models, VAE). The thesis is missing literature review on how to solve related tasks to the one addressed (e.g. visual object ranking, domain adaptation) and it does not mention sources for techniques used (e.g. Test Time Adaptation, auxiliary reconstruction losses). Sometimes, it is not precisely clear which ideas are novel and original. Some parts are missing sources.
Few specific issues not mentioned before:
Student performed many experiments including a human study. He also proposed and tested two interesting ideas - Test-Time Adaptation and auxiliary reconstruction loss with latent variable disentanglement. On the other hand, the presentation of results is rather confusing and some of the experiments may be poorly designed or flawed.
The student explored a novel area and tested interesting approaches. However, the motivation is not clear, some experiments may be flawed and the presentation of results is rather confusing.
Evaluation level: obtížnější zadání
The thesis addresses a slightly unusual and unexplored topic, where the student had to apply his own judgement which directions are worth pursuing including methods, evaluation methodologies, and experiments.
Grade proposed by reviewer: C
Responsibility: Mgr. et Mgr. Hana Odstrčilová