Přístupnostní navigace
E-application
Search Search Close
Master's Thesis
Author of thesis: Bc. Adam Leznar
Acad. year: 2025/2026
Supervisor: Ing. Pavel Sikora, Ph.D.
Reviewer: Ing. Petr Kříž
Thesis addresses the use of artificial intelligence and computer vision methods for the automation of waste management processes. The objective of this work is to create a custom dataset, train selected neural network models, evaluate the training process, and subsequently develop a software implementation of these models. For the research, the neural networks YOLOv11 and Mask2Former were selected due to the fundamental differences in their architectural design. This enables a comparison between the convolutional architecture of YOLOv11, designed primarily for real-time detection, and Mask2Former, which represents a transformer-based architecture employing a masked attention mechanism. To compare the properties of these architectures, two datasets containing images of various types of waste were compiled. The first dataset was downloaded from the Roboflow platform and consisted of simple images featuring a single object of interest. The second dataset contained real-world images of waste containers and their contents, capturing scenes in which objects frequently overlap, thus more faithfully reflecting the conditions of potential industrial deployment. The results indicate that Mask2Former achieves superior performance specifically on visually challenging classes, such as plastics, owing to their transparency and often highly variable and deformed shapes. However, the high segmentation accuracy of Mask2Former comes at the cost of substantially greater demands on computational memory and longer inference times, which hinders its deployment at industrial scale. The computational requirements of YOLOv11 are considerably lower and its inference time is shorter, though this comes at the cost of noisier object detections. These results could potentially be improved by expanding the custom dataset, as the current number of images is below what is typically required for training networks of this complexity. A further finding highlighted the importance of dataset content, as models trained on images downloaded from Roboflow exhibited virtually no generalization capability. Based on these findings, software tools were implemented separately for each architecture, enabling users to upload images or videos and perform segmentation. The software incorporates confidence threshold adjustment. This diploma thesis may serve as a foundation for further development of systems and datasets in the field of automated waste management.
Neural Networks, Waste, Computer Vision, Segmentation, Artificial Intelligence.
Date of defence
11.06.2026
Result of the defence
Defended (thesis was successfully defended)
Grading
C
Process of defence
Student prezentoval výsledky své práce a komise byla seznámena s posudky. Student obhájil diplomovou práci s výhradami a odpověděl na otázky členů komise a oponenta. Otázky oponenta diplomové práce: Popište rozdíl mezi segmentací s klasifikací a prostou detekcí s klasifikací objektů v obrazu. Jaké jsou výhody a nevýhody obou přístupů vzhledem k vašemu zadání? Uvádíte, že řešením některých problémů s klasifikací by bylo rozšíření vlastní trénovací sady, viz abstrakt. Jak velká by tedy tato sada měla ideálně být vzhledem ke složitosti architektur testovaných modelů? Proč jste nespojil vlastní data s daty z Roboflow a nepoužil je pro trénování a vyhodnocení používaných modelů? Co je motivací pro výběr modelů? Proč jste neprováděl augumentaci vámi vytvořené datové sady?
Language of thesis
Czech
Faculty
Fakulta elektrotechniky a komunikačních technologií
Department
Department of Telecommunications
Study programme
Audio Engineering (MPC-AUD)
Specialization
Audio Production and Recording (AUDM-ZVUK)
Composition of Committee
prof. Ing. Zdeněk Smékal, CSc. (předseda) Ing.MgA. Edgar Mojdl, Ph.D. (místopředseda) Dr. Ing. Libor Husník (člen) Ing. Václav Mach, Ph.D. (člen) Ing. Matěj Ištvánek, Ph.D. (člen)
Supervisor’s reportIng. Pavel Sikora, Ph.D.
Grade proposed by supervisor: C
Reviewer’s reportIng. Petr Kříž
Grade proposed by reviewer: C
Responsibility: Mgr. et Mgr. Hana Odstrčilová