Přístupnostní navigace
E-application
Search Search Close
Master's Thesis
Author of thesis: Bc. Jan Mostecký
Acad. year: 2025/2026
Supervisor: Ing. Štěpán Miklánek, Ph.D.
Reviewer: Ing. Matěj Ištvánek, Ph.D.
This master's thesis addresses the problem of audio production style transfer using neural networks. The objective is to design and implement a~model capable of applying a~chain of audio effects to a~raw recording so that the resulting sound matches the character of a~reference track. The theoretical section focuses on the principles of neural networks and their application in audio signal processing, provides an overview of audio style transfer methods, and describes three key audio effects for the mastering process -- parametric equalizer, compressor, and saturator -- including their mathematical models and implementation possibilities. The practical section describes the design of a~system based on differentiable effects implemented in the PyTorch library, including experimental validation of its functionality on test audio datasets. The results demonstrate that by optimizing the effect parameters, a~partial transfer of the reference recording's character can be achieved. Furthermore, the limitations of machine learning for this specific problem are discussed, alongside potential avenues for future development.
Equiliser, Compressor, Saturator, Neural network, Mashine Learning, Style Transfer
Date of defence
11.06.2026
Result of the defence
Defended (thesis was successfully defended)
Grading
B
Process of defence
Student prezentoval výsledky své práce a komise byla seznámena s posudky. Student obhájil diplomovou práci a odpověděl na otázky členů komise a oponenta. Otázky oponenta diplomové práce: Jak by bylo možné architekturu modelu upravit tak, aby lépe zachycovala dynamické vlastnosti komprese? Jak by bylo možné navržený systém doplnit o subjektivní poslechové hodnocení? Jaký přínos má modulární trénování jednotlivých expertů oproti trénování celého efektového řetězce najednou? Jaká byla velikost datasetu? Jak dlouho trvalo trénování systému?
Language of thesis
Czech
Faculty
Fakulta elektrotechniky a komunikačních technologií
Department
Department of Telecommunications
Study programme
Audio Engineering (MPC-AUD)
Specialization
Audio Production and Recording (AUDM-ZVUK)
Composition of Committee
prof. Ing. Zdeněk Smékal, CSc. (předseda) Ing.MgA. Edgar Mojdl, Ph.D. (místopředseda) Dr. Ing. Libor Husník (člen) Ing. Václav Mach, Ph.D. (člen) Ing. Matěj Ištvánek, Ph.D. (člen)
Supervisor’s reportIng. Štěpán Miklánek, Ph.D.
Grade proposed by supervisor: B
Reviewer’s reportIng. Matěj Ištvánek, Ph.D.
Grade proposed by reviewer: B
Responsibility: Mgr. et Mgr. Hana Odstrčilová