Detail publikačního výsledku

Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-Based and Adversarial approaches

MOLINER, E.; ŠVENTO, M.; JUVELA, L.; WRIGHT, A.; RAJMIC, P.; VÄLIMÄKI, V.

Originální název

Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-Based and Adversarial approaches

Anglický název

Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-Based and Adversarial approaches

Druh

Stať ve sborníku v databázi WoS či Scopus

Originální abstrakt

Accurately estimating nonlinear audio effects without access to paired input-output signals remains a challenging problem. This work studies unsupervised probabilistic approaches for solving this task. We introduce a method, novel for this application, based on diffusion generative models for blind system identification, enabling the estimation of unknown nonlinear effects using black- and gray-box models. This study compares this method with a previously proposed adversarial approach, analyzing the performance of both methods under different parameterizations of the effect operator and varying lengths of available effected recordings. Through experiments on guitar distortion effects, we show that the diffusion-based approach provides more stable results and is less sensitive to data availability, while the adversarial approach is superior at estimating more pronounced distortion effects. Our findings contribute to the robust unsupervised blind estimation of audio effects, demonstrating the potential of diffusion models for system identification in music technology.

Anglický abstrakt

Accurately estimating nonlinear audio effects without access to paired input-output signals remains a challenging problem. This work studies unsupervised probabilistic approaches for solving this task. We introduce a method, novel for this application, based on diffusion generative models for blind system identification, enabling the estimation of unknown nonlinear effects using black- and gray-box models. This study compares this method with a previously proposed adversarial approach, analyzing the performance of both methods under different parameterizations of the effect operator and varying lengths of available effected recordings. Through experiments on guitar distortion effects, we show that the diffusion-based approach provides more stable results and is less sensitive to data availability, while the adversarial approach is superior at estimating more pronounced distortion effects. Our findings contribute to the robust unsupervised blind estimation of audio effects, demonstrating the potential of diffusion models for system identification in music technology.

Klíčová slova

diffusion model; audio effect modelling; inverse problem;

Klíčová slova v angličtině

diffusion model; audio effect modelling; inverse problem;

Autoři

MOLINER, E.; ŠVENTO, M.; JUVELA, L.; WRIGHT, A.; RAJMIC, P.; VÄLIMÄKI, V.

Vydáno

09.09.2025

Nakladatel

Università Politecnica delle Marche

Místo

Ancona

Kniha

Proceedings of the International Conference on Digital Audio Effects (DAFx)

Periodikum

Proceedings of the International Conference on Digital Audio Effects (DAFx)

Stát

Rakouská republika

Strany od

366

Strany do

373

Strany počet

8

URL

BibTex

@inproceedings{BUT197846,
  author="Eloi {Moliner} and Michal {Švento} and Lauri {Juvela} and Alec {Wright} and Pavel {Rajmic} and Vesa {Välimäki}",
  title="Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-Based and Adversarial approaches",
  booktitle="Proceedings of the International Conference on Digital Audio Effects (DAFx)",
  year="2025",
  journal="Proceedings of the International Conference on Digital Audio Effects (DAFx)",
  pages="366--373",
  publisher="Università Politecnica delle Marche",
  address="Ancona",
  issn="2413-6689",
  url="https://www.scopus.com/pages/publications/105028968011?origin=resultslist"
}