Doctoral thesis (Dissertations and theses)
Sound Synthesis Parameters: Matching and Interpolation using Generative Neural Networks
Le Vaillant, Gwendal
2025
 

Files


Full Text
PhD Thesis - Gwendal Le Vaillant - v2.pdf
Author postprint (15.27 MB)
Download

All documents in ORBi UMONS are protected by a user license.

Send to



Details



Abstract :
[en] This thesis addresses key challenges in manipulating sound synthesizers, which have shaped modern music production and have defined new music genres. Specifically, it focuses on the tasks of sound matching and sound morphing using conventional synthesizers. Given the complexity of manipulating extensive parameter sets, called presets, the study introduces a scalable, synthesizer-agnostic approach to sound matching for creating presets from target sounds. The model is validated using a widespread frequency modulation synthesizer with a large set of interdependent parameters and a semi-modular architecture. This thesis also introduces a novel variational auto-encoder model tailored for black-box synthesizer preset interpolation, which enables the intuitive generation of new presets from pre-existing ones. Leveraging multi-head self-attention networks, the model learns latent representations of synthesis parameters, aligning these with perceived timbre dimensions through attribute-based regularization. Objective evaluations of timbre linearity and smoothness % , computed on a broad set of audio features, show that the auto-encoder is able to gradually transition between diverse presets. % , surpassing linear parametric interpolation methods. The model's performance is also demonstrated through subjective experiments. It can be adapted to any conventional non-differentiable synthesizer, and can perform other tasks such as extrapolations and automatic modulations of presets. These contributions advance sound design by offering automated, generative tools for sound matching and preset interpolation.
Disciplines :
Computer science
Author, co-author :
Le Vaillant, Gwendal  ;  Université de Mons - UMONS > Faculté Polytechnique > Service Information, Signal et Intelligence artificielle
Language :
English
Title :
Sound Synthesis Parameters: Matching and Interpolation using Generative Neural Networks
Defense date :
20 February 2025
Institution :
UMONS - Université de Mons [Faculté Polytechnique], Mons, Belgium
Degree :
Doctorat en sciences de l'ingénieur et technologie
Promotor :
Dutoit, Thierry ;  Université de Mons - UMONS > Faculté Polytechnique > Service Information, Signal et Intelligence artificielle
President :
Gosselin, Bernard  ;  Université de Mons - UMONS > Faculté Polytechnique > Service Information, Signal et Intelligence artificielle
Secretary :
Siebert, Xavier  ;  Université de Mons - UMONS > Faculté Polytechnique > Service de Mathématique et Recherche opérationnelle
Jury member :
Girin, Laurent;  UGA - Universite Grenoble Alpes > GIPSA-lab
Peeters, Geoffroy;  Télécom Paris
Giot, Rudi;  HE2B - Haute-École Bruxelles-Brabant > ISIB - Institut Supérieur Industriel de Bruxelles
Research unit :
F105 - Information, Signal et Intelligence artificielle
Research institute :
R450 - Institut NUMEDIART pour les Technologies des Arts Numériques
Available on ORBi UMONS :
since 24 March 2025

Statistics


Number of views
221 (8 by UMONS)
Number of downloads
754 (3 by UMONS)

Bibliography


Similar publications



Contact ORBi UMONS