Musical Audio Synthesis Using Autoencoding Neural Nets

With a suitable network topology and well-tuned hyperparameters, artificial neural networks (ANNs) can be trained to map low-level audio features to one or more higher-level representations. Such networks are commonly applied to classification and regression tasks. In this work we instead repurpose autoencoding neural networks as musical audio synthesizers. We present an interactive synthesis system built on feedforward ANNs: a network is trained on frames of low-level features, and an autoencoder learns a high-level representation of the musical audio. The system lets a performer manipulate the parameters of the model directly and generates musical audio in real time. This work thus proposes the use of neural networks for creative musical applications.
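To make the pipeline concrete, below is a minimal numpy sketch of the idea, not the authors' released code: the class name `FrameAutoencoder`, the layer sizes, the sigmoid activations, and the learning rate are all illustrative assumptions. An autoencoder is trained to reconstruct frames of low-level features (here, stand-ins for normalized magnitude-spectrum frames), and synthesis then consists of setting the hidden-layer activations directly and decoding them into new frames.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

class FrameAutoencoder:
    """Hypothetical one-hidden-layer autoencoder over feature frames."""

    def __init__(self, n_bins, n_hidden, lr=0.1):
        self.W1 = rng.normal(0.0, 0.01, (n_bins, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(0.0, 0.01, (n_hidden, n_bins))
        self.b2 = np.zeros(n_bins)
        self.lr = lr

    def encode(self, x):
        return sigmoid(x @ self.W1 + self.b1)

    def decode(self, h):
        return sigmoid(h @ self.W2 + self.b2)

    def train_step(self, x):
        # One full-batch gradient step on mean squared reconstruction error.
        h = self.encode(x)
        y = self.decode(h)
        d_y = (y - x) * y * (1.0 - y)            # output-layer delta
        d_h = (d_y @ self.W2.T) * h * (1.0 - h)  # hidden-layer delta
        n = len(x)
        self.W2 -= self.lr * (h.T @ d_y) / n
        self.b2 -= self.lr * d_y.mean(axis=0)
        self.W1 -= self.lr * (x.T @ d_h) / n
        self.b1 -= self.lr * d_h.mean(axis=0)
        return float(((y - x) ** 2).mean())

# Training: X stands in for magnitude-spectrum frames scaled to [0, 1].
n_bins, n_hidden = 257, 16
X = rng.random((512, n_bins))
ae = FrameAutoencoder(n_bins, n_hidden)
for _ in range(200):
    loss = ae.train_step(X)

# Synthesis: instead of encoding input, set the hidden activations
# directly (the knobs a performer would manipulate) and decode them
# into a new feature frame.
h_controls = rng.random((1, n_hidden))
frame = ae.decode(h_controls)  # one synthetic spectral frame
```

In the full system, decoded frames would presumably be inverted back to time-domain audio (e.g., via an inverse STFT with a suitable phase reconstruction) and streamed continuously, which is what makes the interaction real-time.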

More information can be found in Sarroff & Casey (2014), and the code is available on GitHub.

  1. Sarroff, A. M., & Casey, M. (2014). Musical audio synthesis using autoencoding neural nets. In Proceedings of the Joint 40th International Computer Music Conference (ICMC) and 11th Sound & Music Computing Conference (SMC).