This paper summarises the latest work I did at Dolby. We study a single general audio source separation (GASS) model trained to separate speech, music, and sound events in a supervised fashion with a large-scale dataset.
In this work lead by Joan, we explore the use of generative models that operate on top of a parametric stereo domain to generate plausible stereo samples from mono audio signals.
Check it on arXiv!
As an attendee at Sónar+D 2023, I witnessed the cutting-edge advancements and trends in AI art. This renowned event brought together artists, technologists, and enthusiasts who collectively explored the intersection of artificial intelligence and creativity. From cool installations to thought-provoking discussions, Sónar 2023 provided a platform to delve into AI art world. In this blog post, I’ll cover the key trends and insights I observed at Sónar+D 2023!