We investigated various upsampling layers to consolidate the ideas we introduced in our previous paper. We benchmarked a large set of upsampling layers for music source separation: different transposed and subpixel convolution setups, different interpolation upsamplers (including two novel layers based on stretch and sinc interpolation), and different wavelet-based upsamplers (including a novel learnable wavelet layer).
I had mixed feelings about this ISMIR: on the one hand, I was disappointed to attend yet another virtual ISMIR – buuuuuut, on the other hand, it was nice to meet you all! ISMIR is such a vibrant and enthusiastic community that it is always great to meet each other – even if only virtually! Still... I guess we all agree that ISMIR was much better when we had the possibility to jam on a boat! 🙂
This ISMIR, I presented a retrospective of my recent work at Dolby during an “Industry Presentation”. In this presentation I try to answer questions like: do you publish? what do you work on? do you hire interns? I do so by going through our recent papers.
PLAAE (packet loss adversarial auto-encoder) is our proposal for packet loss concealment in a non-autoregressive fashion. Our goal is to reconstruct missing speech packets until a new (real) packet is received in a video call. Our end-to-end non-autoregressive adversarial auto-encoder especially shines at long-term predictions, beyond 60 ms. The paper has been accepted for presentation at WASPAA 2021! Check out our arXiv pre-print.
While I never had the chance to attend the Web Audio Conference (WAC), I have followed the recent developments of the Web Audio API with great interest. But.. this time I couldn’t resist going – since Dolby is the main sponsor of the event, and the conference is organised by friends in my city!
As a personal curiosity, the first WAC (back in 2015!) was organized by IRCAM when I was a research intern there – and I remember having the feeling that audio processing in the browser was going to be THE THING. Five years later, we are starting to see the social impact of the ideas that were introduced back then.