Slides: Tutorial on Recurrent Neural Networks

Although I’m now a researcher at Dolby Laboratories, I’m still collaborating with some universities in Barcelona — where I’ll keep teaching deep learning for music and audio. In this context, and given the importance of the gradient vanishing/explode problem in deep neural networks, this week I’ll be teaching recurrent neural networks to the Master in Sound and Music Computing students of the Universitat Pompeu Fabra.

Here the slides!

My favourite ISMIR 2019 papers!

This year’s ISMIR was in Delft, the Netherlands. It seems like the community is starting to realise that the technologies developed by the ISMIR community can have an impact to our society – because they are starting to work! During the first days of the conference, many conversations were focusing on exploring ways to positively impact society. On the other side, technology-wise, we have seen (i) many people studying how to use musical domain knowledge to disentangle/structure/learn useful neural representations for many music applications, and (ii) many attention-based neural architectures.

Continue reading

Interspeech2019: my highlights

This was my first Interspeech, and I was interested in understanding the field from the eyes of a “speech researcher” — instead of looking at it from the music/audio perspective, that is my field of expertise. After attending to Interspeech, I realized their sensibility for languages and how diverse is the community. The best of the conference? That one of the longest slides in the world was in town.

Continue reading