This was my first Interspeech, and I was interested in understanding the field from the eyes of a “speech researcher” — instead of looking at it from the music/audio perspective, that is my field of expertise. After attending to Interspeech, I realized their sensibility for languages and how diverse is the community. The best of the conference? That one of the longest slides in the world was in town.Continue reading
The musicnn library (pronounced as “musician”) employs deep convolutional neural networks to automatically tag songs, and the models that are included achieve the best scores in public evaluation benchmarks. These state-of-the-art models have been released as an open-source library that can be easily installed and used. For example, you can use musicnn to tag this emblematic song from Muddy Waters — and it will predominantly tag it as blues!