layout

title

byline

arxiv

tags

summary

post

Now Playing: Continuous low-power music recognition

Agüera y Arcas et al

1711.10958

music

audio

signal-processing

fingerprinting

group:google

This method from a Google research uses local processing and hardware to fingerprint music and identify it from a database of songs.

The current state of the art music recognition systems generally transmit the fingerprint of the audio they receive to a remote server which contains a database of fingerprints against which it can be compared. This naturally requires a robust internet connection, and also suffers from the shortcoming that the server and database provider has access to a list of songs (and perhaps other metadata) relevant to each user.

The authors of this paper present a local processing pipeline that uses a secondary processor (a DSP) that only wakes the primary processor when music is detected. This is done using a computed log Mel feature vector of a windowed subset of the audio, which is trained on the same database used by the matching system.

Once music is detected, the digital signal processor sends a signal to wake the primary processor, which is then used to match the encountered fingerprint.

Aside from the obvious privacy advantages of this method, I also see this sort of technology being useful for domain-specific applications such as detecting the species of bird by its call or decoding aquatic mammal sounds by looking them up in an index in an environment where no internet connection is available.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2017-12-03-105.md

2017-12-03-105.md

Files

2017-12-03-105.md

Latest commit

History

2017-12-03-105.md

File metadata and controls