Real-Time Chord Detection in Audio Using Non-negative Matrix Factorization

Abstract

📜 Abstract

This paper proposes a real-time system for recognizing chords from audio, based on frame-wise estimation of chord templates from music data calculated using Non-negative Matrix Factorization. In continuous tracking of audio data, the system outputs the most probable chord successively for each frame. The proposed method requires detecting onsets of musical notes, which allows segmenting the audio into frames with suitable duration for analysis. Performance evaluation with test signals shows that the method provides satisfying results for real-time applications.

Description

✨ Summary

The paper introduces a method for real-time chord detection in audio signals using Non-negative Matrix Factorization (NMF). This approach allows the identification of musical chords from audio signals by analyzing the spectral content frame by frame. The system presented requires the detection of musical note onsets to appropriately frame the audio for analysis, which is critical for maintaining real-time performance. The method was tested and showed satisfactory results in terms of performance.

A web search reveals that the techniques discussed in this paper have been foundational in music information retrieval (MIR) systems, especially those focusing on audio content analysis. However, there are limited direct citations or explicit further developments stemming directly from this particular paper. It shares concepts that resonate with broader works on real-time audio processing and chord recognition. Similar NMF techniques have also been widely adopted in various audio and speech processing research.

For example, the work “Music Structure Analysis Using Chord Sequence Patterns and a Time-Continuous Diagram of Clusters of Chord Sequences” by Mauch and Dixon has been influenced by similar NMF techniques (Mauch, M. and Dixon, S., 2010). Although not a direct citation, similar methodologies are applied in real-time audio processing systems.

Overall, the paper is part of a broader set of research that has impacted the field of machine learning applications in music and audio signal processing.