Feature extraction of musical content for automatic music transcription
The enormous growth of digital music databases has led to a comparable growth in the need for methods that help users organize and access such information. One area in particular that has seen much recent research activity is the use of automated technique ...
Short-term spectral features – and most notably Mel-Frequency Cepstral Coefficients (MFCCs) – are the most widely used descriptors of audio signals and are deployed in a majority of state-of-the-art Music Information Retrieval (MIR) systems. These descript ...
In this article, we introduce a novel approach for monaural source separation with the specific aim to separate a polyphonic musical recording into two main sources: a main instrument (or melody) track and an accompaniment track. To that aim, we propose to ...
The paper presents a two-layered system for learning and encoding a periodic signal and its application to a drumming task. The two layers are the dynamical system responsible for extracting the main frequency of the input signal, based on adaptive frequen ...
Multimodal signal processing is an important new field that processes signals from a variety of modalities (speech, vision, language, text) derived from one source, which aids human-computer and human-human interaction. The overarching theme of this book ...
This paper describes a new method for music onset detection. The novelty of the approach consists mainly of two elements: the time–frequency processing and the detection stages. The resonator time frequency image (RTFI) is the basic time–frequency analysis ...
With a novel, less classical approach to the subject, the authors have written a book with the conviction that signal processing should be taught to be fun. The treatment is therefore less focused on the mathematics and more on the conceptual aspects, the ...
Besides basis expansions, frame representations play a key role in signal processing. We thus consider the problem of frame domain signal processing, which is more complex and challenging than transform domain processing. Examples of such processing aboun ...
IEEE Service Center, 445 Hoes Lane, PO Box 1331, Piscataway, NJ 08855-1331, USA, 2010
This paper presents a computationally efficient method for polyphonic pitch estimation. The method employs the Fast Resonator Time-Frequency Image (RTFI) as the basic time-frequency analysis tool. The approach is composed of two main stages. First, a preli ...