A pitch detection algorithm (PDA) is an algorithm designed to estimate the pitch or fundamental frequency of a quasiperiodic or oscillating signal, usually a digital recording of speech or a musical note or tone. This can be done in the time domain, the frequency domain, or both. PDAs are used in various contexts (e.g. phonetics, music information retrieval, speech coding, musical performance systems), so different demands may be placed upon the algorithm. There is as yet no single ideal PDA, so a variety of algorithms exist, most falling broadly into the classes given below.

A PDA typically estimates the period of a quasiperiodic signal, then inverts that value to give the frequency. One simple approach is to measure the distance between zero-crossing points of the signal (i.e. the zero-crossing rate). However, this does not work well with noisy data or with complicated waveforms composed of multiple sine waves with differing periods, because such signals can cross zero several times within one period. Nevertheless, there are cases in which zero-crossing can be a useful measure, e.g. in some speech applications where a single source is assumed. The algorithm's simplicity makes it "cheap" to implement.

More sophisticated approaches compare segments of the signal with other segments offset by a trial period to find a match. AMDF (average magnitude difference function), ASMDF (Average Squared Mean Difference Function), and other similar autocorrelation algorithms work this way. These algorithms can give quite accurate results for highly periodic signals. However, they have false detection problems (often "octave errors"), can sometimes cope badly with noisy signals (depending on the implementation), and, in their basic implementations, do not deal well with polyphonic sounds (which involve multiple musical notes of different pitches).
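
The trial-period comparison described above can be illustrated with a minimal AMDF sketch in plain Python. It is not a reference implementation: the function names, the 50 to 500 Hz search range, and the tolerance-based tie-break (which prefers the shortest lag among near-equal minima) are all assumptions made for this example. The tie-break is included because a perfectly periodic signal matches itself equally well at one period and at any multiple of it, which is one way octave errors can arise.

```python
import math


def amdf(frame, lag):
    """Average magnitude difference between the frame and a copy shifted by `lag` samples."""
    n = len(frame) - lag
    return sum(abs(frame[i] - frame[i + lag]) for i in range(n)) / n


def estimate_pitch_amdf(frame, sample_rate, f_min=50.0, f_max=500.0, tolerance=1e-6):
    """Return an estimated fundamental frequency in Hz for one frame.

    f_min, f_max and tolerance are illustrative assumptions, not standard values.
    """
    min_lag = max(1, int(sample_rate / f_max))
    max_lag = min(len(frame) - 1, int(sample_rate / f_min))
    scores = {lag: amdf(frame, lag) for lag in range(min_lag, max_lag + 1)}
    best_score = min(scores.values())
    # Among lags that match (almost) equally well, keep the shortest one,
    # so that a multiple of the true period is not chosen by accident.
    best_lag = min(lag for lag, score in scores.items() if score <= best_score + tolerance)
    # Invert the period (in samples) to obtain the frequency.
    return sample_rate / best_lag


# Synthetic test tone: a 200 Hz sine sampled at 8 kHz (exactly 40 samples per period).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(800)]
estimated_hz = estimate_pitch_amdf(tone, sample_rate)
```

Because the lag is restricted to whole samples, the estimate is only as fine as the sampling grid allows; on real recordings the minimum is also rarely this clean, which is where the noise and polyphony problems mentioned above show up.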
Current time-domain pitch detector algorithms tend to build upon the basic methods mentioned above, with additional refinements to bring the performance more in line with a human assessment of pitch.
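
As a baseline for the basic methods referred to above, the zero-crossing approach can be sketched as follows. This is a rough illustration under stated assumptions: the function name is hypothetical, only upward crossings are counted, and linear interpolation between neighbouring samples is an added detail used to place each crossing more precisely. It assumes a single clean source.

```python
import math


def estimate_pitch_zero_crossing(samples, sample_rate):
    """Estimate frequency in Hz from the mean spacing of upward zero crossings."""
    crossings = []
    for i in range(1, len(samples)):
        prev, curr = samples[i - 1], samples[i]
        if prev < 0.0 <= curr:
            # Linear interpolation gives a sub-sample position for the crossing.
            crossings.append(i - 1 + (-prev) / (curr - prev))
    if len(crossings) < 2:
        return None  # not enough crossings to measure a period
    # Mean distance between successive upward crossings = period in samples.
    mean_period = (crossings[-1] - crossings[0]) / (len(crossings) - 1)
    # Invert the period to obtain the frequency.
    return sample_rate / mean_period


# Synthetic test tone: a 220 Hz sine sampled at 8 kHz for half a second.
zc_sample_rate = 8000
zc_tone = [math.sin(2 * math.pi * 220 * n / zc_sample_rate) for n in range(4000)]
zc_estimate_hz = estimate_pitch_zero_crossing(zc_tone, zc_sample_rate)
```

On a pure sine this lands very close to the true frequency, but adding strong harmonics or noise introduces extra crossings within each period and the estimate degrades accordingly.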

