Autoregressive Models of Amplitude Modulations in Audio Compression

We present a scalable medium bit-rate wide-band audio coding technique based on frequency domain linear prediction (FDLP). FDLP is an efficient method for representing the long-term amplitude modulations of speech/audio signals using autoregressive models. For the proposed audio codec, relatively long temporal segments (1000 ms) of the input audio signal are decomposed into a set of critically sampled sub-bands using a quadrature mirror filter (QMF) bank. The technique of FDLP is applied on each sub-band to model the sub-band temporal envelopes. The residual of the linear prediction, which represents the frequency modulations in the sub-band signal [1], are encoded and transmitted along with the envelope parameters. These steps are reversed at the decoder to reconstruct the signal. The proposed codec utilizes a simple signal independent non-adaptive compression mechanism for a wide class of speech and audio signals. The subjective and objective quality evaluations show that the reconstruction signal quality for the proposed FDLP codec compares well with the state-of-the-art audio codecs in the 32-64 kbps range.

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Autoregressive Models of Amplitude Modulations in Audio Compression

Graph Chatbot

Chattez avec Graph Search

Acoustical Features as Knee Health Biomarkers: A Critical Analysis

Reticular chemistry for the rational design of mechanically robust mesoporous merged-net metal-organic frameworks

Complex-Valued-Autoencoder for Structural Health Monitoring with Frequency Modulated Continuous Wave Radar

Acoustical Features as Knee Health Biomarkers: A Critical Analysis

Reticular chemistry for the rational design of mechanically robust mesoporous merged-net metal-organic frameworks

Complex-Valued-Autoencoder for Structural Health Monitoring with Frequency Modulated Continuous Wave Radar