Publication

SYSTEM FUSION AND SPEAKER LINKING FOR LONGITUDINAL DIARIZATION OF TV SHOWS

Related publications (36)

About
Privacy
Disclaimer

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

SYSTEM FUSION AND SPEAKER LINKING FOR LONGITUDINAL DIARIZATION OF TV SHOWS

Graph Chatbot

Chat with Graph Search

Perception and Reproduction of Auditory Spatial Impression

Progress report of a project in very low bit-rate speech coding

On dynamic stream weighting for Audio-Visual Speech Recognition

Multi-parametric source-filter separation of speech and prosodic voice restoration

The ICSI RT-09 Speaker Diarization System

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams

Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game

Estimating Dominance in Multi-Party Meetings Using Speaker Diarization

Verified Speaker Localization Utilizing Voicing Level in Split-bands

Multi-parametric source-filter separation of speech and prosodic voice restoration

The ICSI RT-09 Speaker Diarization System

Estimating Dominance in Multi-Party Meetings Using Speaker Diarization

Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game

Perception and Reproduction of Auditory Spatial Impression

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams

Progress report of a project in very low bit-rate speech coding

Verified Speaker Localization Utilizing Voicing Level in Split-bands

On dynamic stream weighting for Audio-Visual Speech Recognition

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams