Subtitles are text representing the contents of the audio in a film, television show, opera or other audiovisual media. Subtitles may provide a transcription or translation of spoken dialogue. Although naming conventions vary, captions are subtitles that include written descriptions of other elements of the audio, such as music or sound effects. Captions are thus especially helpful to people who are deaf or hard of hearing. In other cases, subtitles add information not present in the audio. Localizing subtitles provide cultural context to viewers, for example by explaining to an unfamiliar American audience that sake is a type of Japanese wine. Lastly, subtitles are sometimes used for humor, as in Annie Hall, where subtitles show the characters' inner thoughts, which contradict what they are actually saying in the audio.
Creating, delivering and displaying subtitles is a complicated, multi-step endeavor. First, the text of the subtitles needs to be written. When there is plenty of time to prepare, this can be done by hand. However, for media produced in real time, like live television, it may be done by stenographers or by automated speech recognition. Subtitles written by fans, rather than by more official sources, are referred to as fansubs. Regardless of who does the writing, the subtitles must include information on when each line of text should be displayed.
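The requirement that each line of text carry its own display time can be sketched in code. The following is a minimal illustration, not a description of any particular production tool: the `Cue` class and the helper names are hypothetical, and the output follows the widely used SubRip (SRT) text layout (a counter, a `start --> end` line with millisecond timestamps, then the text).

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One subtitle line plus the interval during which it is shown (hypothetical type)."""
    start_ms: int
    end_ms: int
    text: str


def format_timestamp(ms: int) -> str:
    """Convert milliseconds to the HH:MM:SS,mmm form used in SRT files."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def to_srt(cues: list[Cue]) -> str:
    """Serialize cues as SRT blocks, numbered from 1 and separated by blank lines."""
    blocks = []
    for index, cue in enumerate(cues, start=1):
        timing = f"{format_timestamp(cue.start_ms)} --> {format_timestamp(cue.end_ms)}"
        blocks.append(f"{index}\n{timing}\n{cue.text}\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    print(to_srt([Cue(0, 1500, "Hello."), Cue(2000, 3500, "[music playing]")]))
```

Storing times as integer milliseconds avoids floating-point rounding when timestamps are formatted or compared.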
Second, subtitles need to be distributed to the audience. Open subtitles are added directly to the recorded video frames themselves and thus cannot be removed once added. Closed subtitles, on the other hand, are stored separately, which allows subtitles in different languages to be used without changing the video itself. In either case, a wide variety of technical approaches and formats is used to encode the subtitles.
Third, subtitles need to be displayed to the audience. Open subtitles are always shown whenever the video is played because they are part of the video itself. However, displaying closed subtitles is optional since they are overlaid onto the video by whatever is playing it.
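How a player might overlay closed subtitles can be sketched as a lookup from the current playback position to the text that should be on screen. This is an illustrative assumption about one possible approach, not a description of any specific player: the function name, the `(start_ms, end_ms, text)` tuple layout, and the `enabled` switch are all hypothetical, and the cues are assumed to be sorted by start time and non-overlapping.

```python
from bisect import bisect_right


def active_text(cues, position_ms, enabled=True):
    """Return the subtitle text to overlay at position_ms, or None.

    cues: list of (start_ms, end_ms, text) tuples, sorted by start_ms and
    non-overlapping (an assumption of this sketch).
    enabled: closed subtitles are optional, so the viewer can switch them off.
    """
    if not enabled:
        return None
    starts = [cue[0] for cue in cues]
    # Index of the last cue whose start time is <= position_ms.
    i = bisect_right(starts, position_ms) - 1
    if i >= 0 and position_ms < cues[i][1]:
        return cues[i][2]
    return None


if __name__ == "__main__":
    cues = [(0, 1500, "Hello."), (2000, 3500, "[music playing]")]
    for t in (1000, 1700, 2000):
        print(t, active_text(cues, t))
```

Open subtitles need no such step, since the text is already part of every frame.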
The course presents various issues related to food: sustainable production, food security, nutrition and health, and social and cultural issues. The interdisciplinary approach integrates the humanities and social sciences (SHS) and the s
This course proposes to investigate six strategies for architecture in the age of acceleration: Acceleration, Regionalism, Participation, Transformation, Commonality and Resilience. Anchoring these st
This course presents the legal framework applicable to certain issues in technical fields, such as IT law, biotechnology, data protection, the di
Karaoke (ˌkæriˈoʊki; kaɾaoke; カラオケ, clipped compound of Japanese kara 空 "empty" and ōkesutora オーケストラ "orchestra") is a type of interactive entertainment usually offered in clubs and bars, where people sing along to recorded music using a microphone. The music is an instrumental version of a well-known popular song. Lyrics are typically displayed on a video screen, along with a moving symbol, changing colour, or music video images, to guide the singer.
A film, also called a movie, motion picture, moving picture, picture, photoplay or (slang) flick, is a work of visual art that simulates experiences and otherwise communicates ideas, stories, perceptions, feelings, beauty, or atmosphere through the use of moving images. These images are generally accompanied by sound and, more rarely, other sensory stimulations. The word "cinema", short for cinematography, is often used to refer to filmmaking and the film industry, and to the art form that results from it.
Closed captioning (CC) and subtitling are both processes of displaying text on a television, video screen, or other visual display to provide additional or interpretive information. Both are typically used as a transcription of the audio portion of a program as it occurs (either verbatim or in edited form), sometimes including descriptions of non-speech elements. Other uses have included providing a textual translation of a presentation's primary audio language into another language; such a translation is usually burned in (or "open") to the video and cannot be turned off.
This text, accompanied by a 'gospel' that I made by montaging the US-recorded captions, is an attempt to re-narrate the Marshall Plan's discourse on the working class, namely the so-called 'free labor' of the US against the 'communist labor' of the "Soviet thr ...
EPFL Press, 2023
Image-based retrieval in large Earth observation archives is difficult, because one needs to navigate across thousands of candidate matches only with the proposition image as a guide. By using text as a query language, the retrieval system gains in usabili ...
2022
In this paper, we propose a new approach to learn multimodal multilingual embeddings for matching images and their relevant captions in two languages. We combine two existing objective functions to make images and captions close in a joint embedding space ...