Combining Visual and Textual Features for Semantic Segmentation of Historical Newspapers
Related publications (42)
Graph Chatbot
Chat with Graph Search
Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
In crowding, the perception of a target is impeded by surrounding clutter. While traditional models are feedforward and local, there is increasing behavioral and neural evidence for a critical role of recurrent processing across the visual hierarchy in cro ...
Visual processing can be seen as the integration and segmentation of features. Objects are composed of contours, integrated into shapes and segmented from other contours. Information also needs to be integrated to solve the ill-posed problems of vision. Fo ...
In this paper, we study how to extract visual concepts to understand landscape scenicness. Using visual feature representations from a Convolutional Neural Network (CNN), we learn a number of Concept Activation Vectors (CAV) aligned with semantic concepts ...
Semantic segmentation consists of the generation of a categorical map, given an image in which each pixel of the image is automatically assigned a class. Deep learning allows the influence of the pixel's context to be learned by capturing the non-linear re ...
Dataset and models used and produced in the work described in the paper "Combining Visual and Textual Features for Semantic Segmentation of Historical Newspapers": https://infoscience.epfl.ch/record/282863?ln=en ...
Classically, vision is seen as a cascade of local, feedforward computations. This framework has been tremendously successful, inspiring a wide range of ground-breaking findings in neuroscience and computer vision. Recently, feedforward Convolutional Neural ...
One of the main limitations of artificial intelligence today is its inability to adapt to unforeseen circumstances. Machine Learning (ML), due to its data-driven nature, is particularly susceptible to this. ML relies on observations in order to learn impli ...
Stereo reconstruction is a problem of recovering a 3d structure of a scene from a pair of images of the scene, acquired from different viewpoints. It has been investigated for decades and many successful methods were developed.The main drawback of these ...
Crowding, the impairment of target discrimination in clutter, is the standard situation in vision. Traditionally, crowding is explained with (feedforward) models, in which only neighboring elements interact, leading to a “bottleneck” at the earliest stages ...
This paper presents the combined effect of indoor temperature (19 degrees C, 22 degrees C, and 26 degrees C) and colored glazing (blue, orange, and neutral) on visual perception of daylight. Experiments were performed in an office-like test room, in which ...