Beyond Bouma's window: How to explain global aspects of crowding?

Michael Herzog, Gregory Francis, Adrien Christophe Doerig, Aaron Michael Clarke, Alban Bornet
2019
Journal paper

Abstract

In crowding, perception of an object deteriorates in the presence of nearby elements. Although crowding is a ubiquitous phenomenon, since elements are rarely seen in isolation, to date there exists no consensus on how to model it. Previous experiments showed that the global configuration of the entire stimulus must be taken into account. These findings rule out simple pooling or substitution models and favor models sensitive to global spatial aspects. In order to investigate how to incorporate global aspects into models, we tested a large number of models with a database of forty stimuli tailored for the global aspects of crowding. Our results show that incorporating grouping like components strongly improves model performance. Author summary Visual crowding highlights interactions between elements in the visual field. For example, an object is more difficult to recognize if it is presented in clutter. Crowding is one of the most fundamental aspects of vision, playing crucial roles in object recognition, reading and visual perception in general, and is therefore an essential tool to understand how the visual system encodes information based on its retinal input. Hence, classic models of crowding have focused only on local interactions between neighboring visual elements. However, abundant experimental evidence argues against local processing, suggesting that the global configuration of visual elements strongly modulates crowding. Here, we tested all available models of crowding that are able to capture global processing across the entire visual field. We tested 12 models including the Texture Tiling Model, a Deep Convolutional Neural Network and the LAMINART neural network with large scale computer simulations. We found that models incorporating a grouping component are best suited to explain the data. Our results suggest that in order to understand vision in general, mid-level, contextual processing is inevitable.

Official source

https://infoscience.epfl.ch/record/267627?ln=en

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Beyond Bouma's window: How to explain global aspects of crowding?

Graph Chatbot

Chat with Graph Search

Probing and modulating inter-areal coupling in the cortical visual motion processing pathway with non-invasive brain stimulation

Predicting Visual Stimuli From Cortical Response Recorded With Wide-Field Imaging in a Mouse

Decoding electroencephalographic responses to visual stimuli compatible with electrical stimulation

Probing and modulating inter-areal coupling in the cortical visual motion processing pathway with non-invasive brain stimulation

Predicting Visual Stimuli From Cortical Response Recorded With Wide-Field Imaging in a Mouse

Decoding electroencephalographic responses to visual stimuli compatible with electrical stimulation