Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Active vision in binocular depth estimation: a top-down perspective

View through CrossRef
A bstract Depth estimation is an ill-posed problem: objects of different shapes or dimensions, even if at different distances, may project to the same image on the retina. Our brain uses several cues for depth estimation, including monocular cues such as motion parallax and binocular cues like diplopia. However, it is still unclear how the computations required for depth estimation are implemented in biologically plausible ways. State-of-the-art approaches to depth estimation based on deep neural networks implicitly describe the brain as a hierarchical feature detector. Instead, we propose an alternative approach that casts depth estimation as a problem of active inference. We show that depth can be inferred by inverting a hierarchical generative model that simultaneously predicts the eyes projections from a 2D belief over an object. Model inversion consists of a series of biologically plausible, homogeneous transformations based on Predictive Coding principles. Under the plausible assumption of a nonuniform fovea resolution, depth estimation favors an active vision strategy that fixates the object with the eyes, rendering the depth belief more accurate. This strategy is not realized by first fixating on a target and then estimating the depth, but by combining the two processes through action-perception cycles, with a similar mechanism of the saccades during object recognition. The proposed approach requires only local (top-down and bottom-up) message passing that can be implemented in biologically plausible neural circuits.
Title: Active vision in binocular depth estimation: a top-down perspective
Description:
A bstract Depth estimation is an ill-posed problem: objects of different shapes or dimensions, even if at different distances, may project to the same image on the retina.
Our brain uses several cues for depth estimation, including monocular cues such as motion parallax and binocular cues like diplopia.
However, it is still unclear how the computations required for depth estimation are implemented in biologically plausible ways.
State-of-the-art approaches to depth estimation based on deep neural networks implicitly describe the brain as a hierarchical feature detector.
Instead, we propose an alternative approach that casts depth estimation as a problem of active inference.
We show that depth can be inferred by inverting a hierarchical generative model that simultaneously predicts the eyes projections from a 2D belief over an object.
Model inversion consists of a series of biologically plausible, homogeneous transformations based on Predictive Coding principles.
Under the plausible assumption of a nonuniform fovea resolution, depth estimation favors an active vision strategy that fixates the object with the eyes, rendering the depth belief more accurate.
This strategy is not realized by first fixating on a target and then estimating the depth, but by combining the two processes through action-perception cycles, with a similar mechanism of the saccades during object recognition.
The proposed approach requires only local (top-down and bottom-up) message passing that can be implemented in biologically plausible neural circuits.

Related Results

COMPARISON OF BINOCULAR SPECTACLE PRESCRIPTION VERSUS PATCHING THERAPY IN REFRACTIVE AMBLYOPIA PATIENTS
COMPARISON OF BINOCULAR SPECTACLE PRESCRIPTION VERSUS PATCHING THERAPY IN REFRACTIVE AMBLYOPIA PATIENTS
Background: Amblyopia is the leading cause of visual impairment in children, adolescents, and middle-aged adults, characterized by a loss of binocularity and a significant reductio...
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
Frequency and associated factors of accommodation and non-strabismic binocular vision dysfunction among medical university students
Frequency and associated factors of accommodation and non-strabismic binocular vision dysfunction among medical university students
AIM: To investigate the frequency and associated factors of accommodation and non-strabismic binocular vision dysfunction among medical university students. METHODS: Totally 158 st...
Monocular Depth Estimation (Literature Review)
Monocular Depth Estimation (Literature Review)
Background. The physiological basis of spatial perception is traditionally attributed to the binocular system, which integrates the signals coming to the brain from each eye into a...
A binocular synaptic network supports interocular response alignment in visual cortical neurons
A binocular synaptic network supports interocular response alignment in visual cortical neurons
AbstractIn the visual system, signals from the two eyes are combined to form a coherent representation through the convergence of synaptic input populations onto individual cortica...
Ab initio Binocular Formulation of Listing's Law
Ab initio Binocular Formulation of Listing's Law
Abstract Human eyes do not have perfectly aligned optical components; the fovea is displaced from the posterior pole, and the crystalline lens is tilted away from the optic...
Binocular processing facilitates escape behavior through multiple pathways to the superior colliculus
Binocular processing facilitates escape behavior through multiple pathways to the superior colliculus
SummaryThe superior colliculus (SC) is the main brain region regulating innate defensive behaviors to visual threat. Yet, how the SC integrates binocular visual information and to ...
Cellular and circuit mechanisms underlying binocular vision
Cellular and circuit mechanisms underlying binocular vision
SummaryHow binocular visual information is combined at the level of single neurons in the brain is not fully understood. Here, we show in mice that callosal input from the opposite...

Back to Top