Javascript must be enabled to continue!
Minimizing Binding Errors Using Learned Conjunctive Features
View through CrossRef
We have studied some of the design trade-offs governing visual representations based on spatially invariant conjunctive feature detectors, with an emphasis on the susceptibility of such systems to false-positive recognition errors—Malsburg's classical binding problem. We begin by deriving an analytical model that makes explicit how recognition performance is affected by the number of objects that must be distinguished, the number of features included in the representation, the complexity of individual objects, and the clutter load, that is, the amount of visual material in the field of view in which multiple objects must be simultaneously recognized, independent of pose, and without explicit segmentation. Using the domain of text to model object recognition in cluttered scenes, we show that with corrections for the nonuniform probability and nonindependence of text features, the analytical model achieves good fits to measured recognition rates in simulations involving a wide range of clutter loads, word sizes, and feature counts. We then introduce a greedy algorithm for feature learning, derived from the analytical model, which grows a representation by choosing those conjunctive features that are most likely to distinguish objects from the cluttered backgrounds in which they are embedded. We show that the representations produced by this algorithm are compact, decorrelated, and heavily weighted toward features of low conjunctive order. Our results provide a more quantitative basis for understanding when spatially invariant conjunctive features can support unambiguous perception in multiobject scenes, and lead to several insights regarding the properties of visual representations optimized for specific recognition tasks.
Title: Minimizing Binding Errors Using Learned Conjunctive Features
Description:
We have studied some of the design trade-offs governing visual representations based on spatially invariant conjunctive feature detectors, with an emphasis on the susceptibility of such systems to false-positive recognition errors—Malsburg's classical binding problem.
We begin by deriving an analytical model that makes explicit how recognition performance is affected by the number of objects that must be distinguished, the number of features included in the representation, the complexity of individual objects, and the clutter load, that is, the amount of visual material in the field of view in which multiple objects must be simultaneously recognized, independent of pose, and without explicit segmentation.
Using the domain of text to model object recognition in cluttered scenes, we show that with corrections for the nonuniform probability and nonindependence of text features, the analytical model achieves good fits to measured recognition rates in simulations involving a wide range of clutter loads, word sizes, and feature counts.
We then introduce a greedy algorithm for feature learning, derived from the analytical model, which grows a representation by choosing those conjunctive features that are most likely to distinguish objects from the cluttered backgrounds in which they are embedded.
We show that the representations produced by this algorithm are compact, decorrelated, and heavily weighted toward features of low conjunctive order.
Our results provide a more quantitative basis for understanding when spatially invariant conjunctive features can support unambiguous perception in multiobject scenes, and lead to several insights regarding the properties of visual representations optimized for specific recognition tasks.
Related Results
Minimizing Binding Errors Using Learned Conjunctive Features
Minimizing Binding Errors Using Learned Conjunctive Features
We have studied some of the design trade-offs governing visual representations based on spatially invariant conjunctive feature detectors, with an emphasis on the susceptibility of...
Positive Practice Overcorrection of Oral Reading Errors
Positive Practice Overcorrection of Oral Reading Errors
This study evaluated the effects of two treatment procedures on uncorrected oral reading errors and self-corrections of errors by four moderately mentally retarded girls. In an alt...
Spelling Errors in Thai Made by Chinese Students Learning Thai as a Foreign Language
Spelling Errors in Thai Made by Chinese Students Learning Thai as a Foreign Language
When learning a foreign language, it is important to learn how to spell accurately as it is crucial for communication. To spell Thai language accurately is challenging for both nat...
Two Informational Theories of Memory: a case from Memory-Conjunction Errors
Two Informational Theories of Memory: a case from Memory-Conjunction Errors
Abstract
The causal and simulation theories are often presented as very distinct views about declarative memory, their major difference lying on the causal condition...
Experimental and Numerical Investigation of the Seismic Performance of RC Moment Resisting Frames
Experimental and Numerical Investigation of the Seismic Performance of RC Moment Resisting Frames
The rehabilitation of concrete structures has been a subject of extensive investigation, exploring various facets. One such avenue involves the incorporation of fiber additives int...
Hidden depths: Acceptable ignorance about ocean bottoms
Hidden depths: Acceptable ignorance about ocean bottoms
Normal-mode analysis of underwater sound propagation in principle requires knowledge of pertinent physical parameters at all depths in the water and the bottom material—an unattain...
Intragenic conflict in phylogenomic datasets
Intragenic conflict in phylogenomic datasets
AbstractMost phylogenetic analyses assume that a single evolutionary history underlies one gene. However, both biological processes and errors in dataset assembly can violate this ...
Temporal binding past the Libet clock: testing design factors for an auditory timer
Temporal binding past the Libet clock: testing design factors for an auditory timer
AbstractVoluntary actions and causally linked sensory stimuli are perceived to be shifted towards each other in time. This so-called temporal binding is commonly assessed in paradi...