Javascript must be enabled to continue!
Bio-mimetic high-speed target localization with fused frame and event vision for edge application
View through CrossRef
Evolution has honed predatory skills in the natural world where localizing and intercepting fast-moving prey is required. The current generation of robotic systems mimics these biological systems using deep learning. High-speed processing of the camera frames using convolutional neural networks (CNN) (frame pipeline) on such constrained aerial edge-robots gets resource-limited. Adding more compute resources also eventually limits the throughput at the frame rate of the camera as frame-only traditional systems fail to capture the detailed temporal dynamics of the environment. Bio-inspired event cameras and spiking neural networks (SNN) provide an asynchronous sensor-processor pair (event pipeline) capturing the continuous temporal details of the scene for high-speed but lag in terms of accuracy. In this work, we propose a target localization system combining event-camera and SNN-based high-speed target estimation and frame-based camera and CNN-driven reliable object detection by fusing complementary spatio-temporal prowess of event and frame pipelines. One of our main contributions involves the design of an SNN filter that borrows from the neural mechanism for ego-motion cancelation in houseflies. It fuses the vestibular sensors with the vision to cancel the activity corresponding to the predator's self-motion. We also integrate the neuro-inspired multi-pipeline processing with task-optimized multi-neuronal pathway structure in primates and insects. The system is validated to outperform CNN-only processing using prey-predator drone simulations in realistic 3D virtual environments. The system is then demonstrated in a real-world multi-drone set-up with emulated event data. Subsequently, we use recorded actual sensory data from multi-camera and inertial measurement unit (IMU) assembly to show desired working while tolerating the realistic noise in vision and IMU sensors. We analyze the design space to identify optimal parameters for spiking neurons, CNN models, and for checking their effect on the performance metrics of the fused system. Finally, we map the throughput controlling SNN and fusion network on edge-compatible Zynq-7000 FPGA to show a potential 264 outputs per second even at constrained resource availability. This work may open new research directions by coupling multiple sensing and processing modalities inspired by discoveries in neuroscience to break fundamental trade-offs in frame-based computer vision1.
Frontiers Media SA
Title: Bio-mimetic high-speed target localization with fused frame and event vision for edge application
Description:
Evolution has honed predatory skills in the natural world where localizing and intercepting fast-moving prey is required.
The current generation of robotic systems mimics these biological systems using deep learning.
High-speed processing of the camera frames using convolutional neural networks (CNN) (frame pipeline) on such constrained aerial edge-robots gets resource-limited.
Adding more compute resources also eventually limits the throughput at the frame rate of the camera as frame-only traditional systems fail to capture the detailed temporal dynamics of the environment.
Bio-inspired event cameras and spiking neural networks (SNN) provide an asynchronous sensor-processor pair (event pipeline) capturing the continuous temporal details of the scene for high-speed but lag in terms of accuracy.
In this work, we propose a target localization system combining event-camera and SNN-based high-speed target estimation and frame-based camera and CNN-driven reliable object detection by fusing complementary spatio-temporal prowess of event and frame pipelines.
One of our main contributions involves the design of an SNN filter that borrows from the neural mechanism for ego-motion cancelation in houseflies.
It fuses the vestibular sensors with the vision to cancel the activity corresponding to the predator's self-motion.
We also integrate the neuro-inspired multi-pipeline processing with task-optimized multi-neuronal pathway structure in primates and insects.
The system is validated to outperform CNN-only processing using prey-predator drone simulations in realistic 3D virtual environments.
The system is then demonstrated in a real-world multi-drone set-up with emulated event data.
Subsequently, we use recorded actual sensory data from multi-camera and inertial measurement unit (IMU) assembly to show desired working while tolerating the realistic noise in vision and IMU sensors.
We analyze the design space to identify optimal parameters for spiking neurons, CNN models, and for checking their effect on the performance metrics of the fused system.
Finally, we map the throughput controlling SNN and fusion network on edge-compatible Zynq-7000 FPGA to show a potential 264 outputs per second even at constrained resource availability.
This work may open new research directions by coupling multiple sensing and processing modalities inspired by discoveries in neuroscience to break fundamental trade-offs in frame-based computer vision1.
Related Results
Indoor Localization System Based on RSSI-APIT Algorithm
Indoor Localization System Based on RSSI-APIT Algorithm
An indoor localization system based on the RSSI-APIT algorithm is designed in this study. Integrated RSSI (received signal strength indication) and non-ranging APIT (approximate pe...
Assessment of the physical characteristics and fishing performance of gillnets using biodegradable resin (PBS/PBAT and PBSAT) to reduce ghost fishing
Assessment of the physical characteristics and fishing performance of gillnets using biodegradable resin (PBS/PBAT and PBSAT) to reduce ghost fishing
Abstract
Ghost fishing is caused by derelict synthetic‐fibre nets that have been lost at sea. Thus, biodegradable nets have been developed with the aim of protecting marine ecosy...
Event based SLAM
Event based SLAM
(English) Event-based cameras are novel sensors with a bio-inspired design that exhibit a high dynamic range and extremely low latency. They sensing principle is different than the...
Regulation of the oxidase mimetic activity of ceria nanoparticles by buffer composition
Regulation of the oxidase mimetic activity of ceria nanoparticles by buffer composition
Ceria nanoparticles (CNPs) is an important typical nanozyme with
multiple enzyme mimetic activities, which could facilitate the oxidation
of organic dyes in acidic conditions, beca...
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
GROWTH PERFORMANCE OF TISSUE-CULTURED ´LAKATAN´ BANANA (MUSA ACUMINATA) PLANTLETS USING STIMULANTS
GROWTH PERFORMANCE OF TISSUE-CULTURED ´LAKATAN´ BANANA (MUSA ACUMINATA) PLANTLETS USING STIMULANTS
The study aimed to determine the effects of stimulants on the growth performance of tissue-cultured 'lakatan' banana plantlets (Musa acuminata) under nursery condition; to determin...
Vision-specific and psychosocial impacts of low vision among patients with low vision at the eastern regional Low Vision Centre
Vision-specific and psychosocial impacts of low vision among patients with low vision at the eastern regional Low Vision Centre
Purpose: To determine vision-specific and psychosocial implications of low vision among patients with low vision visiting the Low Vision Centre of the Eastern Regional Hospital in ...
Numerical investigation on energy absorption characteristics of impact-resistant lightweight structure of bio-mimetic micro aerial vehicle
Numerical investigation on energy absorption characteristics of impact-resistant lightweight structure of bio-mimetic micro aerial vehicle
With the increasing development of micro aerial vehicle, the impact resistance of the structure has gradually become a research hotspot. The beetle's exoskeleton not only provides ...

