Javascript must be enabled to continue!
SAFERNet: Channel, Positional, and Global Attention Fusion for Efficient RGB-T Segmentation in Disaster Robotics
View through CrossRef
Real-time RGB and thermal (RGB-T) fusion is vital for disaster robotics,
where robots must navigate unstructured, hazardous environments under
tight resource constraints. The main challenge is achieving precise
scene understanding, especially at object boundaries, while ensuring
computational efficiency for embedded deployment. This work proposes a
novel Channel, Positional, and Global Attention (CPGA) fusion block for
convolutional neural networks (CNNs) that enhances RGB-T fusion by
integrating three complementary attention mechanisms: (i) channel
attention for enhancing boundary-focused features, (ii) positional
attention for capturing long-range spatial dependencies, and (iii)
global attention for reducing redundant computation. Furthermore, we
introduce SAFERNet (Spectral-Attentive Fusion for Emergency Response
Network), a new RGB-T semantic segmentation architecture built on CPGA
blocks. SAFERNet fuses RGB and thermal data across five hierarchical
levels of dual ResNet backbones in an encoder-decoder configuration
optimized for emergency scenarios. The model is benchmarked against
state-of-the-art RGB-T segmentation networks using two publicly
available urban scene datasets. Efficiency is assessed through real-time
performance metrics—FLOPs (floating-point operations), FPS (frames per
second), and PARAMS (parameter count)—in relation to segmentation
fidelity. To emphasize applicability in disaster response, we further
evaluate SAFERNet on a newly annotated subset of 135 images from the
UMA-SAR disaster response dataset, featuring eleven semantic classes
tailored for search-and-rescue (SAR) missions. An extensive ablation
study evaluates the effects of backbone choices, attention modules,
decoder setups, and hyperparameters. SAFERNet consistently balances
accuracy and efficiency, making it well-suited for real-time deployment
on autonomous robots in disaster scenarios. Code and annotated data are
available at https://github.com/amsalase/CPGFANet.
Title: SAFERNet: Channel, Positional, and Global Attention Fusion for Efficient RGB-T Segmentation in Disaster Robotics
Description:
Real-time RGB and thermal (RGB-T) fusion is vital for disaster robotics,
where robots must navigate unstructured, hazardous environments under
tight resource constraints.
The main challenge is achieving precise
scene understanding, especially at object boundaries, while ensuring
computational efficiency for embedded deployment.
This work proposes a
novel Channel, Positional, and Global Attention (CPGA) fusion block for
convolutional neural networks (CNNs) that enhances RGB-T fusion by
integrating three complementary attention mechanisms: (i) channel
attention for enhancing boundary-focused features, (ii) positional
attention for capturing long-range spatial dependencies, and (iii)
global attention for reducing redundant computation.
Furthermore, we
introduce SAFERNet (Spectral-Attentive Fusion for Emergency Response
Network), a new RGB-T semantic segmentation architecture built on CPGA
blocks.
SAFERNet fuses RGB and thermal data across five hierarchical
levels of dual ResNet backbones in an encoder-decoder configuration
optimized for emergency scenarios.
The model is benchmarked against
state-of-the-art RGB-T segmentation networks using two publicly
available urban scene datasets.
Efficiency is assessed through real-time
performance metrics—FLOPs (floating-point operations), FPS (frames per
second), and PARAMS (parameter count)—in relation to segmentation
fidelity.
To emphasize applicability in disaster response, we further
evaluate SAFERNet on a newly annotated subset of 135 images from the
UMA-SAR disaster response dataset, featuring eleven semantic classes
tailored for search-and-rescue (SAR) missions.
An extensive ablation
study evaluates the effects of backbone choices, attention modules,
decoder setups, and hyperparameters.
SAFERNet consistently balances
accuracy and efficiency, making it well-suited for real-time deployment
on autonomous robots in disaster scenarios.
Code and annotated data are
available at https://github.
com/amsalase/CPGFANet.
Related Results
The Nuclear Fusion Award
The Nuclear Fusion Award
The Nuclear Fusion Award ceremony for 2009 and 2010 award winners was held during the 23rd IAEA Fusion Energy Conference in Daejeon. This time, both 2009 and 2010 award winners w...
En skvatmølle i Ljørring
En skvatmølle i Ljørring
A Horizontal Mill at Ljørring, Jutland.Horizontal water-mills have been in use in Jutland since the beginning of the Christian era 2). But the one here described shows so close a c...
A Methodology for Portraying Three-Dimensional Positional Uncertainty Using Along-Hole Depth, Inclination, and Azimuth Measurement Accuracies
A Methodology for Portraying Three-Dimensional Positional Uncertainty Using Along-Hole Depth, Inclination, and Azimuth Measurement Accuracies
Along-hole depth (AHD) is the most fundamental subsurface measurement made. AHD, together with inclination (I) and azimuth (A), are used to describe the three-dimensional (3D) posi...
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
A SAM2-Driven RGB-T Annotation Pipeline with Thermal-Guided Refinement for Semantic Segmentation in Search-and-Rescue Scenes
A SAM2-Driven RGB-T Annotation Pipeline with Thermal-Guided Refinement for Semantic Segmentation in Search-and-Rescue Scenes
High-quality RGB–thermal infrared (RGB-T) semantic segmentation datasets are crucial for search-and-rescue (SAR) applications, yet their development is hindered by the scarcity of ...
A SAM2-Driven RGB-T Annotation Pipeline with Thermal-Guided Refinement for Semantic Segmentation in Search-and-Rescue Scenes
A SAM2-Driven RGB-T Annotation Pipeline with Thermal-Guided Refinement for Semantic Segmentation in Search-and-Rescue Scenes
High-quality RGB–thermal infrared (RGB-T) semantic segmentation datasets are crucial for search-and-rescue (SAR) applications, yet their development is hindered by the scarcity of ...
Disaster Anthropology
Disaster Anthropology
Disaster Anthropology uses theoretical and methodological tools from across anthropological subfields to understand the effects of disasters. Anthropologists based in academia and ...
Three-Dimensional Positional Uncertainty Based on Along-Hole Depth, Inclination and Azimuth Accuracies
Three-Dimensional Positional Uncertainty Based on Along-Hole Depth, Inclination and Azimuth Accuracies
Abstract
Along-hole Depth (AHD) is the most fundamental subsurface wellbore measurement made. Well depth is the main descriptor of wellbore position, measured from z...

