Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

SAFERNet: Channel, Positional, and Global Attention Fusion for Efficient RGB-T Segmentation in Disaster Robotics

View through CrossRef
Real-time RGB and thermal (RGB-T) fusion is vital for disaster robotics, where robots must navigate unstructured, hazardous environments under tight resource constraints. The main challenge is achieving precise scene understanding, especially at object boundaries, while ensuring computational efficiency for embedded deployment. This work proposes a novel Channel, Positional, and Global Attention (CPGA) fusion block for convolutional neural networks (CNNs) that enhances RGB-T fusion by integrating three complementary attention mechanisms: (i) channel attention for enhancing boundary-focused features, (ii) positional attention for capturing long-range spatial dependencies, and (iii) global attention for reducing redundant computation. Furthermore, we introduce SAFERNet (Spectral-Attentive Fusion for Emergency Response Network), a new RGB-T semantic segmentation architecture built on CPGA blocks. SAFERNet fuses RGB and thermal data across five hierarchical levels of dual ResNet backbones in an encoder-decoder configuration optimized for emergency scenarios. The model is benchmarked against state-of-the-art RGB-T segmentation networks using two publicly available urban scene datasets. Efficiency is assessed through real-time performance metrics—FLOPs (floating-point operations), FPS (frames per second), and PARAMS (parameter count)—in relation to segmentation fidelity. To emphasize applicability in disaster response, we further evaluate SAFERNet on a newly annotated subset of 135 images from the UMA-SAR disaster response dataset, featuring eleven semantic classes tailored for search-and-rescue (SAR) missions. An extensive ablation study evaluates the effects of backbone choices, attention modules, decoder setups, and hyperparameters. SAFERNet consistently balances accuracy and efficiency, making it well-suited for real-time deployment on autonomous robots in disaster scenarios. Code and annotated data are available at https://github.com/amsalase/CPGFANet.
Title: SAFERNet: Channel, Positional, and Global Attention Fusion for Efficient RGB-T Segmentation in Disaster Robotics
Description:
Real-time RGB and thermal (RGB-T) fusion is vital for disaster robotics, where robots must navigate unstructured, hazardous environments under tight resource constraints.
The main challenge is achieving precise scene understanding, especially at object boundaries, while ensuring computational efficiency for embedded deployment.
This work proposes a novel Channel, Positional, and Global Attention (CPGA) fusion block for convolutional neural networks (CNNs) that enhances RGB-T fusion by integrating three complementary attention mechanisms: (i) channel attention for enhancing boundary-focused features, (ii) positional attention for capturing long-range spatial dependencies, and (iii) global attention for reducing redundant computation.
Furthermore, we introduce SAFERNet (Spectral-Attentive Fusion for Emergency Response Network), a new RGB-T semantic segmentation architecture built on CPGA blocks.
SAFERNet fuses RGB and thermal data across five hierarchical levels of dual ResNet backbones in an encoder-decoder configuration optimized for emergency scenarios.
The model is benchmarked against state-of-the-art RGB-T segmentation networks using two publicly available urban scene datasets.
Efficiency is assessed through real-time performance metrics—FLOPs (floating-point operations), FPS (frames per second), and PARAMS (parameter count)—in relation to segmentation fidelity.
To emphasize applicability in disaster response, we further evaluate SAFERNet on a newly annotated subset of 135 images from the UMA-SAR disaster response dataset, featuring eleven semantic classes tailored for search-and-rescue (SAR) missions.
An extensive ablation study evaluates the effects of backbone choices, attention modules, decoder setups, and hyperparameters.
SAFERNet consistently balances accuracy and efficiency, making it well-suited for real-time deployment on autonomous robots in disaster scenarios.
Code and annotated data are available at https://github.
com/amsalase/CPGFANet.

Related Results

The Nuclear Fusion Award
The Nuclear Fusion Award
The Nuclear Fusion Award ceremony for 2009 and 2010 award winners was held during the 23rd IAEA Fusion Energy Conference in Daejeon. This time, both 2009 and 2010 award winners w...
En skvatmølle i Ljørring
En skvatmølle i Ljørring
A Horizontal Mill at Ljørring, Jutland.Horizontal water-mills have been in use in Jutland since the beginning of the Christian era 2). But the one here described shows so close a c...
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
A SAM2-Driven RGB-T Annotation Pipeline with Thermal-Guided Refinement for Semantic Segmentation in Search-and-Rescue Scenes
A SAM2-Driven RGB-T Annotation Pipeline with Thermal-Guided Refinement for Semantic Segmentation in Search-and-Rescue Scenes
High-quality RGB–thermal infrared (RGB-T) semantic segmentation datasets are crucial for search-and-rescue (SAR) applications, yet their development is hindered by the scarcity of ...
A SAM2-Driven RGB-T Annotation Pipeline with Thermal-Guided Refinement for Semantic Segmentation in Search-and-Rescue Scenes
A SAM2-Driven RGB-T Annotation Pipeline with Thermal-Guided Refinement for Semantic Segmentation in Search-and-Rescue Scenes
High-quality RGB–thermal infrared (RGB-T) semantic segmentation datasets are crucial for search-and-rescue (SAR) applications, yet their development is hindered by the scarcity of ...
Disaster Anthropology
Disaster Anthropology
Disaster Anthropology uses theoretical and methodological tools from across anthropological subfields to understand the effects of disasters. Anthropologists based in academia and ...
Three-Dimensional Positional Uncertainty Based on Along-Hole Depth, Inclination and Azimuth Accuracies
Three-Dimensional Positional Uncertainty Based on Along-Hole Depth, Inclination and Azimuth Accuracies
Abstract Along-hole Depth (AHD) is the most fundamental subsurface wellbore measurement made. Well depth is the main descriptor of wellbore position, measured from z...

Back to Top