Javascript must be enabled to continue!

SAFERNet: Channel, Positional, and Global Attention Fusion for Efficient RGB-T Segmentation in Disaster Robotics

Real-time RGB and thermal (RGB-T) fusion is vital for disaster robotics, where robots must navigate unstructured, hazardous environments under tight resource constraints. The main challenge is achieving precise scene understanding, especially at object boundaries, while ensuring computational efficiency for embedded deployment. This work proposes a novel Channel, Positional, and Global Attention (CPGA) fusion block for convolutional neural networks (CNNs) that enhances RGB-T fusion by integrating three complementary attention mechanisms: (i) channel attention for enhancing boundary-focused features, (ii) positional attention for capturing long-range spatial dependencies, and (iii) global attention for reducing redundant computation. Furthermore, we introduce SAFERNet (Spectral-Attentive Fusion for Emergency Response Network), a new RGB-T semantic segmentation architecture built on CPGA blocks. SAFERNet fuses RGB and thermal data across five hierarchical levels of dual ResNet backbones in an encoder-decoder configuration optimized for emergency scenarios. The model is benchmarked against state-of-the-art RGB-T segmentation networks using two publicly available urban scene datasets. Efficiency is assessed through real-time performance metrics—FLOPs (floating-point operations), FPS (frames per second), and PARAMS (parameter count)—in relation to segmentation fidelity. To emphasize applicability in disaster response, we further evaluate SAFERNet on a newly annotated subset of 135 images from the UMA-SAR disaster response dataset, featuring eleven semantic classes tailored for search-and-rescue (SAR) missions. An extensive ablation study evaluates the effects of backbone choices, attention modules, decoder setups, and hyperparameters. SAFERNet consistently balances accuracy and efficiency, making it well-suited for real-time deployment on autonomous robots in disaster scenarios. Code and annotated data are available at https://github.com/amsalase/CPGFANet.

Wiley

Andrés Salas-Espinales Ricardo Vazquez-Martin Anthony Mandow

2025

Title: SAFERNet: Channel, Positional, and Global Attention Fusion for Efficient RGB-T Segmentation in Disaster Robotics

Description:

Real-time RGB and thermal (RGB-T) fusion is vital for disaster robotics, where robots must navigate unstructured, hazardous environments under tight resource constraints.

The main challenge is achieving precise scene understanding, especially at object boundaries, while ensuring computational efficiency for embedded deployment.

This work proposes a novel Channel, Positional, and Global Attention (CPGA) fusion block for convolutional neural networks (CNNs) that enhances RGB-T fusion by integrating three complementary attention mechanisms: (i) channel attention for enhancing boundary-focused features, (ii) positional attention for capturing long-range spatial dependencies, and (iii) global attention for reducing redundant computation.

Furthermore, we introduce SAFERNet (Spectral-Attentive Fusion for Emergency Response Network), a new RGB-T semantic segmentation architecture built on CPGA blocks.

SAFERNet fuses RGB and thermal data across five hierarchical levels of dual ResNet backbones in an encoder-decoder configuration optimized for emergency scenarios.

The model is benchmarked against state-of-the-art RGB-T segmentation networks using two publicly available urban scene datasets.

Efficiency is assessed through real-time performance metrics—FLOPs (floating-point operations), FPS (frames per second), and PARAMS (parameter count)—in relation to segmentation fidelity.

To emphasize applicability in disaster response, we further evaluate SAFERNet on a newly annotated subset of 135 images from the UMA-SAR disaster response dataset, featuring eleven semantic classes tailored for search-and-rescue (SAR) missions.

An extensive ablation study evaluates the effects of backbone choices, attention modules, decoder setups, and hyperparameters.

SAFERNet consistently balances accuracy and efficiency, making it well-suited for real-time deployment on autonomous robots in disaster scenarios.

Code and annotated data are available at https://github.

com/amsalase/CPGFANet.

Back

Related Results

The Nuclear Fusion Award

The Nuclear Fusion Award ceremony for 2009 and 2010 award winners was held during the 23rd IAEA Fusion Energy Conference in Daejeon. This time, both 2009 and 2010 award winners w...

En skvatmølle i Ljørring

A Horizontal Mill at Ljørring, Jutland.Horizontal water-mills have been in use in Jutland since the beginning of the Christian era 2). But the one here described shows so close a c...

Development of educational robotics: Problems and prospects

Introduction. The inclusion of schoolchildren in the development of robotics can serve as an effective method of popularising technical education and a means of vocational guidance...

Depth-aware salient object segmentation

Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...

A Methodology for Portraying Three-Dimensional Positional Uncertainty Using Along-Hole Depth, Inclination, and Azimuth Measurement Accuracies

Along-hole depth (AHD) is the most fundamental subsurface measurement made. AHD, together with inclination (I) and azimuth (A), are used to describe the three-dimensional (3D) posi...

A SAM2-Driven RGB-T Annotation Pipeline with Thermal-Guided Refinement for Semantic Segmentation in Search-and-Rescue Scenes

High-quality RGB–thermal infrared (RGB-T) semantic segmentation datasets are crucial for search-and-rescue (SAR) applications, yet their development is hindered by the scarcity of ...

Disaster Anthropology

Disaster Anthropology uses theoretical and methodological tools from across anthropological subfields to understand the effects of disasters. Anthropologists based in academia and ...

A SAM2-Driven RGB-T Annotation Pipeline with Thermal-Guided Refinement for Semantic Segmentation in Search-and-Rescue Scenes

High-quality RGB–thermal infrared (RGB-T) semantic segmentation datasets are crucial for search-and-rescue (SAR) applications, yet their development is hindered by the scarcity of ...

Email:
Password:

Email: