Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

GCN Embedding Swin-Unet for Forest Remote Sensing Image Semantic Segmentation

View through CrossRef
Forest resources are among the most important ecosystems on the earth. The semantic segmentation and accurate positioning of ground objects in forest remote sensing (RS) imagery are crucial to the emergency treatment of forest natural disasters, especially forest fires. Currently, most existing methods for image semantic segmentation are built upon convolutional neural network (CNN). Nevertheless, these techniques face difficulties in directly accessing global contextual information and accurately detecting geometric transformations within the image’s target regions. This limitation stems from the inherent locality of convolution operations, which are restricted to processing data structured in Euclidean space and confined to square-shaped regions.Inspired by the Graph Convolution Network (GCN) with robust capabilities in processing irregular and complex targets, as well as Swin Transformers renowned for exceptional global context modeling, we present an innovative semantic segmentation framework for forest remote sensing imagery termed GSwin-Unet. This framework embeds GCN model into Swin-Unet architecture, and for the first time apply the method of combining GCN and Transformer in the domain of forest RS imagery analysis. GSwin-Unet features an innovative parallel dual-encoder architecture of GCN and Swin transformer. First, we integrate the Zero-DCE (Zero-Reference Deep Curve Estimation) algorithm into GSwin-Unet to enhance forest RS image feature representation. Second, a feature aggregation module (FAM) is proposed to bridge the dual encoders by fusing GCN-derived local aggregated features with Swin transformer-extracted features. Our study demonstrates that the GSwin-Unet significantly improves performance on the Forest Remote Sensing Dataset and exhibits good adaptability on GID dataset.
Title: GCN Embedding Swin-Unet for Forest Remote Sensing Image Semantic Segmentation
Description:
Forest resources are among the most important ecosystems on the earth.
The semantic segmentation and accurate positioning of ground objects in forest remote sensing (RS) imagery are crucial to the emergency treatment of forest natural disasters, especially forest fires.
Currently, most existing methods for image semantic segmentation are built upon convolutional neural network (CNN).
Nevertheless, these techniques face difficulties in directly accessing global contextual information and accurately detecting geometric transformations within the image’s target regions.
This limitation stems from the inherent locality of convolution operations, which are restricted to processing data structured in Euclidean space and confined to square-shaped regions.
Inspired by the Graph Convolution Network (GCN) with robust capabilities in processing irregular and complex targets, as well as Swin Transformers renowned for exceptional global context modeling, we present an innovative semantic segmentation framework for forest remote sensing imagery termed GSwin-Unet.
This framework embeds GCN model into Swin-Unet architecture, and for the first time apply the method of combining GCN and Transformer in the domain of forest RS imagery analysis.
GSwin-Unet features an innovative parallel dual-encoder architecture of GCN and Swin transformer.
First, we integrate the Zero-DCE (Zero-Reference Deep Curve Estimation) algorithm into GSwin-Unet to enhance forest RS image feature representation.
Second, a feature aggregation module (FAM) is proposed to bridge the dual encoders by fusing GCN-derived local aggregated features with Swin transformer-extracted features.
Our study demonstrates that the GSwin-Unet significantly improves performance on the Forest Remote Sensing Dataset and exhibits good adaptability on GID dataset.

Related Results

VM-UNet++ research on crack image segmentation based on improved VM-UNet
VM-UNet++ research on crack image segmentation based on improved VM-UNet
Abstract Cracks are common defects in physical structures, and if not detected and addressed in a timely manner, they can pose a severe threat to the overall safety of th...
ASCEND-UNet: An Improved UNet Configuration Optimized for Rural Settlements Mapping
ASCEND-UNet: An Improved UNet Configuration Optimized for Rural Settlements Mapping
Different types of rural settlement agglomerations have been formed and mixed in space during the rural revitalization strategy implementation in China. Discriminating them from re...
Comparison of Single-channel and Split-window Methods for Estimating Land Surface Temperature from Landsat 8 Data
Comparison of Single-channel and Split-window Methods for Estimating Land Surface Temperature from Landsat 8 Data
Abstract: Landsat 8 is the eighth satellite in the Landsat program, which provides images at 11 spectral channels, including 2 thermal infrared bands at a spatial resolution of 100...
AI‐enabled precise brain tumor segmentation by integrating Refinenet and contour‐constrained features in MRI images
AI‐enabled precise brain tumor segmentation by integrating Refinenet and contour‐constrained features in MRI images
AbstractBackgroundMedical image segmentation is a fundamental task in medical image analysis and has been widely applied in multiple medical fields. The latest transformer‐based de...
Factors influencing and patterns of forest utilization in communities around the Huay Tak Teak Biosphere Reserve, Lampang Province
Factors influencing and patterns of forest utilization in communities around the Huay Tak Teak Biosphere Reserve, Lampang Province
Background and Objectives: To establish the land regulation, it is necessary to know basic information of the surrounding community’s land use and to be aware of basic forest laws....
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
A Domain-Change Approach to the Semantic Labelling of Remote Sensing Images
A Domain-Change Approach to the Semantic Labelling of Remote Sensing Images
<p>For many years, image classification – mainly based on pixel brightness statistics – has been among the<br>most popular r...
A Semantic Orthogonal Mapping Method Through Deep-Learning for Semantic Computing
A Semantic Orthogonal Mapping Method Through Deep-Learning for Semantic Computing
In order to realize an artificial intelligent system, a basic mechanism should be provided for expressing and processing the semantic. We have presented semantic computing models i...

Back to Top