Fine-Tuning SAM2 for Generalizable Polyp Segmentation with a Channel Attention-Enhanced Decoder
Polyp segmentation is a critical task in medical image analysis, particularly in colonoscopy, where it plays a vital role in the early detection and treatment of colorectal cancer. In recent years, advancements in deep learning, especially the application of Convolutional Neural Networks (CNNs) and Transformer models, have significantly improved segmentation performance. Despite these advancements, the generalizability of these models across different datasets is often limited. Recently, Meta released the Segment Anything Model 2 (SAM2), which has demonstrated exceptional performance in both video and image segmentation tasks. This paper aims to develop a universal polyp segmentation model by fine-tuning the pre-trained encoder of SAM2. We introduce a learnable prompt layer within the Transformer blocks and employ a full-scale skip connection structure as a decoder to integrate multi-scale semantic features. Our model outperforms state-of-the-art methods on datasets such as Kvasir-Seg and CVC-ClinicDB. Additionally, our experiments show that the model exhibits excellent transfer learning capabilities on unseen datasets, making it a robust and generalizable model in the field of polyp segmentation.
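The title refers to a channel attention-enhanced decoder, but the abstract does not say which form of channel attention is used. As a rough illustration only, the sketch below assumes a squeeze-and-excitation style gate: each channel is averaged to a single value, passed through two small fully connected layers, and the resulting weight in (0, 1) rescales that channel. The function name `channel_attention` and the weight matrices `w1` and `w2` are hypothetical and are not taken from the paper; it is written in plain Python so that it runs without any deep learning library.

```python
import math


def channel_attention(feature_map, w1, w2):
    """Squeeze-and-excitation style channel gate (illustrative assumption).

    feature_map: list of C channels, each an H x W list of lists of floats.
    w1: reduction weights, shape (C_reduced x C).
    w2: expansion weights, shape (C x C_reduced).
    Returns the rescaled feature map and the per-channel gates.
    """
    # Squeeze: global average pooling gives one descriptor per channel.
    squeezed = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]
    # Excitation, step 1: fully connected reduction followed by ReLU.
    hidden = [
        max(0.0, sum(w * s for w, s in zip(weights, squeezed)))
        for weights in w1
    ]
    # Excitation, step 2: fully connected expansion followed by sigmoid.
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(weights, hidden))))
        for weights in w2
    ]
    # Scale: multiply every value in a channel by that channel's gate.
    scaled = [
        [[value * gate for value in row] for row in channel]
        for channel, gate in zip(feature_map, gates)
    ]
    return scaled, gates
```

In a decoder of the kind the abstract describes, a gate like this would typically be applied to the fused multi-scale features so that informative channels are emphasised before the final mask prediction; where exactly the authors place it is not stated here.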

