Contextual Patch-NetVLAD: Context-Aware Patch Feature Descriptor and Patch Matching Mechanism for Visual Place Recognition
The goal of visual place recognition (VPR) is to determine the location of a query image by identifying its place within a database of images. Visual sensor technologies are crucial to VPR because they enable precise identification and localization of query images within a database. Global descriptor-based VPR methods struggle to capture the distinctive local regions within a scene, which increases the probability of confusion during localization. To address feature extraction and feature matching challenges in VPR, we propose a modified patch-NetVLAD strategy with two new modules: a context-aware patch descriptor and a context-aware patch matching mechanism. First, we propose a context-driven patch feature descriptor that overcomes the limitations of global and local descriptors by aggregating features from each patch’s surrounding neighborhood. Second, we introduce a context-driven feature matching mechanism that uses cluster and saliency context-driven weighting rules to assign higher weights to patches that are less similar to densely populated or locally similar regions, which improves localization performance. We incorporate both modules into the patch-NetVLAD framework, resulting in a new approach called contextual patch-NetVLAD. Experimental results show that the proposed approach outperforms other state-of-the-art methods, achieving Recall@10 scores of 99.82 on Pittsburgh30k, 99.82 on FMDataset, and 97.68 on our benchmark dataset.
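For readers unfamiliar with the reported metric, the following is a minimal sketch of how Recall@N is commonly computed in VPR evaluation: a query counts as correctly localized if at least one of its top-N retrieved database images is a true match. The function name `recall_at_n`, the dictionary layout, and the toy image IDs are assumptions made for illustration; the abstract does not describe the authors' evaluation code.

```python
from typing import Dict, List, Set


def recall_at_n(
    ranked: Dict[str, List[str]],  # query ID -> database IDs, best match first
    truth: Dict[str, Set[str]],    # query ID -> set of correct database IDs
    n: int,
) -> float:
    """Return the percentage of queries with a true match in their top-n results."""
    if not ranked:
        raise ValueError("no queries given")
    hits = 0
    for query_id, candidates in ranked.items():
        positives = truth.get(query_id, set())
        # One correct image anywhere in the top n is enough for a hit.
        if any(c in positives for c in candidates[:n]):
            hits += 1
    return 100.0 * hits / len(ranked)


# Hypothetical toy retrieval results (not data from the paper).
ranked = {
    "q1": ["db3", "db7", "db1"],
    "q2": ["db2", "db9", "db4"],
    "q3": ["db5", "db6", "db8"],
    "q4": ["db0", "db1", "db2"],
}
truth = {"q1": {"db1"}, "q2": {"db2"}, "q3": {"db0"}, "q4": {"db2", "db3"}}

recall_1 = recall_at_n(ranked, truth, 1)
recall_3 = recall_at_n(ranked, truth, 3)
```

On this toy data only one of the four queries has a correct image at rank 1, while three of the four have one within the top 3, so the score rises as N grows. A Recall@10 figure is read the same way, with N = 10.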

