Javascript must be enabled to continue!

Contextual Patch-NetVLAD: Context-Aware Patch Feature Descriptor and Patch Matching Mechanism for Visual Place Recognition

The goal of visual place recognition (VPR) is to determine the location of a query image by identifying its place in a collection of image databases. Visual sensor technologies are crucial for visual place recognition as they allow for precise identification and location of query images within a database. Global descriptor-based VPR methods face the challenge of accurately capturing the local specific regions within a scene; consequently, it leads to an increasing probability of confusion during localization in such scenarios. To tackle feature extraction and feature matching challenges in VPR, we propose a modified patch-NetVLAD strategy that includes two new modules: a context-aware patch descriptor and a context-aware patch matching mechanism. Firstly, we propose a context-driven patch feature descriptor to overcome the limitations of global and local descriptors in visual place recognition. This descriptor aggregates features from each patch’s surrounding neighborhood. Secondly, we introduce a context-driven feature matching mechanism that utilizes cluster and saliency context-driven weighting rules to assign higher weights to patches that are less similar to densely populated or locally similar regions for improved localization performance. We further incorporate both of these modules into the patch-NetVLAD framework, resulting in a new approach called contextual patch-NetVLAD. Experimental results are provided to show that our proposed approach outperforms other state-of-the-art methods to achieve a Recall@10 score of 99.82 on Pittsburgh30k, 99.82 on FMDataset, and 97.68 on our benchmark dataset.

MDPI AG

Wenyuan Sun Wentang Chen Runxiang Huang Jing Tian

Sensors

2024

Title: Contextual Patch-NetVLAD: Context-Aware Patch Feature Descriptor and Patch Matching Mechanism for Visual Place Recognition

Description:

The goal of visual place recognition (VPR) is to determine the location of a query image by identifying its place in a collection of image databases.

Visual sensor technologies are crucial for visual place recognition as they allow for precise identification and location of query images within a database.

Global descriptor-based VPR methods face the challenge of accurately capturing the local specific regions within a scene; consequently, it leads to an increasing probability of confusion during localization in such scenarios.

To tackle feature extraction and feature matching challenges in VPR, we propose a modified patch-NetVLAD strategy that includes two new modules: a context-aware patch descriptor and a context-aware patch matching mechanism.

Firstly, we propose a context-driven patch feature descriptor to overcome the limitations of global and local descriptors in visual place recognition.

This descriptor aggregates features from each patch’s surrounding neighborhood.

Secondly, we introduce a context-driven feature matching mechanism that utilizes cluster and saliency context-driven weighting rules to assign higher weights to patches that are less similar to densely populated or locally similar regions for improved localization performance.

We further incorporate both of these modules into the patch-NetVLAD framework, resulting in a new approach called contextual patch-NetVLAD.

Experimental results are provided to show that our proposed approach outperforms other state-of-the-art methods to achieve a Recall@10 score of 99.

82 on Pittsburgh30k, 99.

82 on FMDataset, and 97.

68 on our benchmark dataset.

Back

Abstract 1. For animal species that forage on patchily distributed resources, patch time allocation is of prime importance to their reproductive success. Accord...

Quantitative structure-activity relationship study on the MMP-13 inhibitory activity of fused pyrimidine derivatives possessing a 1,2,4-Triazol-3-yl group as a ZBG

QSAR study has been carried out on the MMP-13 inhibitory activity of fused pyrimidine derivatives possessing a1,2,4-triazol-3-yl group as a ZBG in 0D- to 2D-Dragon descriptors. The...

A method for feature matching in images using descriptor structures

A method of feature matching in images using descriptor structures is considered in the work. The descriptors in the developed method can be any known solutions in the field of com...

Reducing Computational Complexity in Vision Transformers Using Patch Slimming

Vision Transformers (ViTs) have emerged as a dominant class of deep learning models for image recognition tasks, demonstrating superior performance compared to traditional Convolut...

Local descriptor for retinal fundus image registration

A feature‐based retinal image registration (RIR) technique aligns multiple fundus images and composed of pre‐processing, feature point extraction, feature descriptor, matching and ...

2021 Census to Census Coverage Survey Matching Results.

The 2021 England and Wales Census was matched to the Census Coverage Survey (CCS). This was an essential requisite for estimating undercount in the Census. To ensure outputs could ...

Depth-aware salient object segmentation

Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...

Refining intra-patch connectivity measures in landscape fragmentation and connectivity indices

Abstract Context. Measuring intra-patch connectivity, i.e. the connectivity within a habitat patch, is important to evaluate landscape fragmentation and connectivity. Howev...

Email:
Password:

Email:

Contextual Patch-NetVLAD: Context-Aware Patch Feature Descriptor and Patch Matching Mechanism for Visual Place Recognition

Related Results