Visual and Semantic Guided Scene Text Retrieval

Abstract: This paper introduces a novel end-to-end trainable network for scene text retrieval. Unlike state-of-the-art methods, which retrieve by matching the visual features of individual character images, our network transforms the entire query text into a single query image. By integrating visual and language modules, the network extracts rich visual and semantic features from the query image, which supports efficient similarity modeling and query matching. This hybrid embedding of visual and semantic features is robust to complex text styles and layouts. Experimental results on multiple benchmark datasets validate the framework, especially in multilingual retrieval, where it achieves a 20.15% increase in mAP over the current state of the art, demonstrating the network's strong potential for multilingual scene text retrieval.
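The matching and evaluation steps mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the fusion by concatenation, the cosine similarity score, and all function names (`hybrid_embedding`, `rank_gallery`, `average_precision`) are assumptions made for illustration, and the feature vectors stand in for the outputs of the visual and language modules. Only the use of mAP as the metric comes from the abstract.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def hybrid_embedding(visual, semantic):
    """Assumed fusion: concatenate the normalized visual and semantic features."""
    return l2_normalize(l2_normalize(visual) + l2_normalize(semantic))


def cosine(a, b):
    """Cosine similarity of two unit-length vectors (a plain dot product)."""
    return sum(x * y for x, y in zip(a, b))


def rank_gallery(query, gallery):
    """Return gallery indices sorted from most to least similar to the query."""
    scores = [(cosine(query, g), i) for i, g in enumerate(gallery)]
    return [i for _, i in sorted(scores, key=lambda s: -s[0])]


def average_precision(ranking, relevant):
    """Average precision of one ranked list, given the set of relevant indices."""
    hits, total = 0, 0.0
    for rank, idx in enumerate(ranking, start=1):
        if idx in relevant:
            hits += 1
            total += hits / rank  # precision at this recall point
    return total / len(relevant) if relevant else 0.0


def mean_average_precision(rankings, relevants):
    """mAP: the mean of the per-query average precision values."""
    aps = [average_precision(r, rel) for r, rel in zip(rankings, relevants)]
    return sum(aps) / len(aps) if aps else 0.0
```

In this sketch each query and each gallery item is reduced to one unit-length vector, so ranking is a dot product followed by a sort, and mAP averages the per-query average precision over all queries.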

Research Square Platform LLC

Hailong Luo Mayire Ibrayim Askar Hamdulla Qilin Deng

2024
