Javascript must be enabled to continue!
Visual and Semantic Guided Scene Text Retrieval
View through CrossRef
Abstract
In this paper, we introduce a novel end-to-end trainable network for the task of scene text retrieval. Diverging from the state-of-the-art methods that match the visual features of individual character images for retrieval, our network transforms the entire query text into a single query image. By integrating visual and language modules, our network extracts rich visual and semantic features from the query image, facilitating efficient similarity modeling and query matching. This hybrid embedding approach using visual-semantic features of query images shows excellent robustness in dealing with complex text styles and layouts. Experimental results on multiple benchmark datasets validate the superiority of our framework, especially in multilingual retrieval tasks, where our framework achieves a 20.15% increase in mAP score compared to the current state-of-the-art. This significant performance boost showcases the potent potential of our network in multilingual scene text retrieval tasks.
Title: Visual and Semantic Guided Scene Text Retrieval
Description:
Abstract
In this paper, we introduce a novel end-to-end trainable network for the task of scene text retrieval.
Diverging from the state-of-the-art methods that match the visual features of individual character images for retrieval, our network transforms the entire query text into a single query image.
By integrating visual and language modules, our network extracts rich visual and semantic features from the query image, facilitating efficient similarity modeling and query matching.
This hybrid embedding approach using visual-semantic features of query images shows excellent robustness in dealing with complex text styles and layouts.
Experimental results on multiple benchmark datasets validate the superiority of our framework, especially in multilingual retrieval tasks, where our framework achieves a 20.
15% increase in mAP score compared to the current state-of-the-art.
This significant performance boost showcases the potent potential of our network in multilingual scene text retrieval tasks.
Related Results
E-Press and Oppress
E-Press and Oppress
From elephants to ABBA fans, silicon to hormone, the following discussion uses a new research method to look at printed text, motion pictures and a te...
A Semantic Orthogonal Mapping Method Through Deep-Learning for Semantic Computing
A Semantic Orthogonal Mapping Method Through Deep-Learning for Semantic Computing
In order to realize an artificial intelligent system, a basic mechanism should be provided for expressing and processing the semantic. We have presented semantic computing models i...
The nature of automatic semantic retrieval in individuals with mild cognitive impairment
The nature of automatic semantic retrieval in individuals with mild cognitive impairment
The number of people diagnosed with Alzheimer’s disease (AD), a progressive and terminal kind of dementia, continues to rise with an estimated 14 million Americans affected by 2050...
On Flores Island, do "ape-men" still exist? https://www.sapiens.org/biology/flores-island-ape-men/
On Flores Island, do "ape-men" still exist? https://www.sapiens.org/biology/flores-island-ape-men/
<span style="font-size:11pt"><span style="background:#f9f9f4"><span style="line-height:normal"><span style="font-family:Calibri,sans-serif"><b><spa...
Combining Convolutional Neural Network and Markov Random Field for Semantic Image Retrieval
Combining Convolutional Neural Network and Markov Random Field for Semantic Image Retrieval
With the rapidly growing number of images over the Internet, efficient scalable semantic image retrieval becomes increasingly important. This paper presents a novel approach for se...
Improving Sentence Retrieval Using Sequence Similarity
Improving Sentence Retrieval Using Sequence Similarity
Sentence retrieval is an information retrieval technique that aims to find sentences corresponding to an information need. It is used for tasks like question answering (QA) or nove...
New Research Progress in Image Retrieval
New Research Progress in Image Retrieval
Image retrieval is generally divided into two categories: one is text-based Image Retrieval; another is content-based Image Retrieval. Early image retrieval technology is mainly ba...
Semantic Excel: An Introduction to a User-Friendly Online Software Application for Statistical Analyses of Text Data
Semantic Excel: An Introduction to a User-Friendly Online Software Application for Statistical Analyses of Text Data
Semantic Excel (www.semanticexcel.com) is an online software application with a simple, yet powerful interface enabling users to perform statistical analyses on texts. The purpose ...

