Javascript must be enabled to continue!
Underwater-art: Expanding information perspectives with text templates for underwater acoustic target recognition
View through CrossRef
Underwater acoustic target recognition is an intractable task due to the complex acoustic source characteristics and sound propagation patterns. Limited by insufficient data and narrow information perspective, recognition models based on deep learning seem far from satisfactory in practical underwater scenarios. Although underwater acoustic signals are severely influenced by distance, channel depth, or other factors, annotations of relevant information are often nonuniform, incomplete, and hard to use. In this work, the proposal is to implement underwater acoustic recognition based on templates made up of rich relevant information (UART). The templates are designed to integrate relevant information from different perspectives into descriptive natural language. UART adopts an audio-spectrogram-text trimodal contrastive learning framework, which endows UART with the ability to guide the learning of acoustic representations by descriptive natural language. These experiments reveal that UART has better recognition capability and generalization performance than traditional paradigms. Furthermore, the pretrained UART model could provide superior prior knowledge for the recognition model in the scenario without any auxiliary annotation.
Acoustical Society of America (ASA)
Title: Underwater-art: Expanding information perspectives with text templates for underwater acoustic target recognition
Description:
Underwater acoustic target recognition is an intractable task due to the complex acoustic source characteristics and sound propagation patterns.
Limited by insufficient data and narrow information perspective, recognition models based on deep learning seem far from satisfactory in practical underwater scenarios.
Although underwater acoustic signals are severely influenced by distance, channel depth, or other factors, annotations of relevant information are often nonuniform, incomplete, and hard to use.
In this work, the proposal is to implement underwater acoustic recognition based on templates made up of rich relevant information (UART).
The templates are designed to integrate relevant information from different perspectives into descriptive natural language.
UART adopts an audio-spectrogram-text trimodal contrastive learning framework, which endows UART with the ability to guide the learning of acoustic representations by descriptive natural language.
These experiments reveal that UART has better recognition capability and generalization performance than traditional paradigms.
Furthermore, the pretrained UART model could provide superior prior knowledge for the recognition model in the scenario without any auxiliary annotation.
Related Results
Present status and challenges of underwater acoustic target recognition technology: A review
Present status and challenges of underwater acoustic target recognition technology: A review
Future naval warfare has placed high demands on underwater targets’ target detection, target recognition, and opposition resistance, among other things. However, the ocean’s comple...
On Flores Island, do "ape-men" still exist? https://www.sapiens.org/biology/flores-island-ape-men/
On Flores Island, do "ape-men" still exist? https://www.sapiens.org/biology/flores-island-ape-men/
<span style="font-size:11pt"><span style="background:#f9f9f4"><span style="line-height:normal"><span style="font-family:Calibri,sans-serif"><b><spa...
Exploring target imaging in underwater bubble group environment based on polarization information
Exploring target imaging in underwater bubble group environment based on polarization information
Underwater optical imaging is an important way to implement the seabed exploration and target recognition. There occur a lot of bubbles due to the sea wave, ship wake, marine creat...
E-Press and Oppress
E-Press and Oppress
From elephants to ABBA fans, silicon to hormone, the following discussion uses a new research method to look at printed text, motion pictures and a te...
Emerging underwater survey technologies: A review and future outlook
Emerging underwater survey technologies: A review and future outlook
Emerging underwater survey technologies are revolutionizing the way we explore and understand the underwater world. This review examines the latest advancements in underwater surve...
Edge Enhanced CrackNet for Underwater Crack Detection of Concrete Dams
Edge Enhanced CrackNet for Underwater Crack Detection of Concrete Dams
Underwater crack detection in dam structures is of significant engineering importance and scientific value for ensuring the structural safety, assessing operational conditions, and...
Optimizing Underwater Vision: A Rigorous Investigation into CNN's Deep Image Enhancement for Subaquatic Scenes
Optimizing Underwater Vision: A Rigorous Investigation into CNN's Deep Image Enhancement for Subaquatic Scenes
In this paper, Convolutional Neural Networks were used to enhance the visual fidelity of underwater images. The UWCNN is introduced in this article, which utilizes underwater scene...
Acoustic cloaking design based on penetration manipulation with combination acoustic metamaterials
Acoustic cloaking design based on penetration manipulation with combination acoustic metamaterials
The acoustic wave transmission manipulation ability is the most important performance for the acoustic metamaterials. To manipulate the acoustic transmission, the combination acous...


