Javascript must be enabled to continue!
A General-Purpose Sound Taxonomy for the Classification of Heterogeneous Sound Collections
View through CrossRef
Abstract
The task of sound classification is fundamental for decoding information embedded in the audio domain. Taxonomies are integral to this process by defining the way in which the audio domain is conceptualized. Existing sound taxonomies often feature complex hierarchies and can be very specific to a particular use case. Thus, navigating and describing audio content using such taxonomies becomes a difficult task, not only for computers but also for non-expert humans. In this paper, we introduce a user-centric methodology that integrates both theoretical and data-driven approaches to formulate a general-purpose and user-friendly taxonomy for sound classification, the Broad Sound Taxonomy (BST). The taxonomy comprises a two-level hierarchical structure with 5 broad top-level categories and 23 second-level categories capturing finer semantic distinctions. To inform the taxonomy's design process, we leverage the context of the Freesound website, a popular example of a large and heterogeneous sound collection with a substantial user base. We consider the needs of different user profiles including sound designers, musicians, and researchers, and we conduct a user-based evaluation to assess the usefulness of the taxonomy and users' understanding of its categories. Results suggest that the taxonomy is appropriate for classifying sounds of diverse nature and can be used effectively by participants with different levels of expertise. The proposed taxonomy has several applications in the fields of multimedia retrieval, machine listening, audio analysis and, more generally, computational audio.
Title: A General-Purpose Sound Taxonomy for the Classification of Heterogeneous Sound Collections
Description:
Abstract
The task of sound classification is fundamental for decoding information embedded in the audio domain.
Taxonomies are integral to this process by defining the way in which the audio domain is conceptualized.
Existing sound taxonomies often feature complex hierarchies and can be very specific to a particular use case.
Thus, navigating and describing audio content using such taxonomies becomes a difficult task, not only for computers but also for non-expert humans.
In this paper, we introduce a user-centric methodology that integrates both theoretical and data-driven approaches to formulate a general-purpose and user-friendly taxonomy for sound classification, the Broad Sound Taxonomy (BST).
The taxonomy comprises a two-level hierarchical structure with 5 broad top-level categories and 23 second-level categories capturing finer semantic distinctions.
To inform the taxonomy's design process, we leverage the context of the Freesound website, a popular example of a large and heterogeneous sound collection with a substantial user base.
We consider the needs of different user profiles including sound designers, musicians, and researchers, and we conduct a user-based evaluation to assess the usefulness of the taxonomy and users' understanding of its categories.
Results suggest that the taxonomy is appropriate for classifying sounds of diverse nature and can be used effectively by participants with different levels of expertise.
The proposed taxonomy has several applications in the fields of multimedia retrieval, machine listening, audio analysis and, more generally, computational audio.
Related Results
Towards a Taxonomy of Systemic Risks
Towards a Taxonomy of Systemic Risks
Systemic risks, emerging from dynamic interactions among natural, technological, and societal systems, pose multifaceted challenges to modern, interconnected societies. These risks...
Biological collections and ecological/environmental research: a review, some observations and a look to the future
Biological collections and ecological/environmental research: a review, some observations and a look to the future
Housed worldwide, mostly in museums and herbaria, is a vast collection of biological specimens developed over centuries. These biological collections, and associated taxonomic and ...
Clinical taxonomy of Subjective Cognitive Decline: evidence from a Memory Clinic Cohort
Clinical taxonomy of Subjective Cognitive Decline: evidence from a Memory Clinic Cohort
Abstract
Background
Subjective cognitive decline (SCD) is recognized as a potential early risk factor for Alzheimer's dis...
Towards a Global Collection Description Standard
Towards a Global Collection Description Standard
With digitisation of natural history collections over the past decades, their traditional roles — for taxonomic studies and public education — have been greatly expanded into the f...
Modeling methods for dispersive sound speed profiles of the Martian atmosphere and their effects on sound propagation paths
Modeling methods for dispersive sound speed profiles of the Martian atmosphere and their effects on sound propagation paths
At present, Mars acoustic detection is gradually becoming an important new tool for the knowledge and exploration of Mars. To explore the sources of Mars sound, it is necessary to ...
OntoExtractor
OntoExtractor
This chapter introduces OntoExtractor, a tool for the semi-automatic generation of the taxonomy from a set of documents or data sources. The tool generates the taxonomy in a bottom...
The Role of Static Pressure and Temperature in Building Acoustics
The Role of Static Pressure and Temperature in Building Acoustics
The influence of static pressure and temperature on sound reduction indices, impact sound pressure levels, improvements of impact sound pressure levels and sound reduction indices,...
Putting collections back on the ground: mana taonga, mātauranga Māori and NATHIST museum collections
Putting collections back on the ground: mana taonga, mātauranga Māori and NATHIST museum collections
<p><strong>Technological, cultural and climatic changes are shifting natural history collection management (NATHIST CM) more and more into a post-normal science environ...

