SpEx: A Tool for Visualising and Navigating Speech Audio

Audio is a ubiquitous form of information that is usually treated as a single, unbreakable piece of content. As a result, audio interfaces remain simple, usually consisting of play, pause, forward, and rewind controls. Spoken audio can contain useful information across multiple topics, and finding the desired information is usually time-consuming because most audio players do not reveal the content of the audio. Using the speech transcript and acoustic qualities of the audio, I developed a tool, SpEx, which enabled search and navigation within spoken audio. SpEx displayed audio as discrete segments and revealed the topic content of each segment using mature Information Visualisation techniques. Audio segments were produced from the acoustic and sentence properties of speech to identify topically and aurally distinct regions. A user study found that SpEx allowed users to find information in spoken audio quickly and reliably. Making spoken audio more accessible gives people access to a wider range of information.
Victoria University of Wellington Library

Related Results

Aerosol retrievals from the ACEPOL Campaign
Abstract. In this paper, we present aerosol retrieval results from the ACEPOL (Aerosol Characterization from Polarimeter and Lidar) campaign, which was a joint initiative between N...
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Smart manufacturing has been developed since the introduction of Industry 4.0. It consists of resource sharing and networking, predictive engineering, and material and data analyti...
The Neural Mechanisms of Private Speech in Second Language Learners’ Oral Production: An fNIRS Study
Background: According to Vygotsky’s sociocultural theory, private speech functions both as a tool for thought regulation and as a transitional form between outer and inner speech. ...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Preaching With Audio-Visuals
Problem: The increasing usage of audio-visuals in modern communication has brought about an increase in communication efficiency, an efficiency that is sometimes lacking in sermons...
Perbandingan Tingkat Kemiripan Rekaman Suara Menggunakan Metode Itakura Saito Distance untuk Mendukung Analisa Audio Forensik (Comparison of Voice Recording Similarity Using the Itakura-Saito Distance Method to Support Audio Forensic Analysis)
Audio refers to sound in the form of electrical or digital signals. Digital audio is often used to record, store, and transmit audio, because it can easily be ...