Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Localizing and Extracting Caption in News Video Using Multi-Frame Average

View through CrossRef
News video is a very important video source. Caption in a news video can help us to understand the semantics of video content directly. A caption localization and extraction approach for news video will be proposed. This approach applies a new Multi-Frame Average (MFA) method to reduce the complexity of the background of the image. A time-based average pixel value search is employed and a Canny edge detection is performed to get the edge map. Then, a horizontal scan and a vertical scan on this edge map are used to obtain the top, bottom, left and right boundaries of the rectangles of candidate captions. Then, some rules are applied to confirm the caption. Experimental results show that the proposed approach can reduce the background complexity in most cases, and achieves a high precision and recall. Finally, we analyze the relationship between background variation of frame sequence and detection performance in detail.
Title: Localizing and Extracting Caption in News Video Using Multi-Frame Average
Description:
News video is a very important video source.
Caption in a news video can help us to understand the semantics of video content directly.
A caption localization and extraction approach for news video will be proposed.
This approach applies a new Multi-Frame Average (MFA) method to reduce the complexity of the background of the image.
A time-based average pixel value search is employed and a Canny edge detection is performed to get the edge map.
Then, a horizontal scan and a vertical scan on this edge map are used to obtain the top, bottom, left and right boundaries of the rectangles of candidate captions.
Then, some rules are applied to confirm the caption.
Experimental results show that the proposed approach can reduce the background complexity in most cases, and achieves a high precision and recall.
Finally, we analyze the relationship between background variation of frame sequence and detection performance in detail.

Related Results

Audio and video editing system design based on OpenCV
Audio and video editing system design based on OpenCV
With the rapid development of the Internet, a new carrier for people to perceive the world and communicate with each other - audio and video - is gradually being favoured by the pu...
Makna Voice Over dalam Pemberitaan Feature di Televisi
Makna Voice Over dalam Pemberitaan Feature di Televisi
Abstract. Voice Over or what is known as VO is being discussed a lot, not only about the profession, but also from the industry side and the various voice over techniques used. Due...
The Canberra Bubble
The Canberra Bubble
According to the ABC television program Four Corners, “Parliament House in Canberra is a hotbed of political intrigue and high tension … . It’s known as the ‘Canberra Bubble’ and i...
Reconstructing the Media Space of Digital News from Visualization to Spatial Immersion in the Case of “Dong News”
Reconstructing the Media Space of Digital News from Visualization to Spatial Immersion in the Case of “Dong News”
The rapid development of motion news highlights the necessity of exploring spatial transformations in news communication, particularly the evolution from two-dimensional news visua...
Understanding the Research Challenges in Low-Resource Language and Linking Bilingual News Articles in Multilingual News Archive
Understanding the Research Challenges in Low-Resource Language and Linking Bilingual News Articles in Multilingual News Archive
The developed world has focused on Web preservation compared to the developing world, especially news preservation for future generations. However, the news published online is vol...
Video tracking for marketing applications
Video tracking for marketing applications
Traçage du contenu marketing vidéo Au cours des dernières décennies, la production et la consommation de vidéos ont considérablement augmenté et il est communément ...

Back to Top