Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

MarkYolo: An Enhanced YOLOv10 Network with Dynamic Convolution and Attention Mechanism for Circular Marker Detection in High-Speed Video Measurement

View through CrossRef
In high-speed video measurement, accurate detection of circular markers is critical for applications in structural analysis, motion tracking, and industrial automation. Traditional marker detection methods often struggle with challenges such as dynamic occlusion, complex backgrounds, and scale variations. To address these issues, this paper proposes MarkYolo, an enhanced object detection framework based on YOLOv10 tailored for robust circular marker detection. Key innovations include: (1) Omni-dimensional Dynamic Convolution (ODConv) integrated into a novel COD module to capture multi-dimensional contextual features while reducing computational complexity; (2) an Adaptive Fine-Grained Channel Attention (AFGCAttention) mechanism to enhance small object localization by adaptively fusing global and local information; and (3) Normalized Wasserstein Distance (NWD) loss to improve robustness against positional shifts and scale variations by modeling bounding boxes as Gaussian distributions. Experiments on the CME dataset demonstrate that MarkYolo achieves a state-of-the-art AP50-95 of 75.4%, outperforming the baseline YOLOv10 by 4.9% while maintaining real-time efficiency. The model also reduces false positives and missed detections in complex scenarios, offering significant advancements for high-speed photogrammetry applications. Further ablation studies validate the synergistic contributions of each proposed module, highlighting improvements in recall (96.4%), precision (98.3%), and computational efficiency (8.6 GFLOPs). This work provides a practical solution for enhancing marker detection accuracy in dynamic environments and lays a foundation for future lightweight deployments on edge devices.
Auricle Global Society of Education and Research
Title: MarkYolo: An Enhanced YOLOv10 Network with Dynamic Convolution and Attention Mechanism for Circular Marker Detection in High-Speed Video Measurement
Description:
In high-speed video measurement, accurate detection of circular markers is critical for applications in structural analysis, motion tracking, and industrial automation.
Traditional marker detection methods often struggle with challenges such as dynamic occlusion, complex backgrounds, and scale variations.
To address these issues, this paper proposes MarkYolo, an enhanced object detection framework based on YOLOv10 tailored for robust circular marker detection.
Key innovations include: (1) Omni-dimensional Dynamic Convolution (ODConv) integrated into a novel COD module to capture multi-dimensional contextual features while reducing computational complexity; (2) an Adaptive Fine-Grained Channel Attention (AFGCAttention) mechanism to enhance small object localization by adaptively fusing global and local information; and (3) Normalized Wasserstein Distance (NWD) loss to improve robustness against positional shifts and scale variations by modeling bounding boxes as Gaussian distributions.
Experiments on the CME dataset demonstrate that MarkYolo achieves a state-of-the-art AP50-95 of 75.
4%, outperforming the baseline YOLOv10 by 4.
9% while maintaining real-time efficiency.
The model also reduces false positives and missed detections in complex scenarios, offering significant advancements for high-speed photogrammetry applications.
Further ablation studies validate the synergistic contributions of each proposed module, highlighting improvements in recall (96.
4%), precision (98.
3%), and computational efficiency (8.
6 GFLOPs).
This work provides a practical solution for enhancing marker detection accuracy in dynamic environments and lays a foundation for future lightweight deployments on edge devices.

Related Results

Graph convolutional neural networks for 3D data analysis
Graph convolutional neural networks for 3D data analysis
(English) Deep Learning allows the extraction of complex features directly from raw input data, eliminating the need for hand-crafted features from the classical Machine Learning p...
Audio and video editing system design based on OpenCV
Audio and video editing system design based on OpenCV
With the rapid development of the Internet, a new carrier for people to perceive the world and communicate with each other - audio and video - is gradually being favoured by the pu...
Smart Surveillance for Fall Detection with YOLOV10 in Unstructured Outdoor Settings
Smart Surveillance for Fall Detection with YOLOV10 in Unstructured Outdoor Settings
Abstract - Falls are one of the most common and dangerous problems in industrial areas and open spaces, often causing serious injuries and safety issues. It's hard to detect falls ...
MMS-YOLOv10: A fast and improved pavement surface defect detection model based on YOLOv10
MMS-YOLOv10: A fast and improved pavement surface defect detection model based on YOLOv10
Abstract Pavement defect detection greatly affects pavement service life and vehicle operation safety. Current pavement defect detection models encounter difficulti...
Vehicle detection in drone aerial views based on lightweight OSD-YOLOv10
Vehicle detection in drone aerial views based on lightweight OSD-YOLOv10
Abstract To address the challenges of low performance in vehicle image detection from UAV aerial imagery, difficulties in small target feature extraction, and the large p...
REAL-TIME OBJECT DETECTION MODEL USING YOLOv10
REAL-TIME OBJECT DETECTION MODEL USING YOLOv10
From security systems to driverless cars, object detection is essential to many applications. The main goal of this project is to use YOLOv10 and RCNN (Region-Convolutional Neural ...
NETWORK VIDEO CONTENT AS A FORM OF UNIVERSITY PROMOTION
NETWORK VIDEO CONTENT AS A FORM OF UNIVERSITY PROMOTION
In the context of visualization and digitalization of media consumption, network video content is becoming an important form of university promotion in the educational services mar...
Safety Helmet Detection And License Plate Detection Using Advanced Yolov10
Safety Helmet Detection And License Plate Detection Using Advanced Yolov10
This project presents an advanced computer vision system for realtime Safety Helmet Detection and License Plate Recognition using the latest YOLOv10 object detection architecture. ...

Back to Top