Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Visual SLAM system based on dynamic semantic features

View through CrossRef
Aiming at the problem that dynamic objects (such as pedestrians, vehicles, and animals) in real scenes affect the positioning and mapping accuracy of visual SLAM (Simultaneous Localization and Mapping), a YOLOv3-ORB-SLAM3 algorithm is proposed based on ORB-SLAM3 (Oriented FAST and Rotated BRIEF-Simultaneous Localization and Mapping 3). The algorithm adds a semantic thread on the basis of ORB-SLAM3, and adopts a dual-thread mechanism for dynamic and static scene feature extraction: the semantic thread uses YOLOv3 to perform semantic recognition target detection on dynamic objects in the scene, and removes outliers from the extracted dynamic area feature points; the tracking thread extracts scene area features through ORB features, and obtains static scene features combined with semantic information and sends them to the backend, thereby eliminating the interference of dynamic scenes on the system and improving the positioning accuracy of the visual SLAM algorithm. Verified using the TUM (Technical University of Munich) dataset, the results show that the ATE (Average Treatment Effect) index of the YOLOv3-ORB-SLAM3 algorithm in the monocular mode is about 30% lower than that of the ORB-SLAM3 algorithm, and the ATE index of the dynamic sequence in the RGB-D (Red, Green and Blue-Depth) mode is reduced 10%, while the static sequence has no significant decrease.
Title: Visual SLAM system based on dynamic semantic features
Description:
Aiming at the problem that dynamic objects (such as pedestrians, vehicles, and animals) in real scenes affect the positioning and mapping accuracy of visual SLAM (Simultaneous Localization and Mapping), a YOLOv3-ORB-SLAM3 algorithm is proposed based on ORB-SLAM3 (Oriented FAST and Rotated BRIEF-Simultaneous Localization and Mapping 3).
The algorithm adds a semantic thread on the basis of ORB-SLAM3, and adopts a dual-thread mechanism for dynamic and static scene feature extraction: the semantic thread uses YOLOv3 to perform semantic recognition target detection on dynamic objects in the scene, and removes outliers from the extracted dynamic area feature points; the tracking thread extracts scene area features through ORB features, and obtains static scene features combined with semantic information and sends them to the backend, thereby eliminating the interference of dynamic scenes on the system and improving the positioning accuracy of the visual SLAM algorithm.
Verified using the TUM (Technical University of Munich) dataset, the results show that the ATE (Average Treatment Effect) index of the YOLOv3-ORB-SLAM3 algorithm in the monocular mode is about 30% lower than that of the ORB-SLAM3 algorithm, and the ATE index of the dynamic sequence in the RGB-D (Red, Green and Blue-Depth) mode is reduced 10%, while the static sequence has no significant decrease.

Related Results

Information metrics for localization and mapping
Information metrics for localization and mapping
Decades of research have made possible the existence of several autonomous systems that successfully and efficiently navigate within a variety of environments under certain conditi...
GNV2-SLAM: vision SLAM system for cowshed inspection robots
GNV2-SLAM: vision SLAM system for cowshed inspection robots
Simultaneous Localization and Mapping (SLAM) has emerged as one of the foundational technologies enabling mobile robots to achieve autonomous navigation, garnering significant atte...
Dynamic SLAM: A Visual SLAM in Outdoor Dynamic Scenes
Dynamic SLAM: A Visual SLAM in Outdoor Dynamic Scenes
Abstract Simultaneous localization and mapping (SLAM) has been widely used in augmented reality(AR), virtual reality(VR), robotics, and autonomous vehicles as the theoretic...
A Semantic Orthogonal Mapping Method Through Deep-Learning for Semantic Computing
A Semantic Orthogonal Mapping Method Through Deep-Learning for Semantic Computing
In order to realize an artificial intelligent system, a basic mechanism should be provided for expressing and processing the semantic. We have presented semantic computing models i...
A survey: which features are required for dynamic visual simultaneous localization and mapping?
A survey: which features are required for dynamic visual simultaneous localization and mapping?
AbstractIn recent years, simultaneous localization and mapping in dynamic environments (dynamic SLAM) has attracted significant attention from both academia and industry. Some pion...
Performance Analysis of the Microsoft Kinect Sensor for 2D Simultaneous Localization and Mapping (SLAM) Techniques
Performance Analysis of the Microsoft Kinect Sensor for 2D Simultaneous Localization and Mapping (SLAM) Techniques
This paper presents a performance analysis of two open-source, laser scanner-based Simultaneous Localization and Mapping (SLAM) techniques (i.e., Gmapping and Hector SLAM) using a...
Event based SLAM
Event based SLAM
(English) Event-based cameras are novel sensors with a bio-inspired design that exhibit a high dynamic range and extremely low latency. They sensing principle is different than the...
Exploiting Wikipedia Semantics for Computing Word Associations
Exploiting Wikipedia Semantics for Computing Word Associations
<p><b>Semantic association computation is the process of automatically quantifying the strength of a semantic connection between two textual units based on various lexi...

Back to Top