Lightweight Human Pose Estimation Algorithm Based on Polarized Self-Attention
Abstract
In recent years, human pose estimation has been widely used in human-computer interaction, augmented reality, video surveillance, and many other fields, but the task still faces many challenges. To address the large parameter counts and high computational cost of current mainstream human pose estimation networks, this paper proposes a lightweight pose estimation network based on a polarized self-attention mechanism, the Lightweight Polarized Network (LPNet). First, ghost convolution is used to reduce the number of parameters in the feature extraction network. Second, a polarized self-attention module is introduced to better handle the pixel-level regression task, compensating for the features lost when the parameter count is reduced and improving the accuracy of human keypoint regression. Finally, a new coordinate decoding method is designed to reduce the error introduced when decoding heatmaps, further improving keypoint regression accuracy. The proposed method was evaluated on the human keypoint detection datasets COCO and MPII and compared with current mainstream methods. The experimental results show that it greatly reduces the number of model parameters at the cost of only a small loss in accuracy.
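To give a rough sense of why ghost convolution reduces the parameter count, the sketch below compares the weights in a plain convolution with those in a ghost module as it is commonly described: a primary convolution produces a fraction of the output channels, and cheap depthwise operations generate the rest. The abstract does not give LPNet's layer sizes or ghost settings, so the function names, the ratio s, the cheap-kernel size d, and the channel counts are all illustrative assumptions, not the authors' configuration.

```python
def standard_conv_params(c_in, c_out, k):
    """Weights in a plain k x k convolution (bias ignored)."""
    return c_in * c_out * k * k


def ghost_conv_params(c_in, c_out, k, s=2, d=3):
    """Weights in a ghost module (bias ignored), under assumed settings.

    s: ratio of total output maps to intrinsic maps (assumed, not from the paper)
    d: kernel size of the cheap depthwise operations (assumed, not from the paper)
    """
    if c_out % s != 0:
        raise ValueError("this sketch requires c_out to be divisible by s")
    intrinsic = c_out // s
    # Primary convolution: produces only the intrinsic feature maps.
    primary = c_in * intrinsic * k * k
    # Cheap operations: each intrinsic map yields (s - 1) extra maps
    # through a d x d depthwise filter.
    cheap = intrinsic * (s - 1) * d * d
    return primary + cheap


if __name__ == "__main__":
    # Hypothetical layer: 64 input channels, 128 output channels, 3 x 3 kernel.
    std = standard_conv_params(64, 128, 3)
    ghost = ghost_conv_params(64, 128, 3)
    print(f"standard: {std}, ghost: {ghost}, ratio: {std / ghost:.2f}")
```

For this hypothetical layer the standard convolution has 73728 weights and the ghost module 37440, a reduction close to the factor s = 2. The saving for a whole network depends on which layers are replaced, which the abstract does not specify.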
Related Results
Pose estimation for robotic percussive riveting.
Recently, a robotic percussive riveting system has been developed at Ryerson University for an automation of percussive riveting process of aero-structural fastening assembly. The ...
Ultra-thin single-layered high-efficiency focusing metasurface lens
For potential applications of metasurfaces in lens technologies, we propose a cross circularly polarized focusing metasurface which is capable of transforming a circularly polarize...
Deep Learning for Realistic Virtual Clothes Fitting
With the continuous growth of the online shopping industry, determining how a particular garment would appear on us is challenging. To overcome this, this project presents a web-ba...
STSP-Net: A Spatial-Temporal Skeletal Perception Network for Robust 3D Pose Estimation in Children's Sports
Introduction: Children's sports motion pose estimation has significant applications in sports training, health monitoring, and rehabilitation assessment. However, existing 3D pose ...
The Research of Long-Optical-Path Visible Laser Polarization Characteristics in Smoke Environment
The concentration of smoke in an environment can cause obvious interference to visible light intensity imaging, and it is a non-negligible factor in the polarized imaging of ground...
Lightweight Joint Loss 2D Pose Estimation Network Based on CM-RTMPose
Human pose estimation tasks often need to be deployed on edge devices. While existing human pose estimation networks can achieve good accuracy, their complex networ...
Animal Pose Estimation Algorithm Based on the Lightweight Stacked Hourglass Network
Pose estimation has been a hot topic in the field of machine vision in recent years. In the pose estimation task, a lightweight stacked hourglass network (SHN) alg...

