Javascript must be enabled to continue!
Modeling 3D Convolution Architecture for Actions Recognition
View through CrossRef
Abstract
Action recognition infrastructure can be applied anywhere behavior analysis is required and represents presently a domain of maximum actuality in security and surveillance. The model based on 3D Convolutions is a middle ground between simple key-frame approaches based on 2D convolutions, and other more complex approaches based on Recurrent Neural Networks. Behavior analysis represents a domain greatly improved by action recognition. By placing human actions in different categories it is possible to extract statistics regarding a person’s behavior, characteristics, abilities and preferences which can be processed later by specialized personnel, depending on the selected domain. The proposed model follows simple 3D convolution architecture. Hidden layers are composed of a convolution operation, an activation function and, sometimes, a pooling layer. Leaky ReLU was used as activation function to alleviate the problem of vanishing gradients. Batch Normalization is a technique used for scaling and adjusting the output of an activation layer, and it has been used to reduce over-fitting and decrease the training time. The 3D Convolution structure has the advantage of learning spatio-temporal features, because the convolution is applied over a sequence of frames. In the present paper is presented a proposed 3D convolution model that has average results, with an accuracy of approximately 55% on the NTU RGB+D dataset.
American Society of Mechanical Engineers
Title: Modeling 3D Convolution Architecture for Actions Recognition
Description:
Abstract
Action recognition infrastructure can be applied anywhere behavior analysis is required and represents presently a domain of maximum actuality in security and surveillance.
The model based on 3D Convolutions is a middle ground between simple key-frame approaches based on 2D convolutions, and other more complex approaches based on Recurrent Neural Networks.
Behavior analysis represents a domain greatly improved by action recognition.
By placing human actions in different categories it is possible to extract statistics regarding a person’s behavior, characteristics, abilities and preferences which can be processed later by specialized personnel, depending on the selected domain.
The proposed model follows simple 3D convolution architecture.
Hidden layers are composed of a convolution operation, an activation function and, sometimes, a pooling layer.
Leaky ReLU was used as activation function to alleviate the problem of vanishing gradients.
Batch Normalization is a technique used for scaling and adjusting the output of an activation layer, and it has been used to reduce over-fitting and decrease the training time.
The 3D Convolution structure has the advantage of learning spatio-temporal features, because the convolution is applied over a sequence of frames.
In the present paper is presented a proposed 3D convolution model that has average results, with an accuracy of approximately 55% on the NTU RGB+D dataset.
Related Results
The architecture of differences
The architecture of differences
Following in the footsteps of the protagonists of the Italian architectural debate is a mark of culture and proactivity. The synthesis deriving from the artistic-humanistic factors...
Architecture between heteronomy and self-generation
Architecture between heteronomy and self-generation
Introduction
«I have never worked in the technocratic exaltation, solving a constructive problem and that’s it. I’ve always tried to interpret the space of human life» (Vitto...
Relationship Between Weight Correlation of the Convolution Kernels and the Optimal Architecture of CNN
Relationship Between Weight Correlation of the Convolution Kernels and the Optimal Architecture of CNN
Currently, deep learning has been one of the most popular research topics, and it has already been successfully applied in many fields such as image recognition, recommendation sys...
The quantum convolution product
The quantum convolution product
Abstract
In classical statistical mechanics, physical states (probability measures) are embedded in the Banach algebra of complex Borel measures on phase space, wher...
Convolution-Based Approach for modeling the Paliperidone Extended Release and Long-Acting Injectable (LAI) PK of Once-, and Three-Monthly Products Administration and for Optimizing the Development of New LAI products
Convolution-Based Approach for modeling the Paliperidone Extended Release and Long-Acting Injectable (LAI) PK of Once-, and Three-Monthly Products Administration and for Optimizing the Development of New LAI products
Abstract
The aim of this paper was to develop a new modeling approach for describing the paliperidone PK resulting from the administration of extended-release once-a-day or...
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
Enhancing Pangeo-Fish with HEALPix Convolution: Impact Evaluation and Benefits
Enhancing Pangeo-Fish with HEALPix Convolution: Impact Evaluation and Benefits
The Pangeo-Fish project processes biologging data to analyze fish movement and migration patterns.  While SciPy’s convolution methods are robust, they are not op...
Experimental realization of convolution processing in photonic synthetic frequency dimensions
Experimental realization of convolution processing in photonic synthetic frequency dimensions
Convolution is an essential operation in signal and image processing and consumes most of the computing power in convolutional neural networks. Photonic convolution has the promise...

