Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Scale-Aware Network with Scale Equivariance

View through CrossRef
The convolutional neural network (CNN) has achieved good performance in object classification due to its inherent translation equivariance, but its scale equivariance is poor. A Scale-Aware Network (SA Net) with scale equivariance is proposed to estimate the scale during classification. The SA Net only learns samples of one scale in the training stage; in the testing stage, the unknown-scale testing samples are up-sampled and down-sampled, and a group of image copies with different scales are generated to form the image pyramid. The up-sampling adopts interpolation, and the down-sampling adopts interpolation combined with wavelet transform to avoid spectrum aliasing. The generated test samples with different scales are sent to the Siamese network with weight sharing for inferencing. According to the position of the maximum value of the classification-score matrix, the testing samples can be classified and the scale can be estimated simultaneously. The results on the MNIST and FMNIST datasets show that the SA Net has better performance than the existing methods. When the scale is larger than 4, the SA Net has higher classification accuracy than other methods. In the scale-estimation experiment, the SA Net can achieve low relative RMSE on any scale. The SA Net has potential for effective use in remote sensing, optical image recognition and medical diagnosis in cytohistology.
Title: Scale-Aware Network with Scale Equivariance
Description:
The convolutional neural network (CNN) has achieved good performance in object classification due to its inherent translation equivariance, but its scale equivariance is poor.
A Scale-Aware Network (SA Net) with scale equivariance is proposed to estimate the scale during classification.
The SA Net only learns samples of one scale in the training stage; in the testing stage, the unknown-scale testing samples are up-sampled and down-sampled, and a group of image copies with different scales are generated to form the image pyramid.
The up-sampling adopts interpolation, and the down-sampling adopts interpolation combined with wavelet transform to avoid spectrum aliasing.
The generated test samples with different scales are sent to the Siamese network with weight sharing for inferencing.
According to the position of the maximum value of the classification-score matrix, the testing samples can be classified and the scale can be estimated simultaneously.
The results on the MNIST and FMNIST datasets show that the SA Net has better performance than the existing methods.
When the scale is larger than 4, the SA Net has higher classification accuracy than other methods.
In the scale-estimation experiment, the SA Net can achieve low relative RMSE on any scale.
The SA Net has potential for effective use in remote sensing, optical image recognition and medical diagnosis in cytohistology.

Related Results

Equivariance in Vision for Unsupervised Low Data Regimes
Equivariance in Vision for Unsupervised Low Data Regimes
Vision équivariante en mode non supervisée et en présence d'une faible quantité de données Cette thèse aborde la question clé de la création de modèles de vision ro...
Network Automation
Network Automation
Purpose: The article "Network Automation in the Contemporary Economy" explores the concepts and methods of effective network management. The application stack, Jinja template engin...
Equivariance-Guided Rotation-Invariant Self-Supervised Learning with p4-Equivariant CNNs
Equivariance-Guided Rotation-Invariant Self-Supervised Learning with p4-Equivariant CNNs
Self-supervised learning (SSL) in computer vision has advanced through joint-embedding methods that learn representations invariant to semantic transformations between image pairs....
Equivariance-Guided Rotation-Invariant Self-Supervised Learning with p4-Equivariant CNNs
Equivariance-Guided Rotation-Invariant Self-Supervised Learning with p4-Equivariant CNNs
Self-supervised learning (SSL) in computer vision has advanced through joint-embedding methods that learn representations invariant to semantic transformations between image pairs....
Detection of gene communities in multi-networks reveals cancer drivers
Detection of gene communities in multi-networks reveals cancer drivers
In the past years the advent of high-throughput experimental technologies provided biologists with a flood of molecular data. This huge amount of information requires the design of...
The synergistic effect of ego-network stability and whole network position: a perspective of transnational coopetition network
The synergistic effect of ego-network stability and whole network position: a perspective of transnational coopetition network
PurposeThe authors selected global automobile manufacturing firms whose sales ranked within 100 in the five years from 2014 to 2018 in the Factiva database to examine how the chara...
Modified neural networks for rapid recovery of tokamak plasma parameters for real time control
Modified neural networks for rapid recovery of tokamak plasma parameters for real time control
Two modified neural network techniques are used for the identification of the equilibrium plasma parameters of the Superconducting Steady State Tokamak I from external magnetic mea...
ANALISIS KETERKAITAN KEKERASAN DENGAN PERBUATAN CABUL TERHADAP ANAK
ANALISIS KETERKAITAN KEKERASAN DENGAN PERBUATAN CABUL TERHADAP ANAK
<span id="page3R_mcid85" class="markedContent"><em><span style="left: calc(var(--scale-factor)*85.10px); top: calc(var(--scale-factor)*399.05px); font-size: calc(var...

Back to Top