Javascript must be enabled to continue!
Equivariance-Guided Rotation-Invariant Self-Supervised Learning with p4-Equivariant CNNs
View through CrossRef
Self-supervised learning (SSL) in computer vision has advanced through joint-embedding methods that learn representations invariant to semantic transformations between image pairs. However, although geometric transformations such as rotation are semantically invariant, learning rotation-invariant representations remains challenging due to the inherently rotation-equivariant nature of object images. Previous methods attempted to improve rotation robustness via equivariant learning, yet a clear performance gap persists between non-rotated and rotated samples. To address these limitations, we propose GIE (Guiding Invariance with Equivariance), a framework that forms rotation-invariant representations guided by the rotation-equivariant structure of images. GIE employs group-equivariant convolutional networks to produce strictly rotation-equivariant feature maps. An equivariance-guided orientation-alignment step then transforms equivariant features into invariant embeddings while preserving discriminative information. This eliminates the need for repeated inferences required in canonicalization, enabling computationally efficient and scalable training within standard SSL frameworks. Experimental results show that across multiple SSL frameworks—including SimCLR, SimSiam, and MoCo v2—GIE significantly improves robustness on rotated data. Notably, it yields up to a 7% gain over the base p4-equivariant CNN and up to a 24% gain over standard ResNet backbones. These results demonstrate the effectiveness of GIE in learning robust, rotation-invariant representations.
Title: Equivariance-Guided Rotation-Invariant Self-Supervised Learning with p4-Equivariant CNNs
Description:
Self-supervised learning (SSL) in computer vision has advanced through joint-embedding methods that learn representations invariant to semantic transformations between image pairs.
However, although geometric transformations such as rotation are semantically invariant, learning rotation-invariant representations remains challenging due to the inherently rotation-equivariant nature of object images.
Previous methods attempted to improve rotation robustness via equivariant learning, yet a clear performance gap persists between non-rotated and rotated samples.
To address these limitations, we propose GIE (Guiding Invariance with Equivariance), a framework that forms rotation-invariant representations guided by the rotation-equivariant structure of images.
GIE employs group-equivariant convolutional networks to produce strictly rotation-equivariant feature maps.
An equivariance-guided orientation-alignment step then transforms equivariant features into invariant embeddings while preserving discriminative information.
This eliminates the need for repeated inferences required in canonicalization, enabling computationally efficient and scalable training within standard SSL frameworks.
Experimental results show that across multiple SSL frameworks—including SimCLR, SimSiam, and MoCo v2—GIE significantly improves robustness on rotated data.
Notably, it yields up to a 7% gain over the base p4-equivariant CNN and up to a 24% gain over standard ResNet backbones.
These results demonstrate the effectiveness of GIE in learning robust, rotation-invariant representations.
Related Results
Equivariance-Guided Rotation-Invariant Self-Supervised Learning with p4-Equivariant CNNs
Equivariance-Guided Rotation-Invariant Self-Supervised Learning with p4-Equivariant CNNs
Self-supervised learning (SSL) in computer vision has advanced through joint-embedding methods that learn representations invariant to semantic transformations between image pairs....
Equivariant parametrized topological complexity
Equivariant parametrized topological complexity
AbstractIn this paper, we define and study an equivariant analogue of Cohen, Farber and Weinberger’s parametrized topological complexity. We show that several results in the non-eq...
Equivariance in Vision for Unsupervised Low Data Regimes
Equivariance in Vision for Unsupervised Low Data Regimes
Vision équivariante en mode non supervisée et en présence d'une faible quantité de données
Cette thèse aborde la question clé de la création de modèles de vision ro...
Rotation invariance and equivariance in 3D deep learning: a survey
Rotation invariance and equivariance in 3D deep learning: a survey
AbstractDeep neural networks (DNNs) in 3D scenes show a strong capability of extracting high-level semantic features and significantly promote research in the 3D field. 3D shapes a...
Introductory Lectures on Equivariant Cohomology
Introductory Lectures on Equivariant Cohomology
Equivariant cohomology is concerned with the algebraic topology of spaces with a group action, or in other words, with symmetries of spaces. First defined in the 1950s, it has been...
General Properties of Equivariant Cohomology
General Properties of Equivariant Cohomology
This chapter assesses the general properties of equivariant cohomology. Both the homotopy quotient and equivariant cohomology are functorial constructions. Equivariant cohomology i...
Is a Fitbit a Diary? Self-Tracking and Autobiography
Is a Fitbit a Diary? Self-Tracking and Autobiography
Data becomes something of a mirror in which people see themselves reflected. (Sorapure 270)In a 2014 essay for The New Yorker, the humourist David Sedaris recounts an obsession spu...
The Histological Diagnosis of Breast Cancer by Employing scale invariant ResNet 18 With Spatial Supervised Technique
The Histological Diagnosis of Breast Cancer by Employing scale invariant ResNet 18 With Spatial Supervised Technique
Abstract
Background
Breast cancer is one of the most prevalent cause of morbidity and mortality in women all over the world. Hi...

