
Equivariance-Guided Rotation-Invariant Self-Supervised Learning with p4-Equivariant CNNs

Self-supervised learning (SSL) in computer vision has advanced through joint-embedding methods that learn representations invariant to semantic transformations between image pairs. However, although geometric transformations such as rotation are semantically invariant, learning rotation-invariant representations remains challenging due to the inherently rotation-equivariant nature of object images. Previous methods attempted to improve rotation robustness via equivariant learning, yet a clear performance gap persists between non-rotated and rotated samples. To address these limitations, we propose GIE (Guiding Invariance with Equivariance), a framework that forms rotation-invariant representations guided by the rotation-equivariant structure of images. GIE employs group-equivariant convolutional networks to produce strictly rotation-equivariant feature maps. An equivariance-guided orientation-alignment step then transforms equivariant features into invariant embeddings while preserving discriminative information. This eliminates the need for repeated inferences required in canonicalization, enabling computationally efficient and scalable training within standard SSL frameworks. Experimental results show that across multiple SSL frameworks—including SimCLR, SimSiam, and MoCo v2—GIE significantly improves robustness on rotated data. Notably, it yields up to a 7% gain over the base p4-equivariant CNN and up to a 24% gain over standard ResNet backbones. These results demonstrate the effectiveness of GIE in learning robust, rotation-invariant representations.
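The relationship between p4 equivariance and an invariant embedding can be illustrated with a small toy example. The sketch below is not the GIE implementation: the abstract does not specify the orientation-alignment rule, so the `align` function (which picks the lexicographically largest cyclic shift) is a stand-in assumption, and all function names are hypothetical. It assumes a single lifting layer built from four 90-degree rotations of each filter, followed by spatial sum pooling, and uses integer data so the comparisons are exact.

```python
import random

def rot90(m):
    """Rotate a square 2D list by 90 degrees counter-clockwise."""
    n = len(m)
    return [[m[j][n - 1 - i] for j in range(n)] for i in range(n)]

def correlate(img, ker):
    """'Valid' 2D cross-correlation of a square image with a square kernel."""
    k = len(ker)
    out = len(img) - k + 1
    return [[sum(img[i + a][j + b] * ker[a][b]
                 for a in range(k) for b in range(k))
             for j in range(out)] for i in range(out)]

def p4_pooled_features(img, kernels):
    """Apply each kernel in its 4 rotations and sum every response map.

    Returns a list of 4 tuples, one per filter orientation; each tuple holds
    one pooled value per kernel.
    """
    pooled = []
    rotated = list(kernels)
    for _ in range(4):
        pooled.append(tuple(sum(map(sum, correlate(img, k))) for k in rotated))
        rotated = [rot90(k) for k in rotated]
    return pooled

def align(pooled):
    """Toy alignment (assumption): choose a canonical cyclic shift of the
    orientation axis, here the lexicographically largest one."""
    return max(pooled[s:] + pooled[:s] for s in range(4))

def invariant_embedding(img, kernels):
    return align(p4_pooled_features(img, kernels))

rng = random.Random(0)
image = [[rng.randint(0, 9) for _ in range(8)] for _ in range(8)]
kernels = [[[rng.randint(-2, 2) for _ in range(3)] for _ in range(3)]
           for _ in range(2)]

base = p4_pooled_features(image, kernels)
turned = p4_pooled_features(rot90(image), kernels)

# Rotating the input cyclically shifts the orientation axis by one step.
shift_ok = turned == base[-1:] + base[:-1]
# After alignment, the embedding no longer depends on the input rotation.
invariant_ok = (invariant_embedding(rot90(image), kernels)
                == invariant_embedding(image, kernels))
print(shift_ok, invariant_ok)
```

Both flags come out `True`: a 90-degree input rotation only permutes the orientation axis of the pooled features, so any rule that fixes a canonical shift of that axis yields a rotation-invariant embedding from a single forward pass. Unlike plain pooling over orientations, the aligned embedding keeps all four orientation responses rather than collapsing them.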

Elsevier BV

Sangjun Han, Woojin Cheong, Kwanghyun Ko, Changhoon Song, Myungjoo Kang

2026



