Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

A-I-RAVEN and I-RAVEN-Mesh: Two New Benchmarks for Abstract Visual Reasoning

View through CrossRef
We study generalization and knowledge reuse capabilities of deep neural networks in the domain of abstract visual reasoning (AVR), employing Raven's Progressive Matrices (RPMs), a recognized benchmark task for assessing AVR abilities. Two knowledge transfer scenarios referring to the I-RAVEN dataset are investigated. Firstly, inspired by generalization assessment capabilities of the PGM dataset and popularity of I-RAVEN, we introduce Attributeless-I-RAVEN (A-I-RAVEN), a benchmark with 10 generalization regimes that allow to systematically test generalization of abstract rules applied to held-out attributes at various levels of complexity (primary and extended regimes). In contrast to PGM, A-I-RAVEN features compositionality, a variety of figure configurations, and does not require substantial computational resources. Secondly, we construct I-RAVEN-Mesh, a dataset that enriches RPMs with a novel component structure comprising line-based patterns, facilitating assessment of progressive knowledge acquisition in transfer learning setting. We evaluate 13 strong models from the AVR literature on the introduced datasets, revealing their specific shortcomings in generalization and knowledge transfer.
Title: A-I-RAVEN and I-RAVEN-Mesh: Two New Benchmarks for Abstract Visual Reasoning
Description:
We study generalization and knowledge reuse capabilities of deep neural networks in the domain of abstract visual reasoning (AVR), employing Raven's Progressive Matrices (RPMs), a recognized benchmark task for assessing AVR abilities.
Two knowledge transfer scenarios referring to the I-RAVEN dataset are investigated.
Firstly, inspired by generalization assessment capabilities of the PGM dataset and popularity of I-RAVEN, we introduce Attributeless-I-RAVEN (A-I-RAVEN), a benchmark with 10 generalization regimes that allow to systematically test generalization of abstract rules applied to held-out attributes at various levels of complexity (primary and extended regimes).
In contrast to PGM, A-I-RAVEN features compositionality, a variety of figure configurations, and does not require substantial computational resources.
Secondly, we construct I-RAVEN-Mesh, a dataset that enriches RPMs with a novel component structure comprising line-based patterns, facilitating assessment of progressive knowledge acquisition in transfer learning setting.
We evaluate 13 strong models from the AVR literature on the introduced datasets, revealing their specific shortcomings in generalization and knowledge transfer.

Related Results

Contributions to bluetooth low energy mesh networks
Contributions to bluetooth low energy mesh networks
Bluetooth Low Energy (BLE) has become a popular Internet of Things (IoT) technology. However, it was originally designed to only support the star topology. This PhD thesis investig...
Aktivitas Penghambatan Enzim α-Amilase dan Penyerapan Kolesterol Dari Serat Pangan Alga Eucheuma spinosum
Aktivitas Penghambatan Enzim α-Amilase dan Penyerapan Kolesterol Dari Serat Pangan Alga Eucheuma spinosum
Alga Eucheuma spinosum didapatkan langsung dari seorang nelayan (Pulau Nain Sulahwesi Utara), yang sudah banyak dibudidayakan dan ditelitih. Penelitian ini bertujuan untuk menentuk...
A-I-RAVEN and I-RAVEN-Mesh: Two New Benchmarks for Abstract Visual Reasoning
A-I-RAVEN and I-RAVEN-Mesh: Two New Benchmarks for Abstract Visual Reasoning
We study generalization and knowledge reuse capabilities of deep neural networks in the domain of abstract visual reasoning (AVR), employing Raven's Progressive Matrices (RPMs), a ...
Study on Electromagnetic Shielding of Infrared /Visible Optical Window
Study on Electromagnetic Shielding of Infrared /Visible Optical Window
In allusion to electromagnetic radiation damage that existed in daily life, social safety and military field, electromagnetic shielding technology of infrared and infrared optical ...
Polypropylene Pelvic Mesh: What Went Wrong and What Will Be of the Future?
Polypropylene Pelvic Mesh: What Went Wrong and What Will Be of the Future?
Background: Polypropylene (PP) pelvic mesh is a synthetic mesh made of PP polymer used to treat pelvic organ prolapse (POP). Its use has become highly controversial due to reports ...
Optimisation in Neurosymbolic Learning Systems
Optimisation in Neurosymbolic Learning Systems
In the last few years, Artificial Intelligence (AI) has reached the public consciousness through high-profile applications such as chatbots, image generators, speech synthesis and ...
Approaching the Construction of Arguments in Postgraduate Education Programs
Approaching the Construction of Arguments in Postgraduate Education Programs
Constructing arguments, applying logical reasoning, and developing intellectual skills are fundamental to academic success in postgraduate education and qualitative research. The s...

Back to Top