Javascript must be enabled to continue!
Auxiliary Governed Attention: A Governable, Inference-time Auxiliary Attention Mechanism with Sovereign Boundaries for Frozen Transformers
View through CrossRef
We introduce Auxiliary Governed Attention (AGA), a post-hoc, inference-time auxiliary attention mechanism designed for frozen pre-trained Transformers. Unlike retrieval-augmented generation or parameter-efficient fine-tuning, AGA operates as a governable side-channel that injects bounded corrective signals into the primary attention output, without modifying model parameters or externalizing knowledge into prompts. AGA is built around the principle of sovereign separation between primary and auxiliary reasoning, enabling strict control over when and how auxiliary information may influence inference. We formalize this control through: (i) a governance framework that distinguishes intrinsic, borrowed, and governed knowledge; (ii) explicit lifecycle semantics for auxiliary learning units (probationary, confirmed, deprecated, quarantined); (iii) a hierarchical rights model regulating activation, contribution, and persistence during inferencedesigned to prevent reasoning drift and ghost knowledge accumulation. In addition, AGA employs primary-entropy-gated intervention, ensuring that auxiliary attention is activated only when the primary model exhibits uncertainty. Experiments on commonsense reasoning (CommonsenseQA, PIQA), long-context question answering (Natural Questions, HotpotQA), and continual knowledge injection tasks show consistent improvements of 2.1-4.3 percentage points with less than 0.6% parameter overhead. Ablation studies demonstrate that governance mechanisms are critical for long-term stability, preventing uncontrolled accumulation of auxiliary knowledge and collapse into implicit retrieval-based behavior.
Title: Auxiliary Governed Attention: A Governable, Inference-time Auxiliary Attention Mechanism with Sovereign Boundaries for Frozen Transformers
Description:
We introduce Auxiliary Governed Attention (AGA), a post-hoc, inference-time auxiliary attention mechanism designed for frozen pre-trained Transformers.
Unlike retrieval-augmented generation or parameter-efficient fine-tuning, AGA operates as a governable side-channel that injects bounded corrective signals into the primary attention output, without modifying model parameters or externalizing knowledge into prompts.
AGA is built around the principle of sovereign separation between primary and auxiliary reasoning, enabling strict control over when and how auxiliary information may influence inference.
We formalize this control through: (i) a governance framework that distinguishes intrinsic, borrowed, and governed knowledge; (ii) explicit lifecycle semantics for auxiliary learning units (probationary, confirmed, deprecated, quarantined); (iii) a hierarchical rights model regulating activation, contribution, and persistence during inferencedesigned to prevent reasoning drift and ghost knowledge accumulation.
In addition, AGA employs primary-entropy-gated intervention, ensuring that auxiliary attention is activated only when the primary model exhibits uncertainty.
Experiments on commonsense reasoning (CommonsenseQA, PIQA), long-context question answering (Natural Questions, HotpotQA), and continual knowledge injection tasks show consistent improvements of 2.
1-4.
3 percentage points with less than 0.
6% parameter overhead.
Ablation studies demonstrate that governance mechanisms are critical for long-term stability, preventing uncontrolled accumulation of auxiliary knowledge and collapse into implicit retrieval-based behavior.
Related Results
Studi Literatur: Aplikasi dan Fungsi Porang (Amorphophallus Oncophyllus) dalam Frozen Yoghurt
Studi Literatur: Aplikasi dan Fungsi Porang (Amorphophallus Oncophyllus) dalam Frozen Yoghurt
Abstract — Frozen yoghurt is a frozen desserts made with yoghurt and quite similar to ice cream but low in calorie, which cointains milk, sweetener, stabilizers, emulsifier, and la...
CREATION OF A STRUCTURAL MODEL OF AN POWER TRANSFORMERS IN THE FORM OF AC TRANSFORMING COMPLEXES
CREATION OF A STRUCTURAL MODEL OF AN POWER TRANSFORMERS IN THE FORM OF AC TRANSFORMING COMPLEXES
Due to the multiple transformation of electrical energy, the rated capacity of power transformers can be 8 or more times the rated generation capacity. Therefore, the state of reli...
Influence of Soil Salinization on Active Layer Thickness of Frozen Soil
Influence of Soil Salinization on Active Layer Thickness of Frozen Soil
The climate of the Qinghai–Tibet Plateau is distinct. Given the large temperature difference between day and night, drought in perennial years, low rainfall and large evaporation v...
Analisis Faktor - Faktor Yang Mempengaruhi Preferensi Konsumen Dalam Pembelian Produk Frozen Food (Studi Kasus Pelanggan “Nadelia Frozen” Patumbak Medan)
Analisis Faktor - Faktor Yang Mempengaruhi Preferensi Konsumen Dalam Pembelian Produk Frozen Food (Studi Kasus Pelanggan “Nadelia Frozen” Patumbak Medan)
Frozen food is food that is processed and then packaged in half-cooked packaging and when consumed must go through a re-processing process, namely by heating it in a frying pan. Fr...
Sistem Prediksi Penjualan Frozen Food dengan Metode Monte Carlo (Studi Kasus: Supermama Frozen Food)
Sistem Prediksi Penjualan Frozen Food dengan Metode Monte Carlo (Studi Kasus: Supermama Frozen Food)
Abstract. Frozen Food Sales Prediction System Case Study of Supermama Frozen Food Using the Monte Carlo Method. Frozen processed food is increasingly popular, so frozen food stores...
Frozen Goals: Identifying and Defining a New Type of Goal
Frozen Goals: Identifying and Defining a New Type of Goal
Goals pursuit involves multiple stages from setting the goal to actively pursuing the goal to finally achieving or abandoning the goal. Sometimes, however, individuals may set a go...
On the Remote Calibration of Instrumentation Transformers: Influence of Temperature
On the Remote Calibration of Instrumentation Transformers: Influence of Temperature
The remote calibration of instrumentation transformers is theoretically possible using synchronous measurements across a transmission line with a known impedance and a local set of...
Comparative Study Between English And Korean Interrogative Sentences
Comparative Study Between English And Korean Interrogative Sentences
This study discusses English And Korean Interrogative Sentences. It is aimed at describing the forms and types of English and Korean Interrogative Sentences and finding out the sim...

