Javascript must be enabled to continue!

All on board

Machine learning models offer transformative benefits across disciplines such as medicine, chemistry, and physics. However, as these models grow in size and usage, their energy demands increase dramatically, raising sustainability concerns. Neuromorphic hardware, inspired by the energy efficiency of the human brain, seeks to address this challenge by offering low-power, fast-processing alternatives to conventional computing. A key difference between neuromorphic systems and traditional architectures is the absence of shared memory between neurons, which poses a challenge to implementing learning algorithms. Yet, the brain learns effectively under this constraint, indicating the potential for machine learning methods to be adapted to such hardware. This research presents the design and implementation of the first ever fully on-chip neuromorphic Loihi 2 reinforcement learning agent. This circuit consists of a fully embedded Q-learning algorithm and an on-chip simulation of the CartPole-v0 environment on Intel's Loihi 2 neuromorphic processor. The system successfully trained agents that solved the CartPole-v0 task 36% of the time. Among all agents trained on Loihi 2, the top 50% achieved an average episode reward of 193.01, which is very near the benchmark score of 195 required to solve the task. In comparison, a similar Q-learning algorithm implemented on a conventional Intel Core i7-10870H CPU solved the task 62% of the time, with the top 50% of agents achieving a perfect score of 200. Despite the lower success rate, Loihi 2 exhibited major advantages in efficiency. During training, its dynamic power draw was only 0.02 watts, compared to 12 watts on the CPU. Execution time per weight update was also faster: 58.42 microseconds on Loihi 2 versus 162.54 microseconds on the CPU. Consequently, the neuromorphic system can train the same number of successful agents in only 44% of the required time for the CPU and with 600 times less power. This translates to a 1,365-fold increase in energy efficiency for an equivalent level of training success, and a 4,149-fold increase during inference. These findings demonstrate the viability of reinforcement learning on neuromorphic hardware and highlight its promise for building energy-efficient, real-time, embedded AI systems, as well as bring neuromorphic computing closer to realizing its potential as the backbone of a new sustainable, brain-like generation of AI.

Drexel University Libraries

Steven Christian Nesbit Edward Kim

2025

Title: All on board

Description:

Machine learning models offer transformative benefits across disciplines such as medicine, chemistry, and physics.

However, as these models grow in size and usage, their energy demands increase dramatically, raising sustainability concerns.

Neuromorphic hardware, inspired by the energy efficiency of the human brain, seeks to address this challenge by offering low-power, fast-processing alternatives to conventional computing.

A key difference between neuromorphic systems and traditional architectures is the absence of shared memory between neurons, which poses a challenge to implementing learning algorithms.

Yet, the brain learns effectively under this constraint, indicating the potential for machine learning methods to be adapted to such hardware.

This research presents the design and implementation of the first ever fully on-chip neuromorphic Loihi 2 reinforcement learning agent.

This circuit consists of a fully embedded Q-learning algorithm and an on-chip simulation of the CartPole-v0 environment on Intel's Loihi 2 neuromorphic processor.

The system successfully trained agents that solved the CartPole-v0 task 36% of the time.

Among all agents trained on Loihi 2, the top 50% achieved an average episode reward of 193.

01, which is very near the benchmark score of 195 required to solve the task.

In comparison, a similar Q-learning algorithm implemented on a conventional Intel Core i7-10870H CPU solved the task 62% of the time, with the top 50% of agents achieving a perfect score of 200.

Despite the lower success rate, Loihi 2 exhibited major advantages in efficiency.

During training, its dynamic power draw was only 0.

02 watts, compared to 12 watts on the CPU.

Execution time per weight update was also faster: 58.

42 microseconds on Loihi 2 versus 162.

54 microseconds on the CPU.

Consequently, the neuromorphic system can train the same number of successful agents in only 44% of the required time for the CPU and with 600 times less power.

This translates to a 1,365-fold increase in energy efficiency for an equivalent level of training success, and a 4,149-fold increase during inference.

These findings demonstrate the viability of reinforcement learning on neuromorphic hardware and highlight its promise for building energy-efficient, real-time, embedded AI systems, as well as bring neuromorphic computing closer to realizing its potential as the backbone of a new sustainable, brain-like generation of AI.

Back

Abstract Introduction The short half-life of standard factor VIII (FVIII) products means that frequent injections (3 to 4 times/week) are needed for e...

A Phase 1b, Dose-Finding Study Of Ruxolitinib Plus Panobinostat In Patients With Primary Myelofibrosis (PMF), Post–Polycythemia Vera MF (PPV-MF), Or Post–Essential Thrombocythemia MF (PET-MF): Identification Of The Recommended Phase 2 Dose

Abstract Background Myelofibrosis (MF) is a myeloproliferative neoplasm associated with progressive, debilitating symptoms that ...

EFFECTS OF BOARD CHARACTERISTICS ON FINANCIAL REPORTING QUALITY OF NIGERIA LISTED COMMERCIAL BANKS: A SYSTEM GMM APPROACH

This study investigates the effects of board characteristics on the financial reporting quality of Nigeria-listed commercial banks, focusing on Board Independence (BOI), Managerial...

Dynamics of Mutations in Patients with ET Treated with Imetelstat

Abstract Background: Imetelstat, a first in class specific telomerase inhibitor, induced hematologic responses in all patients (pts) with essential thrombocythemia (...

Combinatorial Antigen Targeting Strategy for Acute Myeloid Leukemia

Introduction: Efforts to safely and effectively treat acute myeloid leukemia (AML) by targeting a single leukemia associated antigen with chimeric antigen receptor T (CAR T) cells ...

Risk of Infections with BCMA-Directed Immunotherapy in Multiple Myeloma

Abstract Introduction: B cell maturation antigen (BCMA) is a novel target for T cell immunotherapy in MM including bispecific antibody (bsAb) and chimeric antigen re...

Do board chairs matter? The influence of board chairs on firm performance

Research summary : We use a variance decomposition methodology to assess the degree to which board chairs may influence their companies' performance. To isola...

Efficacy and Safety of Subcutaneous Prophylaxis with Concizumab in Patients with Hemophilia a or B with Inhibitors: Results from explorer4, a Phase 2, Randomized, Open-Label, Controlled Trial

Introduction Concizumab is an anti-tissue factor pathway inhibitor (TFPI) monoclonal antibody in clinical development for the subcutaneous prophylactic treatment of hemophilia pati...

Email:
Password:

Email:

All on board

Related Results