SpikeRL: A Scalable and Energy-efficient Framework for Deep Spiking Reinforcement Learning

Tahmid, Tokey; Gates, Mark; Luszczek, Piotr; Schuman, Catherine D.

Title	SpikeRL: A Scalable and Energy-efficient Framework for Deep Spiking Reinforcement Learning
Publication Type	Conference Paper
Year of Publication	2026
Authors	Tahmid, T., M. Gates, P. Luszczek, and C. D. Schuman
Conference Name	2025 International Conference on Neuromorphic Systems (ICONS)
Date Published	2026-01
Publisher	IEEE
Conference Location	Seattle, WA, USA
Keywords	Message Passing Interface (MPI), mixed precision, Reinforcement Learning, Spiking Neural Network
Abstract	In the era of dramatic growth of AI, massive investments in large-scale data-driven AI systems demand high-performance computing, which in turn consume tremendous amounts of energy and resources. This trend raises new challenges in optimizing sustainability without sacrificing scalability or performance. Among the energy-efficient alternatives of the traditional Von Neumann architecture, neuromorphic computing and its Spiking Neural Networks (SNNs) are a promising choice due to their inherent energy efficiency. However, in some real-world application scenarios such as complex continuous control tasks, SNNs often lack the performance optimizations that traditional artificial neural networks have. Researchers have addressed this by combining SNNs with Deep Reinforcement Learning (DeepRL), yet scalability remains unexplored. In this paper, we extend our previous work on SpikeRL, which is a scalable and energy efficient framework for DeepRL-based SNNs for continuous control. In our initial implementation of SpikeRL framework, we depended on the population encoding from the Population-coded Spiking Actor Network (PopSAN) method for our SNN model and implemented distributed training with Message Passing Interface (MPI) through mpi4py. We also optimized our model training by using mixed-precision for parameter updates. In our new SpikeRL framework, we have implemented our own DeepRL-SNN component with population encoding, and distributed training with PyTorch Distributed package with NCCL backend while still optimizing with mixed precision training. Our new SpikeRL implementation is 4.26x faster and 2.25x more energy efficient than state-of-the-art DeepRL-SNN methods. Our proposed SpikeRL framework demonstrates a truly scalable and sustainable solution for complex continuous control tasks in real-world applications.
URL	https://ieeexplore.ieee.org/document/11345902/
DOI	10.1109/ICONS69015.2025.00033
