PULSAR
2.0.0
Parallel Ultra-Light Systolic Array Runtime
|
![]() ![]() | |
![]() ![]() | |
![]() ![]() | |
![]() ![]() | |
![]() ![]() | |
![]() ![]() | |
![]() ![]() | |
![]() ![]() | Callback data for finishing a local communication |
![]() ![]() | Callback data for queueing a local communication |
![]() ![]() | Callback data for releasing a device packet |
![]() ![]() | VDP's data channel. Implements a data link between a pair of VDPs. Identifies the source and destination VDPs by tuples. Contains a thread-safe list of data packets |
![]() ![]() | PRT configuration |
![]() ![]() | VSA's accelerator device. Represents a hardware accelerator. Currently synonymous with an Nvidia GPU |
![]() ![]() | Mapping of VDPs to hardware |
![]() ![]() | VDP's data packet A packet of data transferred through VDP's channels |
![]() ![]() | VSA's proxy |
![]() ![]() | MPI communication request for a packet. Contains a packet, some info, MPI request and MPI status |
![]() ![]() | VSA's worker thread. Represents a single CPU core or a collection of cores |
![]() ![]() | Local transfer object |
![]() ![]() | Virtual Data Processor (VDP). Is uniquely identified by a tuple. Fires for a predefined number of cycles. Has a fixed number of input and output channels. Has a persistent local store. Has access to read-only global store |
![]() ![]() | Virtual Systolic Array (VSA) VSA contains global informationa about the system, a local communication proxy, an array of local worker threads, and an array of local accelerator devices |
![]() ![]() |