University of Toronto
With current Field-Programmable Gate Arrays (FPGAs), designers can instantiate several embedded processors, memory units, and a wide variety of IP blocks to build a single-chip, high-performance multiprocessor embedded system. Furthermore, multi-FPGA systems can be built to provide massive parallelism, given an efficient programming model. In this paper, the authors present a lightweight implementation of a subset of the standard Message Passing Interface (MPI) that is suitable for embedded processors. It does not require an operating system and has a small memory footprint.