A Comparative Analysis of Microrings Based Incoherent Photonic GEMM Accelerators

Sairam Sri Vatsavai, Venkata Sai Praneeth Karempudi, Oluwaseun Alo, Ishan Thakkar
University of Kentucky

Abstract

Several microring resonator (MRR) based analog photonic architectures have been proposed to accelerate linear computations, such as general matrix-matrix multiplications (GEMMs), which are abundant in deep neural networks. These architectures offer exceptional throughput and energy efficiency compared to their electronic counterparts by mapping GEMM operations onto naturally linear, low-dissipation, high-speed optical phenomena that act on analog optical signals. To implement GEMM functions, these MRR-based architectures generally manipulate optical signals in five different ways: (i) splitting (copying) of multiple optical signals to achieve a certain fan-out, (ii) aggregation (multiplexing) of multiple optical signals to achieve a certain fan-in, (iii) modulation of optical signals to imprint input values onto the analog signal amplitude, (iv) weighting of modulated optical signals to achieve analog input-weight multiplication, and (v) summation of optical signals. The MRR-based GEMM accelerators from prior works perform the summation of optical signals last. However, they perform the first four manipulations in an arbitrary order, without due deliberation, essentially ignoring the possible impact of this order on their performance. In this paper, we conduct a detailed analysis of accelerator organizations with three different orders of these manipulations: (1) Modulation-Aggregation-Splitting-Weighting (MASW), (2) Aggregation-Splitting-Modulation-Weighting (ASMW), and (3) Splitting-Modulation-Weighting-Aggregation (SMWA). We show, through modeling and analysis of evaluation results, that the MASW, ASMW, and SMWA organizations incur different magnitudes of crosstalk noise and optical signal loss, which gives them different levels of processing parallelism at the circuit level and different throughput and energy-area efficiency at the system level.
Our evaluation results for four CNN models show that the SMWA organization achieves up to 4.4x, 5x, and 5.2x better throughput, energy efficiency, and area-energy efficiency, respectively, compared to the ASMW and MASW organizations on average.
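
To make the five signal manipulations concrete, the following is a minimal, purely numerical sketch of how they compose into a GEMM, following the MASW order. It is an illustration only and is not taken from the paper: the function names (`modulate`, `split`, `weight`, `summate`, `gemm_idealized`) are hypothetical, a Python list stands in for one aggregated (multiplexed) group of signals, and every step is assumed ideal, with no crosstalk noise and no optical loss.

```python
from typing import List

Matrix = List[List[float]]


def modulate(values: List[float]) -> List[float]:
    """Modulation: imprint input values onto signal amplitudes (ideal)."""
    return [float(v) for v in values]


def split(signals: List[float], fan_out: int) -> List[List[float]]:
    """Splitting: copy an aggregated group of signals to fan_out branches.

    Assumption: lossless copies; a real splitter divides optical power.
    """
    return [list(signals) for _ in range(fan_out)]


def weight(signals: List[float], weights: List[float]) -> List[float]:
    """Weighting: scale each modulated signal by its weight."""
    return [s * w for s, w in zip(signals, weights)]


def summate(signals: List[float]) -> float:
    """Summation: accumulate the weighted signals into one output value."""
    return sum(signals)


def gemm_idealized(inputs: Matrix, weights: Matrix) -> Matrix:
    """Compute inputs (M x K) times weights (K x N) with the steps above."""
    n_cols = len(weights[0])
    # One weight vector per output column.
    weight_cols = [[row[j] for row in weights] for j in range(n_cols)]
    result = []
    for input_row in inputs:
        # Modulation, then aggregation: the list is one multiplexed group.
        channels = modulate(input_row)
        # Splitting: one copy of the group per output column.
        copies = split(channels, n_cols)
        # Weighting and summation: one dot product per copy.
        result.append(
            [summate(weight(c, w)) for c, w in zip(copies, weight_cols)]
        )
    return result


if __name__ == "__main__":
    print(gemm_idealized([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
```

In this idealized model, reordering the first four manipulations leaves the numerical result unchanged. The organizations compared in the abstract differ in crosstalk noise and optical signal loss, which this sketch deliberately does not model.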