Paper
27 January 2023 An energy-efficient quantized inference framework for electro-photonic computing system
Zeyu Lin, Hongfei Li, Sheng Zhang, Yaxiong Lei, Sen Li
Author Affiliations +
Proceedings Volume 12550, International Conference on Optical and Photonic Engineering (icOPEN 2022); 125501E (2023) https://doi.org/10.1117/12.2666848
Event: International Conference on Optical and Photonic Engineering (icOPEN 2022), 2022, ONLINE, China
Abstract
Deep Learning is developing rapidly and has made breakthroughs in various fields. As a result, the number of parameters in deep neural networks (DNNs) is scaling about 5× rate of Moore’s Law. To meet the computing power needs of DNNs, new computing architectures have been proposed, and the electro-photonic computing system is one of them. With the physical characteristics of photons, the electro-photonic computing system has demonstrated great potential in improving the efficiency of convolution calculation. However, there are still much space for improvement in the energy efficiency of the electro-photonic computing system. And the high energy consumption led by the high-precision analog-to-digital converters (ADCs) is an important factor hindering the energy efficiency. ADCs dominate about 50% energy consumption of the entire system, and they will increase exponentially with the precision. But if the low-precision ADCs are used directly, they will destroy the computing accuracy in the convolution calculation and reduce the inference performance of DNNs. In this paper, we propose an energy-efficient quantized inference framework for electro-photonic computing system to balance the energy consumption of ADCs and the DNNs inference performance, which includes a multi-scale quantization scheme based on per-slice granularity and width variable optical matrix. The proposed framework can reduce the energy consumption of the system by using low-precision ADCs, and ensure the inference performance of the DNNs at the same time. The experiments are carried out in MobileNet-v2, ResNet50 and Vgg16 respectively. The experimental results show that the proposed quantized inference framework can reduce the precision of ADCs by 7 bits and compared with using the low-precision ADCs directly, it improves the model accuracy of 40.5%, 70.2% and 65.4% respectively, which is close to the full precision model.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Zeyu Lin, Hongfei Li, Sheng Zhang, Yaxiong Lei, and Sen Li "An energy-efficient quantized inference framework for electro-photonic computing system", Proc. SPIE 12550, International Conference on Optical and Photonic Engineering (icOPEN 2022), 125501E (27 January 2023); https://doi.org/10.1117/12.2666848
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Quantization

Optical computing

Analog to digital converters

Computing systems

Convolution

Error analysis

Optical matrix switches

RELATED CONTENT


Back to Top