Paper
13 March 2021 Intra CTU depth decision for HEVC by using neural networks
Yanfen Li, Hanxiang Wang, L. Minh Dang, Khawar Islam, Hae Kwang Kim
Author Affiliations +
Proceedings Volume 11766, International Workshop on Advanced Imaging Technology (IWAIT) 2021; 1176630 (2021) https://doi.org/10.1117/12.2589191
Event: International Workshop on Advanced Imaging Technology 2021 (IWAIT 2021), 2021, Online Only
Abstract
As a video encoding standard, High Efficiency Video Coding (HEVC) achieves excellent performance while causing a dramatic increase in coding complexity. Especially, the coding tree unit (CTU) depth decision process is the most complicated section, which takes heavy computation complexity in the entire HEVC intra coding process. Therefore, a deep learning-based method is applied to directly predict the CTU depth level for each frame in this study. In addition, a large-scale dataset that contains the coding unit image files and the corresponding depths was generated by HM16.15 to train and test the deep learning model. Besides, a Convolutional Neural Network called LeNet is fine-tuned by modifying the original architecture, and then the model with a more complicated structure is evaluated and compared on an acquired dataset. The experiments show that the fine-tuned deep learning model has the ability to identify accurately the depth level of CTU, the recognition accuracy reaches over 98.6%.
© (2021) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yanfen Li, Hanxiang Wang, L. Minh Dang, Khawar Islam, and Hae Kwang Kim "Intra CTU depth decision for HEVC by using neural networks", Proc. SPIE 11766, International Workshop on Advanced Imaging Technology (IWAIT) 2021, 1176630 (13 March 2021); https://doi.org/10.1117/12.2589191
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
Back to Top