Paper
1 June 2020 HEVC intra prediction mode classification by deep learning
Hanxiang Wang, Yanfen Li, L. Minh Dang, Hae Kwang Kim
Author Affiliations +
Proceedings Volume 11515, International Workshop on Advanced Imaging Technology (IWAIT) 2020; 115151D (2020) https://doi.org/10.1117/12.2566252
Event: International Workshop on Advanced Imaging Technologies 2020 (IWAIT 2020), 2020, Yogyakarta, Indonesia
Abstract
In High Efficiency Video Coding (HEVC) standard, the best intra prediction mode is decided by choosing the smallest ratedistortion cost of actual encoding among the total of 35 modes with the MPM (Most Probable Mode) scheme for compression purpose of mode encoding with reference to the adjacent reference blocks of the current prediction unit. This causes heavy computational complexity. In this paper, a deep neural network is conceived and experimented as a probable module for the intra prediction mode decision process inside of the HEVC encoding scheme. The neural network is trained and tested with a ground-truth dataset constructed from actual HEVC Intra encoding of original images. For the performance of the test, accuracy is used as the percentage of the correct mode output by the designed neural network to the ground-truth mode. The experimental results show that the neural network does not give good accuracy for the correct mode. However, accuracy increased when similar angle mode is considered as the correct mode. Also, the special modes of DC and Planar for MPM are analyzed in this paper.
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Hanxiang Wang, Yanfen Li, L. Minh Dang, and Hae Kwang Kim "HEVC intra prediction mode classification by deep learning", Proc. SPIE 11515, International Workshop on Advanced Imaging Technology (IWAIT) 2020, 115151D (1 June 2020); https://doi.org/10.1117/12.2566252
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Neural networks

Computer programming

Data modeling

Convolutional neural networks

Performance modeling

Video coding

Computer engineering

RELATED CONTENT

Analysis on the number of layers in the transformer based...
Proceedings of SPIE (December 28 2022)
Self-attention on RNN-based text classification
Proceedings of SPIE (October 03 2022)
Sequence-to-sequence image caption generator
Proceedings of SPIE (March 15 2019)
A hybrid and regenerative model chat robot based on LSTM...
Proceedings of SPIE (October 28 2021)

Back to Top