Presentation + Paper
17 September 2018 Intra prediction with deep learning
Raz Birman, Yoram Segal, Avishay David-Malka, Ofer Hadar
Author Affiliations +
Abstract
One fundamental component of video compression standards is Intra-Prediction. Intra-Prediction takes advantage of redundancy in the information of neighboring pixel values within video frames to predict blocks of pixels from their surrounding pixels and thus allowing to transmit the prediction errors instead of the pixel values themselves. The prediction errors are of smaller values than the pixels themselves, thus allowing to accomplish compression of the video stream. Prevalent standards take advantage of intra-frame pixel value dependencies to perform prediction at the encoder end and transfer only residual errors to the decoder. The standards use multiple “Modes”, which are various linear combinations of pixels for prediction of their neighbors within image Macro-Blocks (MBs). In this research, we have used Deep Neural Networks (DNN) to perform the predictions. Using twelve Fully Connected Networks, we managed to reduce Mean Square Error (MSE) of the predicted error by up to 3 times as compared to standard modes prediction results. This substantial improvement comes at the expense of more extensive computations. However, these extra computations can be significantly mitigated by the use of dedicated Graphical Processing Units (GPUs).
Conference Presentation
© (2018) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Raz Birman, Yoram Segal, Avishay David-Malka, and Ofer Hadar "Intra prediction with deep learning", Proc. SPIE 10752, Applications of Digital Image Processing XLI, 1075214 (17 September 2018); https://doi.org/10.1117/12.2320551
Lens.org Logo
CITATIONS
Cited by 3 scholarly publications and 1 patent.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video compression

Neural networks

Video

Video coding

Neurons

RELATED CONTENT

LIVE: latent image and video encoding
Proceedings of SPIE (September 30 2024)
VMAF and variants: towards a unified VQA
Proceedings of SPIE (August 01 2021)
Integrated segmentation approach for video coding
Proceedings of SPIE (January 10 1997)

Back to Top