Presentation + Paper
12 April 2021 Real-time object detection in 360-degree videos
Author Affiliations +
Abstract
Streaming of 360-degree videos over the internet is challenging task, but it provides rich multimedia experiences by allowing viewers to navigate 360-degree contents. The 360-degree videos need larger bandwidth and less latency to be streamed over the internet than the conventional videos. Therefore, non-visible area must be discarded from the video to save bandwidth. View prediction techniques have been used to predict visible area of the 360-degree video frames to be streamed. Linear regression using viewer’s past viewing behavior data is useful to predict short-term future behavior of the viewer, which is not useful when the network delay is longer than the prediction horizon. Object detection techniques help predicting viewers’ future motion for longer prediction horizon since the viewers tend to follow the objects that draw their attention. However, conventional object detection techniques using a convolutional neural network, such as YOLO, are difficult to be applied to 360-degree videos. There are distortions in the 360-degree videos when the spherical 360-degree video is projected into equi-rectangular videos for processing and storing purposes. A same object could have different shapes in the equi-rectangular video depends on their angular position in the sphere. Therefore, in this paper, we propose a multi-directional projection (MDP) technique to detect objects in the 360-degree videos. The proposed multi- directional projection technique mitigates the distortions in the equi-rectangular videos and feeds the redirected videos to the object detection system. Therefore, the neural network trained with conventional video dataset can be used without any change. Experimental result shows that the proposed method helps detecting objects in the edges of the 360-degree videos.
Conference Presentation
© (2021) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jounsup Park "Real-time object detection in 360-degree videos", Proc. SPIE 11736, Real-Time Image Processing and Deep Learning 2021, 117360C (12 April 2021); https://doi.org/10.1117/12.2586403
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Spherical lenses

Video processing

Convolutional neural networks

Internet

Detection and tracking algorithms

Multimedia

RELATED CONTENT

Image processing for drawing recognition
Proceedings of SPIE (March 03 2014)
4K-based intra and interprediction techniques for HEVC
Proceedings of SPIE (April 29 2016)
Deco video video editing and viewing browser enables to...
Proceedings of SPIE (January 22 2008)
A new method based on twi difference algorithm and motion...
Proceedings of SPIE (November 03 2005)
Online scene change detection of multicast (MBone) video
Proceedings of SPIE (October 05 1998)

Back to Top