Spherical Convolution empowered FoV Prediction in 360-degree Video Multicast with Limited FoV Feedback

01/29/2022
by Jie Li, et al.

Field of view (FoV) prediction is critical in 360-degree video multicast, a key component of emerging Virtual Reality (VR) and Augmented Reality (AR) applications. Most current prediction methods that combine saliency detection with FoV information have two shortcomings that degrade prediction performance: they do not account for the fact that the distortion of projected 360-degree videos can invalidate the weight sharing of traditional convolutional networks, and they do not adequately consider the difficulty of obtaining complete multi-user FoV information. This paper proposes a spherical convolution-empowered FoV prediction method, a multi-source prediction framework that combines salient features extracted from 360-degree video with limited FoV feedback information. A spherical convolutional neural network (CNN) is used instead of a traditional two-dimensional CNN to eliminate the weight-sharing failure caused by video projection distortion. Specifically, salient spatial-temporal features are extracted through a spherical convolution-based saliency detection model, after which the limited FoV feedback information is modeled as a time series by a spherical convolution-empowered gated recurrent unit network. Finally, the extracted salient video features are combined with this FoV representation to predict future user FoVs. Experimental results show that the proposed method outperforms other prediction methods.
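To make the projection-distortion argument concrete, the following is a minimal sketch, assuming the 360-degree frames are stored in equirectangular projection (the abstract does not name a projection, so this is an assumption). It computes the solid angle on the sphere covered by one pixel in each image row. The function names `pixel_solid_angle` and `row_solid_angles` are hypothetical and are not taken from the paper.

```python
import math


def pixel_solid_angle(row, height, width):
    """Solid angle (in steradians) covered by one pixel in a given row.

    Assumes an equirectangular image of size height x width, where row 0
    touches the north pole and row height - 1 touches the south pole.
    """
    d_lat = math.pi / height        # latitude span of one row
    d_lon = 2.0 * math.pi / width   # longitude span of one column
    lat_top = math.pi / 2.0 - row * d_lat
    lat_bottom = lat_top - d_lat
    # Area of a latitude-longitude patch on the unit sphere.
    return d_lon * (math.sin(lat_top) - math.sin(lat_bottom))


def row_solid_angles(height, width):
    """Per-pixel solid angle for every row, from top to bottom."""
    return [pixel_solid_angle(r, height, width) for r in range(height)]


if __name__ == "__main__":
    height, width = 180, 360
    angles = row_solid_angles(height, width)
    # Every pixel has the same size in the image, but pixels near the
    # equator cover far more of the sphere than pixels near the poles.
    print("pole row:   ", angles[0])
    print("equator row:", angles[height // 2])
    print("ratio:      ", angles[height // 2] / angles[0])
```

Because a fixed-size planar kernel covers a very different region of the sphere depending on the row it is applied to, the same filter weights do not correspond to the same spherical pattern everywhere, which is the weight-sharing failure the abstract refers to.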


