Improving the Visual Quality of Video Frame Prediction Models Using the Perceptual Straightening Hypothesis

Kancharla, Parimala and Channappayya, Sumohana S. (2021) Improving the Visual Quality of Video Frame Prediction Models Using the Perceptual Straightening Hypothesis. IEEE Signal Processing Letters, 28. pp. 2167-2171. ISSN 1070-9908

[img] Text
IEEE_Signal.pdf - Published Version
Restricted to Registered users only

Download (1MB) | Request a copy

Abstract

We present a simple and effective method to improve the visual quality of the predicted frames in a frame prediction model. A recent neuroscience study hypothesizes that the perceptual representations of a sequence of frames extracted from a natural video follow a straight temporal trajectory. The perceptual representations of a sequence of video frames are found using a computational model of the LGN and V1 areas of the human visual system. In this work, we leverage the strength of this perceptual straightening model to formulate a novel objective function for video frame prediction. In general, a frame prediction model takes past frames as input and predicts the future frame. We enforce the perceptual straightness constraint through adversarial training by introducing the proposed novel quality aware discriminator loss. Our quality aware discriminator imposes the linear relationship between the perceptual representation of the predicted frame and the perceptual representations of the past frames.Specifically, we claim that imposing a perceptual straightness constraint through the discriminator helps in predicting (i.e., generating) video frames that look more natural and therefore, having a higher perceptual quality. We demonstrate the effectiveness of our proposed objective function on two popular video datasets using three different frame prediction models. These experiments show that our solution is both consistent and stable, thereby allowing it to be integrated with other frame prediction models as well. © 1994-2012 IEEE.

[error in script]
IITH Creators:
IITH CreatorsORCiD
Channappayya, Sumohana S.https://orcid.org/0000-0002-5880-4023
Item Type: Article
Uncontrolled Keywords: Image generation; video generation; video prediction
Subjects: Electrical Engineering
Divisions: Department of Electrical Engineering
Depositing User: . LibTrainee 2021
Date Deposited: 29 Aug 2022 09:38
Last Modified: 29 Aug 2022 09:38
URI: http://raiithold.iith.ac.in/id/eprint/10320
Publisher URL: http://doi.org/10.1109/LSP.2021.3118639
OA policy: https://v2.sherpa.ac.uk/id/publication/3572
Related URLs:

Actions (login required)

View Item View Item
Statistics for RAIITH ePrint 10320 Statistics for this ePrint Item