Improving the Visual Quality of Video Frame Prediction Models Using the Perceptual Straightening Hypothesis
Kancharla, Parimala and Channappayya, Sumohana S. (2021) Improving the Visual Quality of Video Frame Prediction Models Using the Perceptual Straightening Hypothesis. IEEE Signal Processing Letters, 28. pp. 2167-2171. ISSN 1070-9908
Text
IEEE_Signal.pdf - Published Version Restricted to Registered users only Download (1MB) | Request a copy |
Abstract
We present a simple and effective method to improve the visual quality of the predicted frames in a frame prediction model. A recent neuroscience study hypothesizes that the perceptual representations of a sequence of frames extracted from a natural video follow a straight temporal trajectory. The perceptual representations of a sequence of video frames are found using a computational model of the LGN and V1 areas of the human visual system. In this work, we leverage the strength of this perceptual straightening model to formulate a novel objective function for video frame prediction. In general, a frame prediction model takes past frames as input and predicts the future frame. We enforce the perceptual straightness constraint through adversarial training by introducing the proposed novel quality aware discriminator loss. Our quality aware discriminator imposes the linear relationship between the perceptual representation of the predicted frame and the perceptual representations of the past frames.Specifically, we claim that imposing a perceptual straightness constraint through the discriminator helps in predicting (i.e., generating) video frames that look more natural and therefore, having a higher perceptual quality. We demonstrate the effectiveness of our proposed objective function on two popular video datasets using three different frame prediction models. These experiments show that our solution is both consistent and stable, thereby allowing it to be integrated with other frame prediction models as well. © 1994-2012 IEEE.
IITH Creators: |
|
||||
---|---|---|---|---|---|
Item Type: | Article | ||||
Uncontrolled Keywords: | Image generation; video generation; video prediction | ||||
Subjects: | Electrical Engineering | ||||
Divisions: | Department of Electrical Engineering | ||||
Depositing User: | . LibTrainee 2021 | ||||
Date Deposited: | 29 Aug 2022 09:38 | ||||
Last Modified: | 29 Aug 2022 09:38 | ||||
URI: | http://raiithold.iith.ac.in/id/eprint/10320 | ||||
Publisher URL: | http://doi.org/10.1109/LSP.2021.3118639 | ||||
OA policy: | https://v2.sherpa.ac.uk/id/publication/3572 | ||||
Related URLs: |
Actions (login required)
View Item |
Statistics for this ePrint Item |