The Karhunen-Loève Transform (KLT) of an AR(1) process with one known block boundary resembles the Discrete Sine Transform of type VII (DST-VII). Similarly, when both boundaries are known, the KLT resembles the DST-I. In this paper, we use the same methodology to suggest new temporal prediction structures for video coding. Specifically, we treat the pictures in a Group of Pictures (GOP) as samples of an AR(1) process and interpret factorizations of the resulting DST-VII and DST-I transforms as sequences of temporal predictions with specific weight factors. We then identify a subset of GOP lengths for which the short-length DST-VII and DST-I factorizations have simple structures, making them candidates for practical implementations of temporal prediction algorithms. We also analyze the coding gains achievable by traditional versus transform-based prediction under single- and dual-sided boundary conditions. These results may be of interest for future evolutions of video coding algorithms and architectures.
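
The link between boundary conditions and sine transforms can be checked numerically. The following is a minimal sketch, not the method of this paper: it assumes a unit-variance innovation, builds the inverse covariance of an AR(1) block conditioned on its known boundary sample(s), and compares the resulting KLT basis with the DST-VII and DST-I bases. The function names (`ar1_precision`, `dst7_basis`, `dst1_basis`, `klt_basis`, `max_mismatch`) are hypothetical helpers introduced only for illustration. In this sketch the one-sided match is exact only in the limit of correlation `rho = 1`, while the two-sided match holds for any `rho`.

```python
import numpy as np


def ar1_precision(n, rho=1.0, two_sided=False):
    """Inverse covariance (up to scale) of n AR(1) samples, conditioned on a
    known sample before the block and, if two_sided, also one after it."""
    q = np.zeros((n, n))
    idx = np.arange(n)
    q[idx, idx] = 1.0 + rho**2          # main diagonal
    q[idx[:-1], idx[1:]] = -rho         # superdiagonal
    q[idx[1:], idx[:-1]] = -rho         # subdiagonal
    if not two_sided:
        q[-1, -1] = 1.0                 # free (unknown) right end
    return q


def dst7_basis(n):
    """Orthonormal DST-VII basis; row k-1 is sin((2k-1) j pi / (2n+1))."""
    k = np.arange(1, n + 1).reshape(-1, 1)
    j = np.arange(1, n + 1).reshape(1, -1)
    b = np.sin((2 * k - 1) * j * np.pi / (2 * n + 1))
    return b / np.linalg.norm(b, axis=1, keepdims=True)


def dst1_basis(n):
    """Orthonormal DST-I basis; row k-1 is sin(k j pi / (n+1))."""
    k = np.arange(1, n + 1).reshape(-1, 1)
    j = np.arange(1, n + 1).reshape(1, -1)
    b = np.sin(k * j * np.pi / (n + 1))
    return b / np.linalg.norm(b, axis=1, keepdims=True)


def klt_basis(q):
    """KLT basis vectors as rows. The precision matrix shares eigenvectors
    with the covariance; ascending precision eigenvalues correspond to
    descending variance."""
    _, vecs = np.linalg.eigh(q)
    return vecs.T


def max_mismatch(a, b):
    """Largest value of 1 - |<a_k, b_k>| over matching rows (sign-invariant)."""
    return float(np.max(1.0 - np.abs(np.sum(a * b, axis=1))))


if __name__ == "__main__":
    n = 8  # assumed block (GOP) length, chosen only for illustration
    one_sided = max_mismatch(klt_basis(ar1_precision(n, rho=1.0)), dst7_basis(n))
    two_sided = max_mismatch(
        klt_basis(ar1_precision(n, rho=0.9, two_sided=True)), dst1_basis(n)
    )
    print("one known boundary vs DST-VII:", one_sided)
    print("two known boundaries vs DST-I:", two_sided)
```

Running the sketch with a correlation below 1 for the one-sided case (for example `ar1_precision(n, rho=0.9)`) gives a nonzero mismatch against the DST-VII, which is the sense in which the KLT "resembles" rather than equals that transform.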