Open Access

Integer fast lapped transforms based on direct-lifting of DCTs for lossy-to-lossless image coding

EURASIP Journal on Image and Video Processing 2013, 2013:65

https://doi.org/10.1186/1687-5281-2013-65

Received: 24 July 2012

Accepted: 3 December 2013

Published: 27 December 2013

Abstract

The discrete cosine transforms (DCTs) have found wide application in image/video compression (image coding). DCT-based lapped transforms (LTs), called fast LTs (FLTs), overcome the blocking artifacts generated by the DCT in low bit rate image coding while keeping a fast implementation. This paper presents a realization of integer FLTs (IntFLTs) for lossy-to-lossless image coding, i.e., unified lossless and lossy image coding, that is more effective than the conventional IntFLTs. It is composed of few operations and a direct application of DCTs to lifting blocks, called the direct-lifting of DCTs. Since the direct-lifting can reuse any existing software/hardware for DCTs, the proposed IntFLTs have great potential for fast implementation, depending on the architecture design and the DCT algorithms. Furthermore, the proposed IntFLTs do not need any side information, unlike the integer DCT (IntDCT) based on direct-lifting in our previous work. Moreover, they can easily be extended to larger sizes, as recently required of the DCT in the H.26x standard series. As a result, the proposed method shows better lossy-to-lossless image coding than the conventional IntFLTs.

1 Introduction

The most popular image/video compression (image coding) standards, JPEG [1, 2] and the H.26x series [3, 4], employ the discrete cosine transform (DCT) [5] at their transformation stages. The DCT is basically classified into types I to IV (DCT-I to DCT-IV) and has numerous fast implementations [6-10] and applications in signal processing. Among them, DCT-II, the so-called DCT, has excellent energy compaction capability, and DCT-III is its inverse transform, the so-called inverse DCT (IDCT). However, the DCT generates annoying blocking artifacts at low bit rates because the DCT bases are short and create discontinuities at block boundaries due to non-overlapping. To overcome this drawback, lapped transforms (LTs), which are classified into the lapped orthogonal transform (LOT) and the lapped biorthogonal transform (LBT), have received much attention. DCT-based fast LTs (FLTs), which are classified into the fast LOT (FLOT) and the fast LBT (FLBT), are well known as fast and effective transforms for image coding [11]. FLTs are constructed by cascading DCT-II, DCT-III, DCT-IV, rotation matrices with π/4 angles, ±1 operations, scaling factors, a delay matrix, and permutation matrices. To further improve the coding performance and reduce the complexity, LiftLT, with a VLSI-friendly implementation, was proposed by Tran [12]a. However, these LTs cannot be applied to the lossless mode.

On the other hand, JPEG achieves its lossless mode by using differential pulse code modulation (DPCM) in place of the DCT. JPEG 2000 [13] employs the 9/7-tap and 5/3-tap discrete wavelet transforms (9/7-DWT and 5/3-DWT) for the lossy and lossless modes, respectively [14]. This means that JPEG and JPEG 2000 do not have compatibility between the lossy and lossless modes. Of course, a lossless transform such as the 5/3-DWT is applicable to lossy-to-lossless image coding. However, its lossy performance is not good compared with the 9/7-DWT, because each transform is suitable only for its own mode. The newer standard JPEG XR [15] has solved this problem by achieving lossy-to-lossless image coding, which unifies lossy and lossless image coding. JPEG XR employs only the hierarchical lapped transform (HLT) for both the lossy and lossless modes [16]. The HLT is composed of lifting structures [17-19] with rounding operations and achieves an integer-to-integer transform, whereas it does not have sufficient coding performance, especially for images with many high-frequency components. Various lifting-based filter banks (L-FBs) [20-28], which include integer DCTs (IntDCTs) [29-35], have been studied to improve the coding performance. However, except for the IntDCTs, these are not practical due to their complexity.

This paper presents a realization of integer FLTs (IntFLTs), constructed from lifting structures with rounding operations, for lossy-to-lossless image coding. Although an FLT can easily be applied to lossy-to-lossless image coding by simple lifting factorizations of the rotation matrices and scaling factors, the resulting integer transform is unsuitable because its many rounding operations produce a large rounding error. The conventional IntFLTs also require many operations, whereas the proposed IntFLTs have simple implementations with few operations and a direct application of DCTs to lifting blocks, called the direct-lifting of DCTs. The direct-lifting can reuse any existing software/hardware for DCTsb. As a result, although the proposed IntFLTs apparently sacrifice some complexity to achieve the lossless mode compared with LiftLT, they have great potential for fast implementation, depending on the architecture design and the DCT algorithms. Furthermore, the proposed IntFLTs do not need any side information, unlike the IntDCT based on direct-lifting in our previous work [35]. Moreover, they can easily be extended to larger sizes, as recently required of the DCT in the H.26x series. The IntFLT already proposed in [36] cannot achieve sufficient coding performance due to its orthogonality restriction. This paper introduces IntFLTs without such a restriction. Finally, the proposed method shows better lossy-to-lossless image coding than the conventional IntFLTs.

1.1 Notations

Several special matrices with reserved symbols are as follows: $I$, $J$, $0$, and $D$ are an identity matrix, a reversal identity matrix, a null matrix, and a diagonal matrix with alternating ±1 entries (i.e., diag{1, -1, 1, -1, ...}), respectively. Also, $(\cdot)^T$ and $(\cdot)^{-1}$ denote the transpose and the inverse of a matrix, respectively.

2 Review

2.1 Fast lapped transform (FLT)

An M-channel ($M = 2^k$, $k \in \mathbb{N}$) FLT can be constructed in a polyphase structure from components with well-known fast-computable algorithms. One of the most elegant solutions is the type-II FLOT. Its polyphase matrix $E(z)$ is expressed as [11]
$$E(z) = \begin{bmatrix} I & 0 \\ 0 & S_{IV} C_{III} \end{bmatrix} W \, \Lambda(z) \, W \begin{bmatrix} C_{II} & 0 \\ 0 & C_{IV} \end{bmatrix} W \, \tilde{I} \, J$$
(1)
where
$$W = \frac{1}{\sqrt{2}}\begin{bmatrix} I & I \\ I & -I \end{bmatrix},\quad \tilde{I} = \begin{bmatrix} I & 0 \\ 0 & J \end{bmatrix},\quad \Lambda(z) = \begin{bmatrix} I & 0 \\ 0 & z^{-1} I \end{bmatrix},$$
$z^{-1}$ is a delay, and $C_{II}$, $C_{III}$, $C_{IV}$, and $S_{IV}$ are the DCT-II, DCT-III, DCT-IV, and type-IV discrete sine transform (DST-IV) matrices, whose $(m, n)$th elements are given by
$$\begin{aligned}
\left[C_{II}\right]_{m,n} &= \sqrt{\tfrac{2}{N}}\, c_m \cos\frac{m(n+1/2)\pi}{N}, &
\left[C_{III}\right]_{m,n} &= \sqrt{\tfrac{2}{N}}\, c_n \cos\frac{(m+1/2)n\pi}{N},\\
\left[C_{IV}\right]_{m,n} &= \sqrt{\tfrac{2}{N}} \cos\frac{(m+1/2)(n+1/2)\pi}{N}, &
\left[S_{IV}\right]_{m,n} &= \sqrt{\tfrac{2}{N}} \sin\frac{(m+1/2)(n+1/2)\pi}{N},
\end{aligned}$$
where $c_i = 1/\sqrt{2}$ for $i = 0$ and $c_i = 1$ for $i \neq 0$. Also, $C_{II}^{-1} = C_{II}^{T} = C_{III}$, $C_{IV}^{-1} = C_{IV}^{T} = C_{IV}$, and $S_{IV}^{-1} = S_{IV}^{T} = S_{IV}$. Since the relationship $S_{IV} = D C_{IV} J$ holds between the DST-IV and DCT-IV matrices, Equation 1 can easily be represented by
$$E(z) = \begin{bmatrix} I & 0 \\ 0 & D C_{IV} J C_{III} \end{bmatrix} W \, \Lambda(z) \, W \begin{bmatrix} C_{II} & 0 \\ 0 & C_{IV} \end{bmatrix} W \, \tilde{I} \, J.$$
(2)
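To make the notation concrete, the following minimal sketch (our own illustration, not code from the paper) builds size-N DCT-II/III/IV and DST-IV matrices elementwise from the definitions above and numerically checks the stated identities, including S_IV = D C_IV J; it only assumes numpy.

```python
import numpy as np

def dct_matrices(N):
    """Build the N x N DCT-II/III/IV and DST-IV matrices from their elementwise definitions."""
    m, n = np.meshgrid(np.arange(N), np.arange(N), indexing="ij")
    c = lambda i: np.where(i == 0, 1.0 / np.sqrt(2.0), 1.0)
    C2 = np.sqrt(2.0 / N) * c(m) * np.cos(m * (n + 0.5) * np.pi / N)    # DCT-II
    C3 = np.sqrt(2.0 / N) * c(n) * np.cos((m + 0.5) * n * np.pi / N)    # DCT-III
    C4 = np.sqrt(2.0 / N) * np.cos((m + 0.5) * (n + 0.5) * np.pi / N)   # DCT-IV
    S4 = np.sqrt(2.0 / N) * np.sin((m + 0.5) * (n + 0.5) * np.pi / N)   # DST-IV
    return C2, C3, C4, S4

N = 4
C2, C3, C4, S4 = dct_matrices(N)
D = np.diag([(-1.0) ** k for k in range(N)])   # diag{1, -1, 1, -1, ...}
J = np.fliplr(np.eye(N))                        # reversal identity matrix

assert np.allclose(C2 @ C3, np.eye(N))          # C_III is the inverse (and transpose) of C_II
assert np.allclose(C4 @ C4, np.eye(N))          # DCT-IV is self-inverse
assert np.allclose(S4, D @ C4 @ J)              # S_IV = D C_IV J
```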
On the other hand, the HLT for JPEG XR is based on the FLOT with scaling factors [16]. Inspired by this, the FLT in this paper is defined by
$$E(z) = \begin{bmatrix} I & 0 \\ 0 & D C_{IV} J C_{III} \end{bmatrix} W \, \Lambda(z) \, W \begin{bmatrix} s_0 C_{II} & 0 \\ 0 & s_1 C_{IV} \end{bmatrix} W \, \tilde{I} \, J$$
(3)
where $s_1 = s_0^{-1}$, which is the restriction required for the lifting factorization. This transform is called the FLBT in this paper. Since the FLOT in Equation 2 is clearly equal to the FLBT in Equation 3 with $s_0 = s_1 = 1$, we use Equation 3 as a representative expression of the FLT. The FLT with this polyphase matrix is implemented as shown in the top half of Figure 1.
Figure 1

Parallel process of two different types of M-channel FLTs (drawn for M = 8). (Top) FLT in Equation 3; (bottom) FLT in Equation 6.

2.2 Direct-lifting structure

In [35], we presented direct-lifting, a class of block-lifting [25], which is known to be a more effective lifting structure for lossy-to-lossless image coding than the standard lifting structure [17-19]. The block-lifting reduces rounding error by merging many rounding operations. The direct-lifting is the key technology used to produce the novel IntFLTs. To achieve the lifting, we suppose that two individual M × 1 signals $x_i$ and $x_j$ are processed by an M × M arbitrary nonsingular matrix $T$ and its inverse transform matrix $T^{-1}$, respectively, as shown on the left side of Figure 2. The input signals $x_i$ and $x_j$ are simultaneously transformed into the output signals $y_i$ and $y_j$ by $T$ and $T^{-1}$ as
$$\begin{bmatrix} y_i \\ y_j \end{bmatrix} = \begin{bmatrix} T & 0 \\ 0 & T^{-1} \end{bmatrix} \begin{bmatrix} x_i \\ x_j \end{bmatrix}.$$
Figure 2

Direct-lifting structure (white circles represent rounding operations).

This block diagonal matrix diag{$T$, $T^{-1}$} can be factorized into complete block-liftings as
$$\begin{bmatrix} T & 0 \\ 0 & T^{-1} \end{bmatrix} =
\begin{bmatrix} 0 & I \\ -I & 0 \end{bmatrix}
\begin{bmatrix} I & 0 \\ T & I \end{bmatrix}
\begin{bmatrix} I & -T^{-1} \\ 0 & I \end{bmatrix}
\begin{bmatrix} I & 0 \\ T & I \end{bmatrix}.$$
(4)

Thus, the parallel block system of $T$ and $T^{-1}$ can be efficiently implemented by the block-liftings, as shown on the right side of Figure 2. This is a breakthrough structure because any block $T$ and its inverse $T^{-1}$ can be directly used as the block-lifting coefficients without breaking their forms. Although existing software/hardware for the DCT cannot be directly reused in the conventional IntDCTs, any of it can be admitted as the lifting blocks here when $T = C_{II}$.
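As a quick sanity check of Equation 4, the sketch below (our own, with hypothetical variable names) applies the three block-lifting steps with rounding to integer input vectors and then inverts them exactly, which is what makes the structure suitable for lossless coding; here T is just an arbitrary nonsingular matrix.

```python
import numpy as np

rng = np.random.default_rng(0)
M = 8
T = rng.standard_normal((M, M)) + 2 * np.eye(M)   # arbitrary nonsingular block
Ti = np.linalg.inv(T)
rnd = np.round                                      # rounding operation

def forward(xi, xj):
    """Block-lifting factorization of diag{T, T^{-1}} (Equation 4), applied right to left."""
    xi, xj = xi.astype(float), xj.astype(float)
    xj = xj + rnd(T @ xi)        # [I 0; T I]
    xi = xi - rnd(Ti @ xj)       # [I -T^{-1}; 0 I]
    xj = xj + rnd(T @ xi)        # [I 0; T I]
    return xj, -xi               # [0 I; -I 0]

def inverse(yi, yj):
    xj, xi = yi.copy(), -yj.copy()   # undo the final permutation and sign change
    xj = xj - rnd(T @ xi)
    xi = xi + rnd(Ti @ xj)
    xj = xj - rnd(T @ xi)
    return xi, xj

xi = rng.integers(-128, 128, M).astype(float)
xj = rng.integers(-128, 128, M).astype(float)
yi, yj = forward(xi, xj)                        # yi ~ T xi, yj ~ T^{-1} xj, integer-valued
assert np.allclose(inverse(yi, yj), (xi, xj))   # exact (lossless) reconstruction
```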

3 IntFLTs based on direct-lifting of DCTs

This section presents a realization of IntFLTs for lossy-to-lossless image coding. The IntFLTs have simple implementations with few operations and the direct-lifting of DCTs.

3.1 Direct-lifting of DCTs

The FLT in Equation 3 is transformed into another form so that the direct-lifting in Equation 4 can be applied. First, Equation 3 is rewritten as
$$E(z) = \begin{bmatrix} I & 0 \\ 0 & D C_{IV} J C_{III} \end{bmatrix} W \, \Lambda(z) \, W \begin{bmatrix} C_{II} & 0 \\ 0 & C_{II} \end{bmatrix} \begin{bmatrix} s_0 I & 0 \\ 0 & s_1 C_{III} C_{IV} \end{bmatrix} W \, \tilde{I} \, J$$
(5)
where $C_{II} C_{III} = I$ is used. By moving diag{$C_{II}$, $C_{II}$} to the postprocessing part, Equation 5 is rewritten as
$$\begin{aligned}
E(z) &= \begin{bmatrix} I & 0 \\ 0 & D C_{IV} J C_{III} \end{bmatrix} \begin{bmatrix} C_{II} & 0 \\ 0 & C_{II} \end{bmatrix} W \, \Lambda(z) \, W \begin{bmatrix} s_0 I & 0 \\ 0 & s_1 C_{III} C_{IV} \end{bmatrix} W \, \tilde{I} \, J \\
&= \begin{bmatrix} C_{II} & 0 \\ 0 & D C_{IV} J \end{bmatrix} W \, \Lambda(z) \, W \begin{bmatrix} s_0 I & 0 \\ 0 & s_1 C_{III} C_{IV} \end{bmatrix} W \, \tilde{I} \, J \triangleq \tilde{E}(z)
\end{aligned}$$
(6)

where $C_{III} C_{II} = I$ is used and $\tilde{E}(z)$ distinguishes this form from the original $E(z)$ in Equation 3. Of course, $\tilde{E}(z)$ has the same transfer function as $E(z)$. The FLT with this polyphase matrix $\tilde{E}(z)$ is implemented as shown in the bottom half of Figure 1.

Next, as already mentioned, we consider the parallel processing of the two different types of FLTs in Equations 3 and 6 as follows:
$$\begin{bmatrix} y_i \\ y_j \end{bmatrix} = \begin{bmatrix} E(z) & 0 \\ 0 & \tilde{E}(z) \end{bmatrix} \begin{bmatrix} x_i \\ x_j \end{bmatrix}$$
where $x_i$ and $x_j$ are individual input signals along the processing direction, and $y_i$ and $y_j$ are their output signals, as shown in Figure 1. This means that when some row (column) signals are processed by Equation 3, the other row (column) signals are processed by Equation 6. Each DCT matrix in the two FLTs is then processed by direct-lifting of each combination of DCT-II/DCT-III, DCT-III/DCT-II, and DCT-IV/DCT-IV, as shown in the dashed-line boxes in Figure 1. For example, the combination of DCT-II/DCT-III is factorized as
$$\begin{bmatrix} C_{II} & 0 \\ 0 & C_{III} \end{bmatrix} =
\begin{bmatrix} 0 & I \\ -I & 0 \end{bmatrix}
\begin{bmatrix} I & 0 \\ C_{II} & I \end{bmatrix}
\begin{bmatrix} I & -C_{III} \\ 0 & I \end{bmatrix}
\begin{bmatrix} I & 0 \\ C_{II} & I \end{bmatrix}$$

by substituting $C_{II}$ for $T$ in Equation 4.
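To illustrate how existing DCT software can be reused as the lifting blocks, a small sketch using scipy's orthonormal DCT-II/DCT-III routines as C_II and C_III follows; this is only our assumed mapping between scipy.fft.dct with norm='ortho' and the matrices defined in Section 2.1, not code from the authors.

```python
import numpy as np
from scipy.fft import dct

rnd = np.round
dct2 = lambda x: dct(x, type=2, norm="ortho")   # existing software for C_II
dct3 = lambda x: dct(x, type=3, norm="ortho")   # existing software for C_III (= C_II^{-1})

def direct_lifting_dct(xi, xj):
    """Integer direct-lifting of the DCT-II/DCT-III pair: yi ~ C_II xi, yj ~ C_III xj."""
    xi, xj = xi.astype(float), xj.astype(float)
    xj = xj + rnd(dct2(xi))
    xi = xi - rnd(dct3(xj))
    xj = xj + rnd(dct2(xi))
    return xj, -xi

# e.g., two length-8 signals taken from an 8-bit image
xi = np.array([1, 5, 9, 200, 30, 64, 7, 255], dtype=float)
xj = np.array([0, 3, 17, 120, 90, 12, 45, 210], dtype=float)
yi, yj = direct_lifting_dct(xi, xj)
print(yi - dct2(xi))   # small values: only rounding error separates yi from the exact DCT-II
```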

3.2 Lifting structure of rotation matrix with π/4 angle

Since $W$ in Equations 3 and 6 and Figure 1 includes the scaling factor $1/\sqrt{2}$, we factorize it into a lifting structure. In [14], $W$ is simply factorized as
$$W = \begin{bmatrix} I & w_0 I \\ 0 & I \end{bmatrix}
\begin{bmatrix} I & 0 \\ w_1 I & I \end{bmatrix}
\begin{bmatrix} I & w_0 I \\ 0 & I \end{bmatrix}
\begin{bmatrix} I & 0 \\ 0 & -I \end{bmatrix}$$
where $w_0 = 1 - \sqrt{2}$ and $w_1 = 1/\sqrt{2}$. However, this factorization includes many floating-point multipliers. To eliminate as many multipliers as possible, the following scaled matrices are used in place of the pure $W$:
$$\begin{aligned}
\begin{bmatrix} \tfrac{1}{\sqrt{2}} I & 0 \\ 0 & \sqrt{2} I \end{bmatrix} W &=
\begin{bmatrix} I & 0 \\ 0 & -I \end{bmatrix}
\begin{bmatrix} I & \tfrac{1}{2} I \\ 0 & I \end{bmatrix}
\begin{bmatrix} I & 0 \\ -I & I \end{bmatrix} \triangleq W_1, \\
W \begin{bmatrix} \sqrt{2} I & 0 \\ 0 & \tfrac{1}{\sqrt{2}} I \end{bmatrix} &=
\begin{bmatrix} I & 0 \\ I & I \end{bmatrix}
\begin{bmatrix} I & -\tfrac{1}{2} I \\ 0 & I \end{bmatrix}
\begin{bmatrix} I & 0 \\ 0 & -I \end{bmatrix} \triangleq W_2, \\
\begin{bmatrix} \sqrt{2} I & 0 \\ 0 & \tfrac{1}{\sqrt{2}} I \end{bmatrix} W &=
\begin{bmatrix} I & 0 \\ \tfrac{1}{2} I & I \end{bmatrix}
\begin{bmatrix} I & -I \\ 0 & I \end{bmatrix}
\begin{bmatrix} I & 0 \\ 0 & -I \end{bmatrix} \triangleq W_3, \\
W \begin{bmatrix} \tfrac{1}{\sqrt{2}} I & 0 \\ 0 & \sqrt{2} I \end{bmatrix} &=
\begin{bmatrix} I & 0 \\ 0 & -I \end{bmatrix}
\begin{bmatrix} I & I \\ 0 & I \end{bmatrix}
\begin{bmatrix} I & 0 \\ -\tfrac{1}{2} I & I \end{bmatrix} \triangleq W_4.
\end{aligned}$$
Note that a lifting structure with coefficient 1/2 and a rounding operation can be realized by one adder and one bit-shift [37], i.e., with multiplierless operations. With these matrices, Equations 3 and 6 are represented as follows:
$$\begin{bmatrix} \tfrac{1}{\sqrt{2}} I & 0 \\ 0 & \sqrt{2} I \end{bmatrix} E(z) =
\begin{bmatrix} I & 0 \\ 0 & D C_{IV} J C_{III} \end{bmatrix} W_1 \, \Lambda(z) \, W_2
\begin{bmatrix} s_0 C_{II} & 0 \\ 0 & s_1 C_{IV} \end{bmatrix} W_1 \, \tilde{I} \, J$$
(7)
$$\begin{bmatrix} \sqrt{2} I & 0 \\ 0 & \tfrac{1}{\sqrt{2}} I \end{bmatrix} \tilde{E}(z) =
\begin{bmatrix} C_{II} & 0 \\ 0 & D C_{IV} J \end{bmatrix} W_3 \, \Lambda(z) \, W_2
\begin{bmatrix} s_0 I & 0 \\ 0 & s_1 C_{III} C_{IV} \end{bmatrix} W_1 \, \tilde{I} \, J$$
(8)
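The factorizations of W and of the scaled matrices W_1 to W_4 can be verified numerically; the short check below (our own, assuming numpy and M = 8) confirms the three-lifting-step factorization of W and the W_1 identity, whose only non-trivial lifting coefficient is 1/2, i.e., one adder and one bit-shift per sample as noted above.

```python
import numpy as np

M = 8
I = np.eye(M // 2)
Z = np.zeros((M // 2, M // 2))
blk = lambda a, b, c, d: np.block([[a, b], [c, d]])   # 2 x 2 block matrix helper

W = (1.0 / np.sqrt(2.0)) * blk(I, I, I, -I)

# Three-lifting-step factorization of W with w0 = 1 - sqrt(2), w1 = 1/sqrt(2)
w0, w1 = 1.0 - np.sqrt(2.0), 1.0 / np.sqrt(2.0)
W_fact = blk(I, w0 * I, Z, I) @ blk(I, Z, w1 * I, I) @ blk(I, w0 * I, Z, I) @ blk(I, Z, Z, -I)
assert np.allclose(W, W_fact)

# Scaled variant W_1 = diag{I/sqrt(2), sqrt(2) I} W: the only remaining non-trivial
# lifting coefficient is 1/2 (one adder and one bit-shift per sample).
S = blk(I / np.sqrt(2.0), Z, Z, np.sqrt(2.0) * I)
W1 = blk(I, Z, Z, -I) @ blk(I, 0.5 * I, Z, I) @ blk(I, Z, -I, I)
assert np.allclose(S @ W, W1)
```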
Figure 3 shows the process by which an image is two-dimensionally transformed by the proposed IntFLTs. The ith (1 ≤ ((i + 1) mod M) ≤ M/2) row signals and the jth (M/2 + 1 ≤ ((j + 1) mod M) ≤ M) row signals, i.e., the yellow and green areas in Figure 3, are processed by the FLTs in Equations 7 and 8, respectively. Note that the one-dimensionally transformed output signals are scaled by $1/\sqrt{2}$ and $\sqrt{2}$ compared with the output signals of the normal FLTs, as shown in the dashed-line box in Figure 3. Taking these scales $1/\sqrt{2}$ and $\sqrt{2}$ into account for the subsequent column process, Equations 3 and 6 are represented again as follows:
$$E(z) \begin{bmatrix} \sqrt{2} I & 0 \\ 0 & \tfrac{1}{\sqrt{2}} I \end{bmatrix} =
\begin{bmatrix} I & 0 \\ 0 & D C_{IV} J C_{III} \end{bmatrix} W_2 \, \Lambda(z) \, W_1
\begin{bmatrix} s_0 C_{II} & 0 \\ 0 & s_1 C_{IV} \end{bmatrix} W_2 \, \tilde{I} \, J$$
(9)
Figure 3

Process of the two-dimensional transform of an image by the proposed IntFLTs.

$$\tilde{E}(z) \begin{bmatrix} \tfrac{1}{\sqrt{2}} I & 0 \\ 0 & \sqrt{2} I \end{bmatrix} =
\begin{bmatrix} C_{II} & 0 \\ 0 & D C_{IV} J \end{bmatrix} W_4 \, \Lambda(z) \, W_3
\begin{bmatrix} s_0 I & 0 \\ 0 & s_1 C_{III} C_{IV} \end{bmatrix} W_4 \, \tilde{I} \, J.$$
(10)

Similarly, the ith column signals and the jth column signals, i.e., the red and blue areas in Figure 3, are processed by the FLTs in Equations 9 and 10, respectively. Consequently, the scales are changed only temporarily for fast implementation and are restored after the two-dimensional transform.

3.3 Lifting structure of scaling part

In this subsection, we present a lifting structure for the scaling part diag{$s_0 I$, $s_1 I$} included in Equations 7 to 10. According to Equation 4, a simple realization of the integer transform of the scaling part is
$$\begin{bmatrix} s_0 I & 0 \\ 0 & s_1 I \end{bmatrix} =
\begin{bmatrix} 0 & I \\ -I & 0 \end{bmatrix}
\begin{bmatrix} I & 0 \\ s_0 I & I \end{bmatrix}
\begin{bmatrix} I & -s_1 I \\ 0 & I \end{bmatrix}
\begin{bmatrix} I & 0 \\ s_0 I & I \end{bmatrix}$$

where $s_1 = s_0^{-1}$. The lifting coefficients $s_0$ and $s_1$ in the scaling part are determined empirically.
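Since the scaling part is simply Equation 4 with T = s_0 I, its integer realization reduces to three elementwise lifting steps with rounding; a minimal sketch follows (the value s_0 = 1.2 is only an illustrative placeholder, not a coefficient from the paper).

```python
import numpy as np

def scale_lifting(xi, xj, s0):
    """Integer lifting of diag{s0 I, s1 I} with s1 = 1/s0 (Equation 4 with T = s0 I)."""
    s1 = 1.0 / s0
    xi, xj = xi.astype(float), xj.astype(float)
    xj = xj + np.round(s0 * xi)
    xi = xi - np.round(s1 * xj)
    xj = xj + np.round(s0 * xi)
    return xj, -xi    # yi ~ s0 * xi, yj ~ s1 * xj, integer-valued

yi, yj = scale_lifting(np.array([10., -3., 7.]), np.array([4., 0., -9.]), s0=1.2)
```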

4 Results

4.1 Coding gain

We designed 8 × 16 and 16 × 32 IntFLTs. First, a comparison of the coding gains of the ideal FLTs and the proposed IntFLTs is shown.

The coding gain is one of the most important factors to be considered in compression applications. A transform with a higher coding gain compacts more energy into fewer coefficients. As a result, higher objective quality, such as PSNR, is achieved after quantization. The biorthogonal coding gain is defined as [38]
$$\text{Coding gain (dB)} = 10 \log_{10} \frac{\sigma_x^2}{\left( \prod_{k=0}^{M-1} \sigma_{x_k}^2 \, \|f_k\|^2 \right)^{1/M}}$$
where $\sigma_x^2$ is the variance of the input signal, $\sigma_{x_k}^2$ is the variance of the kth subband, and $\|f_k\|^2$ is the squared norm of the kth synthesis filter. Although the coding gain does not completely determine the image coding results because of rounding error, Table 1 shows that none of the coding gain is lost.
Table 1 Comparisons of coding gain of the ideal FLTs and the proposed IntFLTs (dB)

| | Ideal FLTs | Proposed FLTs |
|---|---|---|
| 8 × 16 FLOT | 9.2189 | 9.2189 |
| 8 × 16 FLBT | 9.4475 | 9.4475 |
| 16 × 32 FLOT | 9.7593 | 9.7593 |
| 16 × 32 FLBT | 9.8455 | 9.8455 |

For comparison, the coding gain of LiftLT [12] is 9.5378 dB, which is higher than that of the proposed 8 × 16 IntFLTs because LiftLT is optimized for lossy coding.
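For reference, a small helper (our own, with an assumed interface and function name) that evaluates the biorthogonal coding gain above from a given input variance, subband variances, and squared synthesis filter norms:

```python
import numpy as np

def biorthogonal_coding_gain(input_var, subband_vars, synth_norms_sq):
    """Coding gain (dB) = 10*log10( sigma_x^2 / geometric mean over the M subbands
    of sigma_{x_k}^2 * ||f_k||^2 )."""
    subband_vars = np.asarray(subband_vars, dtype=float)
    synth_norms_sq = np.asarray(synth_norms_sq, dtype=float)
    # geometric mean computed in the log domain for numerical robustness
    geo_mean = np.exp(np.mean(np.log(subband_vars * synth_norms_sq)))
    return 10.0 * np.log10(input_var / geo_mean)

# Toy example with illustrative numbers only (not values from the paper)
print(biorthogonal_coding_gain(1.0, [0.70, 0.15, 0.10, 0.05], [1.0, 1.0, 1.0, 1.0]))
```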

4.2 Lossy-to-lossless image coding

Lossy-to-lossless image coding results obtained with the designed IntFLTs are shown in this subsection. As targets for comparison, LiftLT [12], the 5/3-DWT and 9/7-DWT for JPEG 2000 [14], the HLT for JPEG XR [16], and the conventional 8 × 16 and 16 × 32 IntFLTs were applied. The conventional 8 × 16 and 16 × 32 IntFLTs are based on simple three-step lifting factorizations of the rotation matrices and scaling factors [14]. Periodic extension was used at the image boundaries, except for the DWTs and the HLT. To evaluate the transform performance fairly, a very common wavelet-based zerotree coder, SPIHT [39], was adopted for allc. Moreover, we used 8-bit grayscale test images of 512 × 512 size, such as Barbara.

First, the proposed IntFLTs and the conventional methods were applied to lossless image coding. The comparison of the lossless bit rate (LBR),
$$\text{LBR (bpp)} = \frac{\text{total number of bits (bit)}}{\text{total number of pixels (pixel)}},$$
is shown in Table 2.
Table 2 Comparison of lossless image coding (LBR (bpp))

| Test images | 5/3-DWT [14] | HLT [16] | (A) | (B) | (C) | (D) | (E) | (F) | (G) | (H) |
|---|---|---|---|---|---|---|---|---|---|---|
| Baboon | 6.25 | 6.23 | 6.24 | 6.24 | 6.23 | 6.24 | 6.23 | 6.22 | 6.22 | 6.22 |
| Barbara | 4.97 | 4.96 | 5.00 | 4.95 | 4.95 | 4.93 | 4.95 | 4.85 | 4.90 | 4.83 |
| Boat | 5.19 | 5.20 | 5.22 | 5.22 | 5.19 | 5.21 | 5.19 | 5.16 | 5.16 | 5.15 |
| Elaine | 5.26 | 5.27 | 5.30 | 5.26 | 5.27 | 5.25 | 5.28 | 5.21 | 5.25 | 5.20 |
| Finger | 5.88 | 5.89 | 5.91 | 5.79 | 5.85 | 5.78 | 5.89 | 5.75 | 5.84 | 5.75 |
| Finger2 | 5.64 | 5.62 | 5.65 | 5.57 | 5.57 | 5.56 | 5.63 | 5.51 | 5.55 | 5.51 |
| Goldhill | 5.08 | 5.12 | 5.21 | 5.20 | 5.17 | 5.19 | 5.18 | 5.15 | 5.15 | 5.14 |
| Grass | 6.09 | 6.09 | 6.11 | 6.09 | 6.08 | 6.08 | 6.10 | 6.07 | 6.08 | 6.07 |
| Lena | 4.58 | 4.64 | 4.74 | 4.77 | 4.71 | 4.75 | 4.71 | 4.69 | 4.66 | 4.67 |
| Pepper | 4.96 | 5.00 | 5.03 | 5.06 | 4.99 | 5.04 | 4.99 | 5.00 | 4.96 | 4.98 |
| Avg. | 5.39 | 5.40 | 5.44 | 5.42 | 5.40 | 5.40 | 5.42 | 5.36 | 5.38 | 5.35 |

A, the conventional 8 × 16 FLOT; B, the conventional 16 × 32 FLOT; C, the conventional 8 × 16 FLBT; D, the conventional 16 × 32 FLBT; E, the proposed 8 × 16 FLOT; F, the proposed 16 × 32 FLOT; G, the proposed 8 × 16 FLBT; and H, the proposed 16 × 32 FLBT.

If lossy compressed data are required, they can be obtained by truncating the lossless bitstream. The comparison of the peak signal-to-noise ratio (PSNR),
$$\text{PSNR (dB)} = 10 \log_{10} \frac{255^2}{\text{MSE}},$$
where MSE is the mean squared error, is shown in Table 3.
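For completeness, the two figures of merit used in this section can be computed as in the following sketch (our own helper functions, assuming 8-bit images with peak value 255).

```python
import numpy as np

def lbr_bpp(total_bits, total_pixels):
    """Lossless bit rate (LBR) in bits per pixel."""
    return total_bits / total_pixels

def psnr_db(original, reconstructed):
    """PSNR in dB for 8-bit images (peak value 255)."""
    diff = original.astype(float) - reconstructed.astype(float)
    mse = np.mean(diff ** 2)
    return 10.0 * np.log10(255.0 ** 2 / mse)
```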
Table 3 Comparison of lossy image coding (PSNR (dB))

| Test images | Comp. ratio | LiftLT [12] | 9/7-DWT [14] | HLT [16] | (A) | (B) | (C) | (D) | (E) | (F) | (G) | (H) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Baboon | 1:32 | 22.72 | 22.90 | 22.38 | 22.86 | 22.88 | 22.88 | 22.89 | 22.87 | 22.91 | 22.89 | 22.91 |
| | 1:16 | 24.98 | 25.11 | 24.67 | 25.11 | 25.17 | 25.14 | 25.15 | 25.13 | 25.19 | 25.16 | 25.19 |
| | 1:8 | 28.31 | 28.55 | 28.10 | 28.49 | 28.45 | 28.48 | 28.42 | 28.55 | 28.53 | 28.52 | 28.51 |
| Barbara | 1:32 | 28.01 | 27.56 | 27.01 | 27.80 | 28.72 | 28.02 | 28.84 | 27.83 | 28.77 | 28.03 | 28.90 |
| | 1:16 | 32.02 | 31.49 | 30.85 | 31.70 | 32.55 | 32.02 | 32.65 | 31.76 | 32.67 | 32.08 | 32.80 |
| | 1:8 | 37.07 | 36.28 | 36.00 | 36.33 | 36.65 | 36.60 | 36.66 | 36.59 | 37.13 | 36.84 | 37.19 |
| Boat | 1:32 | 29.20 | 29.43 | 28.80 | 28.93 | 28.98 | 29.21 | 29.08 | 28.97 | 29.04 | 29.23 | 29.13 |
| | 1:16 | 32.36 | 32.45 | 32.02 | 32.05 | 32.00 | 32.26 | 32.08 | 32.13 | 32.13 | 32.33 | 32.22 |
| | 1:8 | 35.70 | 35.50 | 35.21 | 35.20 | 35.07 | 35.23 | 35.04 | 35.44 | 35.43 | 35.46 | 35.40 |
| Elaine | 1:32 | 31.83 | 31.99 | 31.54 | 31.28 | 31.29 | 31.55 | 31.42 | 31.32 | 31.35 | 31.61 | 31.50 |
| | 1:16 | 32.86 | 32.98 | 32.20 | 32.39 | 32.63 | 32.53 | 32.74 | 32.49 | 32.78 | 32.62 | 32.87 |
| | 1:8 | 35.00 | 34.77 | 34.24 | 34.46 | 34.97 | 34.44 | 34.94 | 34.67 | 35.32 | 34.69 | 35.26 |
| Finger | 1:32 | 23.42 | 23.69 | 22.95 | 23.56 | 23.94 | 23.71 | 23.99 | 23.57 | 23.97 | 23.72 | 24.02 |
| | 1:16 | 26.62 | 26.92 | 26.31 | 26.76 | 27.27 | 26.92 | 27.29 | 26.79 | 27.32 | 26.94 | 27.34 |
| | 1:8 | 30.67 | 30.50 | 30.12 | 30.57 | 31.29 | 30.76 | 31.30 | 30.64 | 31.44 | 30.82 | 31.44 |
| Finger2 | 1:32 | 24.36 | 24.63 | 23.27 | 24.40 | 24.72 | 24.62 | 24.79 | 24.42 | 24.76 | 24.63 | 24.82 |
| | 1:16 | 27.73 | 28.04 | 27.09 | 27.84 | 28.19 | 28.09 | 28.26 | 27.86 | 28.25 | 28.12 | 28.32 |
| | 1:8 | 31.83 | 31.85 | 31.39 | 31.80 | 32.24 | 32.10 | 32.23 | 31.89 | 32.42 | 32.19 | 32.39 |
| Goldhill | 1:32 | 29.72 | 30.06 | 29.62 | 29.39 | 29.53 | 29.66 | 29.62 | 29.43 | 29.59 | 29.69 | 29.68 |
| | 1:16 | 32.35 | 32.37 | 32.02 | 31.97 | 32.06 | 32.15 | 32.10 | 32.05 | 32.20 | 32.22 | 32.25 |
| | 1:8 | 35.57 | 35.36 | 35.17 | 35.03 | 34.98 | 35.06 | 34.93 | 35.26 | 35.36 | 35.29 | 35.30 |
| Grass | 1:32 | 24.44 | 24.63 | 24.26 | 24.58 | 24.62 | 24.66 | 24.63 | 24.61 | 24.66 | 24.68 | 24.65 |
| | 1:16 | 26.65 | 26.73 | 26.37 | 26.76 | 26.87 | 26.81 | 26.87 | 26.78 | 26.92 | 26.84 | 26.92 |
| | 1:8 | 29.67 | 29.62 | 29.39 | 29.74 | 29.90 | 29.77 | 29.86 | 29.80 | 30.00 | 29.81 | 29.98 |
| Lena | 1:32 | 32.80 | 33.46 | 32.76 | 32.27 | 32.44 | 32.83 | 32.69 | 32.37 | 32.55 | 32.89 | 32.83 |
| | 1:16 | 36.04 | 36.32 | 35.90 | 35.42 | 35.42 | 35.92 | 35.65 | 35.59 | 35.71 | 36.08 | 35.96 |
| | 1:8 | 39.02 | 38.83 | 38.62 | 37.95 | 37.57 | 38.09 | 37.64 | 38.47 | 38.53 | 38.55 | 38.50 |
| Pepper | 1:32 | 32.35 | 32.89 | 32.52 | 31.62 | 31.73 | 32.33 | 31.99 | 31.68 | 31.84 | 32.38 | 32.13 |
| | 1:16 | 34.69 | 35.13 | 34.71 | 34.13 | 33.78 | 34.58 | 34.09 | 34.29 | 34.02 | 34.73 | 34.34 |
| | 1:8 | 37.07 | 36.79 | 36.39 | 36.18 | 35.94 | 36.21 | 35.91 | 36.50 | 36.44 | 36.53 | 36.41 |

A, the conventional 8 × 16 FLOT; B, the conventional 16 × 32 FLOT; C, the conventional 8 × 16 FLBT; D, the conventional 16 × 32 FLBT; E, the proposed 8 × 16 FLOT; F, the proposed 16 × 32 FLOT; G, the proposed 8 × 16 FLBT; and H, the proposed 16 × 32 FLBT.

Even though the proposed and conventional IntFLTs have the same transfer functions, the proposed IntFLTs achieve better coding than the conventional IntFLTs; in particular, the lossy image coding results show excellent performance. We consider that this is mainly due to the reduced number of rounding operations, as shown in Table 4, and the absence of large lifting coefficientsd. Moreover, note that the proposed IntFLTs have a more efficient implementation than the conventional IntFLTs because they are constructed from few operations and the direct-lifting of DCTs, which can reuse any existing software/hardware for DCTs. On the other hand, LiftLT and the 9/7-DWT often perform well in lossy image coding because they were designed for the lossy mode. However, they cannot preserve the high-frequency components in the images, as shown in Figure 4, whereas the proposed IntFLTs, especially the proposed 16 × 32 IntFLT, can preserve them.
Table 4 Comparison of number of rounding operations in each one-dimensional transform of M × 1 signals

| | Conventional FLTs | Proposed FLTs |
|---|---|---|
| 8 × 16 FLOT | 72 | 36 |
| 8 × 16 FLBT | 84 | 48 |
| 16 × 32 FLOT | 240 | 90 |
| 16 × 32 FLBT | 264 | 114 |

Figure 4

Comparison of a particular area of the reconstructed image Barbara at a bit rate of 0.25 bpp. (Left to right) 9/7-DWT, LiftLT, and 16 × 32 FLBT.

5 Conclusions

This paper presented integer fast lapped transforms (IntFLTs) for effective lossy-to-lossless image coding, constructed from few operations and the direct-lifting of discrete cosine transforms (DCTs). By merging many rounding operations and keeping the lifting coefficients small through the use of direct-lifting, the proposed IntFLTs achieved better lossy-to-lossless image coding than the conventional IntFLTs. The proposed IntFLTs can also preserve the high-frequency components in images. Since the direct-lifting can reuse any existing software/hardware for DCTs, the proposed IntFLTs have great potential for fast implementation, depending on the architecture design and the DCT algorithms. Furthermore, the proposed IntFLTs do not need any side information, unlike the IntDCT based on direct-lifting in our previous work.

Endnotes

a “The conventional IntFLTs” do not include LiftLT in this paper.

b Other lifting-based DCTs cannot reuse existing software/hardware for DCTs as-is.

c The block transform coefficients produced by the 2^k-channel (k ∈ ℕ) FLTs are rearranged into a k-level wavelet-like multi-resolution representation and then fed to the zerotree coder [40], e.g., a 3-level wavelet-like multi-resolution representation when M = 8.

d The IntFLT based on [20] has fewer rounding operations. However, it performs poorly due to its large lifting coefficients. For example, although its 8 × 16 FLOT has only five rounding operations in each 4 × 4 DCT, its lossless image coding result is 5.06 (bpp) for Barbara.

Declarations

Acknowledgements

The authors would like to thank the anonymous reviewers for providing many constructive suggestions that significantly improved the presentation of this paper. This work was supported by a JSPS Grant-in-Aid for Young Scientists (B), grant number 25820152.

Authors’ Affiliations

(1)
Faculty of Engineering, Information and Systems, University of Tsukuba
(2)
Department of Electronics and Electrical Engineering, Keio University

References

  1. Wallace GK: The JPEG still picture compression standard. IEEE Trans. Consum. Electron. 1992, 38: 18-34.
  2. Pennebaker W, Mitchell J: JPEG, Still Image Data Compression Standard. New York: Van Nostrand; 1993.
  3. Wiegand T, Sullivan GJ, Bjøntegaard G, Luthra A: Overview of the H.264/AVC video coding standard. IEEE Trans. Circuits Syst. Video Technol. 2003, 13(7): 560-576.
  4. Sullivan GJ, Ohm JR, Han WJ, Wiegand T: Overview of the high efficiency video coding (HEVC) standard. IEEE Trans. Circuits Syst. Video Technol. 2012, 22(12): 1649-1668.
  5. Rao KR, Yip P: Discrete Cosine Transform Algorithms. Boston: Academic Press; 1990.
  6. Jain AK: A fast Karhunen-Loeve transform for a class of random processes. IEEE Trans. Commun. 1976, 24(9): 1023-1029. doi:10.1109/TCOM.1976.1093409
  7. Chen WH, Smith CH, Fralick SC: A fast computational algorithm for the discrete cosine transform. IEEE Trans. Commun. 1977, 25(9): 1004-1009. doi:10.1109/TCOM.1977.1093941
  8. Wang Z: Fast algorithms for the discrete W transform and for the discrete Fourier transform. IEEE Trans. Acoust. Speech Signal Process. 1984, ASSP-32(4): 803-816.
  9. Lee BG: A new algorithm to compute the discrete cosine transform. IEEE Trans. Acoust. Speech Signal Process. 1984, 32(6): 1243-1245. doi:10.1109/TASSP.1984.1164443
  10. Wang Z: On computing the discrete Fourier and cosine transforms. IEEE Trans. Acoust. Speech Signal Process. 1985, 33(4): 1341-1344.
  11. Malvar HS: Signal Processing with Lapped Transforms. Norwood: Artech House; 1992.
  12. Tran TD: The LiftLT: fast lapped transforms via lifting steps. IEEE Signal Process. Lett. 2000, 7(6): 145-148.
  13. Skodras A, Christopoulos C, Ebrahimi T: The JPEG2000 still image compression standard. IEEE Signal Process. Mag. 2001, 18(5): 36-58. doi:10.1109/79.952804
  14. Daubechies I, Sweldens W: Factoring wavelet transforms into lifting steps. J. Fourier Anal. Appl. 1998, 4(3): 245-267.
  15. Dufaux F, Sullivan GJ, Ebrahimi T: The JPEG XR image coding standard. IEEE Signal Process. Mag. 2009, 26(6): 195-199.
  16. Tu C, Srinivasan S, Sullivan GJ, Regunathan S, Malvar HS: Low-complexity hierarchical lapped transform for lossy-to-lossless image coding in JPEG XR/HD Photo. In Proc. of SPIE Applications of Digital Image Processing XXXI. San Diego; 12 Aug 2008: 70730-70730.
  17. Sweldens W: The lifting scheme: a new philosophy in biorthogonal wavelet constructions. In Proc. of SPIE Wavelet Applications in Signal and Image Processing III. San Diego; 1 Sept 1995: 68-79.
  18. Sweldens W: The lifting scheme: a custom-design construction of biorthogonal wavelets. Appl. Comput. Harmon. Anal. 1996, 3(2): 186-200. doi:10.1006/acha.1996.0015
  19. Sweldens W: The lifting scheme: a construction of second generation wavelets. SIAM J. Math. Anal. 1997, 29(2): 511-546.
  20. Hao P, Shi Q: Matrix factorizations for reversible integer mapping. IEEE Trans. Signal Process. 2001, 49(10): 2314-2324. doi:10.1109/78.950787
  21. Chen YJ, Amaratunga KS: M-channel lifting factorization of perfect reconstruction filter banks and reversible M-band wavelet transforms. IEEE Trans. Circuits Syst. II 2003, 50(12): 963-976. doi:10.1109/TCSII.2003.820233
  22. Chen YJ, Oraintara S, Amaratunga KS: Dyadic-based factorizations for regular paraunitary filterbanks and M-band orthogonal wavelets with structural vanishing moments. IEEE Trans. Signal Process. 2005, 53: 193-207.
  23. Suzuki T, Tanaka Y, Ikehara M: Lifting-based paraunitary filterbanks for lossy/lossless image coding. In Proc. of EURASIP EUSIPCO'06. Florence; 4-8 Sept 2006: 1-5.
  24. She Y, Hao P, Paker Y: Matrix factorizations for parallel integer transformation. IEEE Trans. Signal Process. 2006, 54(12): 4675-4684.
  25. Iwamura S, Tanaka Y, Ikehara M: An efficient lifting structure of biorthogonal filter banks for lossless image coding. In Proc. of IEEE ICIP'07. San Antonio; 16-19 Sept 2007: 433-436.
  26. Suzuki T, Ikehara M: Lifting factorization based on block parallel system of M-channel perfect reconstruction filter banks. In Proc. of EURASIP EUSIPCO'09. Glasgow; 24-28 Aug 2009: 1-4.
  27. Suzuki T, Ikehara M: M-channel paraunitary filter banks based on direct lifting structure of building block and its inverse transform for lossless-to-lossy image coding. IEICE Trans. Fundamentals 2010, E93-A(8): 1457-1464. doi:10.1587/transfun.E93.A.1457
  28. Suzuki T, Ikehara M, Nguyen TQ: Generalized block-lifting factorization of M-channel biorthogonal filter banks for lossy-to-lossless image coding. IEEE Trans. Image Process. 2012, 21(7): 3220-3228.
  29. Komatsu K, Sezaki K: Reversible discrete cosine transform. In Proc. of IEEE ICASSP'98. Seattle; 12-15 May 1998: 1769-1772.
  30. Chen YJ, Oraintara S, Nguyen TQ: Integer discrete cosine transform (IntDCT). In Proc. of IEEE 2nd ICICS'99. Singapore; 7-10 Dec 1999: 1-5.
  31. Tran TD: The BinDCT: fast multiplierless approximation of the DCT. IEEE Signal Process. Lett. 2000, 7(6): 141-144.
  32. Liang J, Tran TD: Fast multiplierless approximations of the DCT with the lifting scheme. IEEE Trans. Signal Process. 2001, 49(12): 3032-3044. doi:10.1109/78.969511
  33. Chokchaitam S, Iwahashi M, Jitapunkul S: A new unified lossless/lossy image compression based on a new integer DCT. IEICE Trans. Inf. Syst. 2005, E88-D(2): 403-413.
  34. Suzuki T, Ikehara M: Design of block lifting-based discrete cosine transform type-II and IV. In Proc. of IEEE DSPWS'09. Marco Island; 4-7 Jan 2009: 480-484.
  35. Suzuki T, Ikehara M: Integer DCT based on direct-lifting of DCT-IDCT for lossless-to-lossy image coding. IEEE Trans. Image Process. 2010, 19(11): 2958-2965.
  36. Suzuki T, Ikehara M: Integer fast lapped orthogonal transform based on direct-lifting of DCTs for lossless-to-lossy image coding. In Proc. of IEEE ICASSP'11. Prague; 22-27 May 2011: 1525-1528.
  37. Suzuki T, Kyochi S, Tanaka Y, Ikehara M, Aso H: Multiplierless lifting based FFT via fast Hartley transform. In Proc. of IEEE ICASSP'13. Vancouver; 26-31 May 2013: 5603-5607.
  38. Strang G, Nguyen T: Wavelets and Filter Banks. Wellesley-Cambridge Press; 1996.
  39. Said A, Pearlman WA: A new, fast, and efficient image codec based on set partitioning in hierarchical trees. IEEE Trans. Circuits Syst. Video Technol. 1996, 6(3): 243-250. doi:10.1109/76.499834
  40. Tran TD, Nguyen TQ: A progressive transmission image coder using linear phase uniform filterbanks as block transforms. IEEE Trans. Image Process. 1999, 8(11): 1493-1507. doi:10.1109/83.799878

Copyright

© Suzuki and Ikehara; licensee Springer. 2013

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.