Video Coding Using 3D Dual-Tree Wavelet Transform

Wang, Beibei; Wang, Yao; Selesnick, Ivan; Vetro, Anthony

doi:10.1155/2007/42761

Research Article
Open access
Published: 13 March 2007

Video Coding Using 3D Dual-Tree Wavelet Transform

Beibei Wang¹,
Yao Wang¹,
Ivan Selesnick¹ &
…
Anthony Vetro²

EURASIP Journal on Image and Video Processing volume 2007, Article number: 042761 (2007) Cite this article

1931 Accesses
13 Citations
Metrics details

Abstract

This work investigates the use of the 3D dual-tree discrete wavelet transform (DDWT) for video coding. The 3D DDWT is an attractive video representation because it isolates image patterns with different spatial orientations and motion directions and speeds in separate subbands. However, it is an overcomplete transform with 4: 1 redundancy when only real parts are used. We apply the noise-shaping algorithm proposed by Kingsbury to reduce the number of coefficients. To code the remaining significant coefficients, we propose two video codecs. The first one applies separate 3D set partitioning in hierarchical trees (SPIHT) on each subset of the DDWT coefficients (each forming a standard isotropic tree). The second codec exploits the correlation between redundant subbands, and codes the subbands jointly. Both codecs do not require motion compensation and provide better performance than the 3D SPIHT codec using the standard DWT, both objectively and subjectively. Furthermore, both codecs provide full scalability in spatial, temporal, and quality dimensions. Besides the standard isotropic decomposition, we propose an anisotropic DDWT, which extends the superiority of the normal DDWT with more directional subbands without adding to the redundancy. This anisotropic structure requires significantly fewer coefficients to represent a video after noise shaping. Finally, we also explore the benefits of combining the 3D DDWT with the standard DWT to capture a wider set of orientations.

[1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22]

References

Hsiang S-T, Woods JW: Embedded video coding using invertible motion compensated 3-D subband/wavelet filter bank. Signal Processing: Image Communication 2001,16(8):705-724. 10.1016/S0923-5965(01)00002-9
Google Scholar
Xu J, Xiong Z, Li S, Zhang Y-Q: Memory-constrained 3-D wavelet transform for video coding without boundary effects. IEEE Transactions on Circuits and Systems for Video Technology 2002,12(9):812-818. 10.1109/TCSVT.2002.803231
Article Google Scholar
Andreopoulos Y, van der Schaar M, Munteanu A, Barbarien J, Schelkens P, Cornelis J: Fully-scalable wavelet video coding using in-band motion compensated temporal filtering. Proceedings of IEEE International Conference on Accoustics, Speech, and Signal Processing (ICASSP '03), April 2003, Hong Kong 3: 417-420.
Google Scholar
Secker A, Taubman D: Lifting-based invertible motion adaptive transform (LIMAT) framework for highly scalable video compression. IEEE Transactions on Image Processing 2003,12(12):1530-1542. 10.1109/TIP.2003.819433
Article Google Scholar
Joint Scalable Video Model 2.0 Reference Encoding Algorithm Description ISO/IEC JTC1/SC29/WG11/N7084. Buzan, Korea, April 2005
Kingsbury N: A dual-tree complex wavelet transform with improved orthogonality and symmetry properties. Proceedings of IEEE International Conference on Image Processing (ICIP '00), September 2000, Vancouver, BC, Canada 2: 375-378.
Google Scholar
Do MN, Vetterli M: The contourlet transform: an efficient directional multiresolution image representation. IEEE Transactions on Image Processing 2005,14(12):2091-2106.
Article MathSciNet Google Scholar
Reeves TH, Kingsbury NG: Overcomplete image coding using iterative projection-based noise shaping. Proceedings of IEEE International Conference on Image Processing (ICIP '02), September 2002, Rochester, NY, USA 3: 597-600.
Article Google Scholar
Sivaramakrishnan K, Nguyen T: A uniform transform domain video codec based on dual tree complex wavelet transform. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '01) , May 2001, Salt Lake City, Utah, USA 3: 1821-1824.
Google Scholar
Selesnick I, Li KY: Video denoising using 2D and 3D dual-tree complex wavelet transforms. Wavelets: Applications in Signal and Image Processing X, August 2003, San Diego, Calif, USA, Proceedings of SPIE 5207: 607-618.
Google Scholar
Selesnick I, Baraniuk RG, Kingsbury NC: The dual-tree complex wavelet transform. IEEE Signal Processing Magazine 2005,22(6):123-151.
Article Google Scholar
Wang B, Wang Y, Selesnick I, Vetro A: An investigation of 3D dual-tree wavelet transform for video coding. Proceedings of International Conference on Image Processing (ICIP '04), October 2004, Singapore 2: 1317-1320.
Google Scholar
Kim B-J, Xiong Z, Pearlman WA: Low bit-rate scalable video coding with 3-D set partitioning in hierarchical trees (3-D SPIHT). IEEE Transactions on Circuits and Systems for Video Technology 2000,10(8):1374-1387. 10.1109/76.889025
Article Google Scholar
Wang B, Wang Y, Selesnick I, Vetro A: Video coding using 3-D dual-tree discrete wavelet transforms. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '05), March 2005, Philadelphia, Pa, USA 2: 61-64.
Google Scholar
Xu D, Do MN: Anisotropic 2D wavelet packets and rectangular tiling: theory and algorithms. Wavelets: Applications in Signal and Image Processing X, August 2003, San Diego, Calif, USA, Proceedings of SPIE 5207: 619-630.
Google Scholar
Xu D, Do MN: On the number of rectangular tilings. IEEE Transactions on Image Processing 2006,15(10):3225-3230.
Article Google Scholar
Mallat SG, Zhang Z: Matching pursuits with time-frequency dictionaries. IEEE Transactions on Signal Processing 1993,41(12):3397-3415. 10.1109/78.258082
Article MATH Google Scholar
Gribonval R, Vandergheynst P: On the exponential convergence of matching pursuits in quasi-incoherent dictionaries. IEEE Transactions on Information Theory 2006,52(1):255-261.
Article MathSciNet MATH Google Scholar
Neff R, Zakhor A: Very low bit-rate video coding based on matching pursuits. IEEE Transactions on Circuits and Systems for Video Technology 1997,7(1):158-171. 10.1109/76.554427
Article Google Scholar
Bolcskei H, Hlawatsch F: Oversampled filter banks: optimal noise shaping, design freedom, and noise analysis. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '97), April 1997, Munich, Germany 3: 2453-2456.
Google Scholar
Hua J, Xiong Z, Wu X: High-performance 3-D embedded wavelet video (EWV) coding. Proceedings of 4th IEEE Workshop on Multimedia Signal Processing (MMSP '01), October 2001, Cannes, France 569-574.
Google Scholar
Boettcher JB, Fowler JE: Video coding using a complex wavelet transform and set partitioning. to appear in IEEE Signal Processing Letters, September 2007

Download references

Author information

Authors and Affiliations

Electrical and Computer Engineering Department, Polytechnic University, Brooklyn, NY, 11201, USA
Beibei Wang, Yao Wang & Ivan Selesnick
Mitsubishi Electric Research Laboratories, Cambridge, MA, 02139, USA
Anthony Vetro

Authors

Beibei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ivan Selesnick
View author publications
You can also search for this author in PubMed Google Scholar
Anthony Vetro
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Beibei Wang.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Wang, B., Wang, Y., Selesnick, I. et al. Video Coding Using 3D Dual-Tree Wavelet Transform. J Image Video Proc 2007, 042761 (2007). https://doi.org/10.1155/2007/42761

Download citation

Received: 14 August 2006
Revised: 14 December 2006
Accepted: 05 January 2007
Published: 13 March 2007
DOI: https://doi.org/10.1155/2007/42761

Video Coding Using 3D Dual-Tree Wavelet Transform

Abstract

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords