TR2005-019

Video Coding Using 3-D Dual-Tree Discrete Wavelet Transforms


    •  Wang, B., Wang, Y., Selesnick, I., Vetro, A., "Video Coding Using 3-D Dual-Tree Discrete Wavelet Transforms", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 2005, vol. 2, pp. 61-64.
      @inproceedings{Wang2005mar,
        author = {Wang, B. and Wang, Y. and Selesnick, I. and Vetro, A.},
        title = {Video Coding Using 3-D Dual-Tree Discrete Wavelet Transforms},
        booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
        year = 2005,
        volume = 2,
        pages = {61--64},
        month = mar,
        issn = {1520-6149},
        url = {https://www.merl.com/publications/TR2005-019}
      }
  • MERL Contact:
  • Research Area: Digital Video

Abstract:

This paper explores the use of a recently introduced 3-D dual-tree discrete wavelet transform (DDWT) for video coding. The 3-D DDWT is an attractive video representation because it isolates motion along different directions in separate subbands. However, it is an overcomplete transform with 8:1 or 4:1 redundancy. Building on two results, the effectiveness of Kingsbury's iterative projection-based noise shaping scheme in reducing the number of coefficients, and our prior investigation of the correlation between subbands at the same spatial/temporal location (in both the significance map and the actual coefficient values), we propose a new video coding scheme using the 3-D DDWT. The proposed video codec does not require motion compensation and provides better performance than the 3-D SPIHT codec, both objectively and subjectively, even though the 3-D DDWT produces many more raw coefficients than the conventional 3-D DWT. The proposed coder allows full scalability in the spatial, temporal, and quality dimensions.
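To make the idea of projection-based noise shaping on an overcomplete transform more concrete, the following is a minimal sketch under stated assumptions, not the codec described in the paper. It does not implement the 3-D DDWT; instead it uses a toy 2:1 redundant tight frame (identity plus DCT basis) as a stand-in. It also simplifies the selection rule: the scheme attributed to Kingsbury lowers a threshold gradually, whereas this sketch simply keeps the `m` largest coefficients at each pass. All function names (`tight_frame`, `noise_shape`, and so on) are hypothetical.

```python
import math

def dct_rows(n):
    """Rows of the orthonormal DCT-II matrix (one orthonormal basis)."""
    rows = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows

def tight_frame(n):
    """Toy stand-in for an overcomplete transform: identity and DCT rows,
    each scaled by 1/sqrt(2), giving 2n coefficients for n samples."""
    s = 1.0 / math.sqrt(2.0)
    eye = [[s if i == j else 0.0 for j in range(n)] for i in range(n)]
    dct = [[s * v for v in row] for row in dct_rows(n)]
    return eye + dct

def analyze(frame, x):
    """Forward transform: one inner product per frame vector."""
    return [sum(a * b for a, b in zip(row, x)) for row in frame]

def synthesize(frame, y):
    """Inverse transform (transpose of the analysis operator)."""
    n = len(frame[0])
    return [sum(frame[r][i] * y[r] for r in range(len(frame)))
            for i in range(n)]

def keep_largest(y, m):
    """Zero all but the m largest-magnitude coefficients."""
    order = sorted(range(len(y)), key=lambda r: abs(y[r]), reverse=True)[:m]
    keep = set(order)
    return [v if r in keep else 0.0 for r, v in enumerate(y)]

def residual_energy(frame, x, y):
    """Squared reconstruction error of the signal from coefficients y."""
    xr = synthesize(frame, y)
    return sum((a - b) ** 2 for a, b in zip(x, xr))

def noise_shape(frame, x, m, iters=30):
    """Simplified noise shaping: project the reconstruction error back
    into the coefficient domain, add it to the kept coefficients, and
    re-select the m largest."""
    y = keep_largest(analyze(frame, x), m)
    for _ in range(iters):
        xr = synthesize(frame, y)
        err = [a - b for a, b in zip(x, xr)]
        update = analyze(frame, err)
        y = keep_largest([a + b for a, b in zip(y, update)], m)
    return y

# Test signal: one DCT atom plus one spike, so no single basis is sparse.
n = 16
frame = tight_frame(n)
x = [3.0 * v for v in dct_rows(n)[3]]
x[5] += 2.0

plain = keep_largest(analyze(frame, x), 2)   # one-shot truncation
shaped = noise_shape(frame, x, 2)            # iterative refinement
```

Comparing `residual_energy(frame, x, plain)` with `residual_energy(frame, x, shaped)` shows how the iteration redistributes the truncation error onto the retained coefficients. For a tight frame with this update rule, the residual should not increase from one pass to the next.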

 

  • Related News & Events

    •  NEWS    ICASSP 2005: 4 publications by Anthony Vetro, Ajay Divakaran, Huifang Sun and others
      Date: March 18, 2005
      Where: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
      MERL Contacts: Anthony Vetro; Huifang Sun
      Brief
      • The papers "Fast Adaptive Fuzzy Post-Filtering for Coding Artifacts Removal in Interlaced Video" by Nie, Y., Kong, H.-S., Vetro, A. and Barner, K.; "Video Coding Using 3-D Dual-Tree Discrete Wavelet Transforms" by Wang, B., Wang, Y., Selesnick, I. and Vetro, A.; "A Companding Front End for Noise-Robust Automatic Speech Recognition" by Guinness, J., Raj, B., Schmidt-Nielsen, B., Turicchia, L. and Sarpeshkar, R.; and "Layered Dynamic Mixture Model for Pattern Discovery in Asynchronous Multi-Modal Streams" by Xie, L., Kennedy, L., Chang, S.-F., Divakaran, A., Sun, H. and Lin, C.-Y. were presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP).