Video Coding Using 3-D Dual-Tree Discrete Wavelet Transforms
| Citation: |
* Wang, B.; Wang, Y.; Selesnick, I.; Vetro, A., "Video Coding Using 3-D Dual-Tree Discrete Wavelet Transform", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), ISSN: 1520-6149, Vol. 2, pp. 61-64, March 2005 (IEEE Xplore) |
| MERL Report: | TR2005-019 |
This paper explores the use of a recently introduced 3-D dual-tree discrete wavelet transform (DDWT) for video coding. The 3-D DDWT is an attractive video representation because it isolates motion along different directions in separate subbands. However, it is an overcomplete transform with 8:1 or 4:1 redundancy. We propose a new video coding scheme using the 3-D DDWT that builds on two results: the effectiveness of the iterative projection-based noise shaping scheme proposed by Kingsbury in reducing the number of coefficients, and our prior investigation of the correlation between subbands at the same spatial/temporal location, both in the significance map and in the actual coefficient values. The proposed video codec does not require motion compensation and provides better performance than the 3-D SPIHT codec, both objectively and subjectively, even though the 3-D DDWT produces far more raw coefficients than the conventional 3-D DWT. The proposed coder allows full scalability in the spatial, temporal, and quality dimensions.
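
The idea of reducing the number of coefficients of an overcomplete transform by iterative projection can be illustrated on a toy problem. The sketch below is not the codec described in the paper and does not implement the 3-D DDWT or Kingsbury's exact scheme. It assumes a 1-D signal, a hypothetical 2:1 tight frame built from the identity and DCT bases, a fixed significance mask (the `keep` largest coefficients), and a unit feedback gain. All function names are illustrative. Under those assumptions, each pass projects the reconstruction error back into the transform domain and adds it to the retained coefficients, so the error with the same number of coefficients cannot increase.

```python
import math

def dct_matrix(n):
    """Orthonormal DCT-II basis, returned as a list of rows."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n)])
    return rows

def forward(x, basis):
    """Analysis with a 2:1 tight frame: identity rows and DCT rows, each scaled by 1/sqrt(2)."""
    s = 1.0 / math.sqrt(2.0)
    spatial = [s * v for v in x]
    spectral = [s * sum(b * v for b, v in zip(row, x)) for row in basis]
    return spatial + spectral

def inverse(y, basis):
    """Synthesis (transpose of the analysis operator); exact for unmodified coefficients."""
    n = len(basis)
    s = 1.0 / math.sqrt(2.0)
    out = [s * y[i] for i in range(n)]
    for k, row in enumerate(basis):
        c = s * y[n + k]
        for i in range(n):
            out[i] += c * row[i]
    return out

def sq_error(a, b):
    """Sum of squared differences between two equal-length sequences."""
    return sum((u - v) ** 2 for u, v in zip(a, b))

def noise_shape(x, basis, keep, iterations=50, gain=1.0):
    """Return (error after plain truncation, error after iterative projection)."""
    y = forward(x, basis)
    # Assumption: the mask is fixed to the `keep` largest-magnitude coefficients.
    order = sorted(range(len(y)), key=lambda i: abs(y[i]), reverse=True)
    mask = [False] * len(y)
    for i in order[:keep]:
        mask[i] = True
    y_kept = [v if m else 0.0 for v, m in zip(y, mask)]
    plain = inverse(y_kept, basis)
    for _ in range(iterations):
        # Project the reconstruction error back onto the retained coefficients.
        x_hat = inverse(y_kept, basis)
        err = [u - v for u, v in zip(x, x_hat)]
        w = forward(err, basis)
        y_kept = [v + gain * e if m else 0.0 for v, e, m in zip(y_kept, w, mask)]
    shaped = inverse(y_kept, basis)
    return sq_error(x, plain), sq_error(x, shaped)

if __name__ == "__main__":
    n = 16
    basis = dct_matrix(n)
    # Smooth signal plus one spike, so both halves of the frame are useful.
    x = [math.sin(0.3 * i) + (3.0 if i == 5 else 0.0) for i in range(n)]
    plain_err, shaped_err = noise_shape(x, basis, keep=6)
    print("plain truncation error:", plain_err)
    print("after iterative projection:", shaped_err)
```

A fixed mask and unit gain are chosen here only because they make the non-increasing error easy to reason about; a practical scheme may adapt the threshold between iterations.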