TR2007-086

Efficient MPEG-2 to H.264/AVC Transcoding of Intra-Coded Video


    •  Xin, J.; Vetro, A.; Sun, H.; Su, Y., "Efficient MPEG-2 to H.264/AVC Transcoding of Intra-Coded Video", EURASIP Journal on Applied Signal Processing, ISSN: 1110-8657, Vol. 2007, No. 1, pp. 217, January 2007.
      BibTeX Download PDF
      • @article{Xin2007jan,
      • author = {Xin, J. and Vetro, A. and Sun, H. and Su, Y.},
      • title = {Efficient MPEG-2 to H.264/AVC Transcoding of Intra-Coded Video},
      • journal = {EURASIP Journal on Applied Signal Processing},
      • year = 2007,
      • volume = 2007,
      • number = 1,
      • pages = 217,
      • month = jan,
      • issn = {1110-8657},
      • url = {http://www.merl.com/publications/TR2007-086}
      • }
  • MERL Contacts:
  • Research Areas:

    Digital Video, Multimedia


This paper presents an efficient transform-domain architecture and corresponding mode decision algorithms for transcoding intra-coded video from MPEG-2 to H.264/ACV. Low complexity is achieved in several ways. First, our architecture employs direct conversion of the transform coefficients, which eliminates the need for the inverse discrete cosine transform (DCT) and forward H.264/AVC transform. Then, within this transform-domain architecture, we perform macroblock-based mode decisions based on H.264/AVC transform coefficients, which is possible using a novel method of calculating distortion in the transform domain. The proposed method for distortion calculation could be used to make rate-distortion optimized mode decisions with lower complexity. Compared to the pixel-domain architecture with rate-distortion optimized mode decision, simulation results show that there is a negligible loss in quality incurred by the direct conversion of transform coefficients and the proposed transform-domain mode decision algorithms, while complexity is significantly reduced. To further reduce the complexity, we also propose two fast mode decision algorithms. The first algorithm ranks modes based on a simple cost function in the transform domain, then computes the rate-distortion optimal mode from a reduced set of ranked modes. the second algorithm exploits temporal correlations in the mode decision between temporally adjacent frames. Simulation results show that these algorithms provide additional computational savings over the proposed transform-domain architecture while maintaining virtually the same coding efficiency.