Abstract
This paper proposes a fast mode decision algorithm that accelerates the transcoding of videos into H.264 with spatial resolution down-scaling at an arbitrary ratio. The algorithm consists of three steps. First, an early-stop technique identifies the 16×16-mode blocks, which account for about 70% of all macroblocks. Second, a bottom-up merging process determines the mode of the remaining, non-early-stopped blocks. Third, half-pixel motion estimation further refines the acquired predictive motion vectors. To obtain the predictive motion vectors for the early-stop and merging processes, we propose a motion vector composition scheme that reuses the information in the pre-encoded input video to handle the spatial resolution down-scaling. Experimental results show that the algorithm is about four times faster than the Cascaded-Decoder-Encoder method, with a negligible PSNR drop and only a small bit-rate increase.
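The abstract does not give the composition formula or the early-stop criterion, so the following Python sketch only illustrates the general shape of such a pipeline under stated assumptions. It composes a predictive motion vector for each 8×8 output block as an area-weighted average of the overlapping input motion vectors (a common approach, not necessarily the authors' scheme), chooses 16×16 early when the four composed vectors agree, and otherwise merges bottom-up. The names InputBlock, compose_mv and decide_mode, the agreement test, and the threshold value are all hypothetical; the half-pixel refinement step is not covered.

```python
from dataclasses import dataclass
from typing import List, Tuple

MV = Tuple[float, float]


@dataclass(frozen=True)
class InputBlock:
    """A block of the pre-encoded input frame (hypothetical container)."""
    x: float
    y: float
    w: float
    h: float
    mv: MV  # motion vector in input-frame pixels


def _overlap(a0: float, a1: float, b0: float, b1: float) -> float:
    """Length of the overlap between intervals [a0, a1] and [b0, b1]."""
    return max(0.0, min(a1, b1) - max(a0, b0))


def compose_mv(blocks: List[InputBlock], ox: float, oy: float,
               size: float, ratio: float) -> MV:
    """Assumed composition rule: area-weighted mean of the input MVs that
    cover the output block, rescaled to output-frame pixels."""
    # Region of the input frame that the output block was down-scaled from.
    ix0, iy0 = ox * ratio, oy * ratio
    ix1, iy1 = (ox + size) * ratio, (oy + size) * ratio
    total = sx = sy = 0.0
    for b in blocks:
        area = (_overlap(ix0, ix1, b.x, b.x + b.w)
                * _overlap(iy0, iy1, b.y, b.y + b.h))
        total += area
        sx += area * b.mv[0]
        sy += area * b.mv[1]
    if total == 0.0:
        return (0.0, 0.0)  # no input block covers this region
    return (sx / total / ratio, sy / total / ratio)


def decide_mode(blocks: List[InputBlock], mbx: float, mby: float,
                ratio: float, threshold: float = 1.0) -> str:
    """Early stop to 16x16 if all four 8x8 MVs agree; otherwise merge
    bottom-up into 16x8 or 8x16, falling back to 8x8.
    The L1-distance test and the threshold are illustrative assumptions."""
    def close(a: MV, b: MV) -> bool:
        return abs(a[0] - b[0]) + abs(a[1] - b[1]) <= threshold

    tl = compose_mv(blocks, mbx, mby, 8, ratio)
    tr = compose_mv(blocks, mbx + 8, mby, 8, ratio)
    bl = compose_mv(blocks, mbx, mby + 8, 8, ratio)
    br = compose_mv(blocks, mbx + 8, mby + 8, 8, ratio)

    if close(tl, tr) and close(tl, bl) and close(tl, br):
        return "16x16"  # early stop
    if close(tl, tr) and close(bl, br):
        return "16x8"   # top and bottom halves each move coherently
    if close(tl, bl) and close(tr, br):
        return "8x16"   # left and right halves each move coherently
    return "8x8"
```

In this sketch, calling decide_mode(blocks, 0, 0, 1.5) classifies the output macroblock at the origin for a down-scaling factor of 1.5. A real transcoder would likely also weigh residual energy or rate-distortion cost before stopping early, which this example leaves out.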
Additional information
Project supported by the National Natural Science Foundation of China (No. 60573176) and the Key Technologies R&D Program of Zhejiang Province (Nos. 2005C23047 and 2004C11052), China
About this article
Cite this article
Bu, Jj., Mo, Lj., Chen, C. et al. Fast mode decision algorithm for spatial resolutions down-scaling transcoding to H.264. J. Zhejiang Univ. - Sci. A 7 (Suppl 1), 70–75 (2006). https://doi.org/10.1631/jzus.2006.AS0070
DOI: https://doi.org/10.1631/jzus.2006.AS0070