| 研究生: |
李育銘 Yu-Ming Lee |
|---|---|
| 論文名稱: |
低計算複雜度之H.264/AVC視訊編碼 Low Computational Complexity H.264/AVC Video Coding |
| 指導教授: |
林銀議
Yinyi Lin |
| 口試委員: | |
| 學位類別: |
博士 Doctor |
| 系所名稱: |
資訊電機學院 - 通訊工程學系 Department of Communication Engineering |
| 畢業學年度: | 100 |
| 語文別: | 英文 |
| 論文頁數: | 125 |
| 中文關鍵詞: | H.264/AVC視訊編碼壓縮 、模式決策演算法 、零區塊模式決策 、零區塊提前偵測 、多幅參考畫面之移動估測 、最佳碼率失真函數 、碼率失真函數 |
| 外文關鍵詞: | mode decision, early zero-block detection, rate distortion cost function, H.264/AVC, zero-block decision, multiple reference frames motion estimation, rate distortion optimization |
| 相關次數: | 點閱:12 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
H.264/AVC視訊編碼標準的編碼效能(coding performance)遠勝於之前的影像編碼標準。這是因為H.264/AVC使用了很多先進的技術,除了可變區塊大小、多幅參考畫面和四分之一精確度之移動估測技術外,其中更利用碼率失真最佳化技術(rate distortion optimization, RDO)檢查所有的畫面間模式與畫面內模式,以達到最佳的編碼效能。然而這些先進的技術造成H.264/AVC的編碼複雜度遠大於先前所有的視訊編碼標準。
為了降低H.264/AVC計算複雜度並維持其編碼影像品質,本論文分別進行快速模式決策演算法及快速多幅參考畫面演算法之研究。在第二章中,進行畫面間模式決策之研究,我們利用8×8及4×4零區塊以判別該巨區塊之移動特性(stationary/non-stationary)並提出先進零區塊模式決策演算法(advanced zero-block inter mode decision)。在第三章中,我們提出可選擇性多幅參考畫面移動估測演算法(selective multiple reference frames motion estimation),其中將多幅參考畫面視為穩定的高斯隨機程序,並利用其穩定特性以判別該巨區塊是否需要繼續參考下一張參考畫面(reference frames)。另外,提出兩種提前中止參考畫面移動估測的演算法(early termination),進一步降低其運算複雜度。在第四章中,我們提出快速畫面內模式決策(intra mode decision)及畫面內模式預測(intra mode prediction)演算法,首先利用該巨區塊之離散餘弦係數中的低頻成份以進行畫面內模式預測(intra prediction);接著利用畫面內模式預測中之殘餘值資訊,包括絕對轉換誤差及變異數,以減少畫面內模式預測中方向性候選模式(candidates)。由各章節實驗結果可以發現,我們針對畫面間模式決策、多幅參考畫面之移動估測及畫面內模式決策,所提出的演算法皆可以有效地降低H.264/AVC編碼之運算複雜度,並且維持一定的影像品質。
The H.264/AVC video coding standard achieves significantly better performance in both PSNR and visual quality at the same bit-rate compared to prior video coding standards. This is due to the fact that the H.264/AVC encoder features many advanced techniques such as variable block size motion estimation, multiple reference frames motion estimation (MRFME) and quarter-pixel motion estimation. One important technique is the uses of Lagrangian rate-distortion optimization (RDO) for inter mode decision as well as intra mode decision. The RDO technique is used to check all possible inter modes and intra modes to find the best coding result to obtain highest coding efficiency, but the computational load is far beyond prior video coding standards.
To reduce the computational complexity and maintain good coding performance, in this dissertation we investigate and develop efficient algorithms for H.264/AVC video coding. In Chapter 2 many efficient methods for inter mode decision have been investigated. We make use of both 8×8 and 4×4 zero-blocks to describe the temporal stationary or non-stationary for video sequences and propose an advanced zero-block inter mode decision algorithm. In Chapter 3, we propose a selective multiple reference frames motion estimation (SMRFME), which characterizes MRFME as a stationary Gaussian random process, and uses the stationary property to check whether a mode or a block is necessary to perform motion estimation on next reference frames. In addition, two early stop criteria to further lessen computational cost. In Chapter 4, we suggest an efficient algorithm which is accomplished in two stage. In the first stage the low-frequency AC components of discrete cosine transform (DCT) block of an MB is used to select intra 4×4 prediction (I4MB) or intra 16×16 prediction (I16MB); while in the second stage sum of absolute transform difference (SATD) coefficients, including SATD value and its variance, is used to skip improper mode for RDO mode decision. The experimental results show that our proposed algorithms achieve significant computation improvement when compared to other distinct algorithms, while maintaining good coding performance.
[1] T. Wiegand et al., “Overview of the H.264/AVC video coding standard,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 7, pp. 560-576, July 2003.
[2] A. Joch, F. Kossentini, H. Schwarz, T. Wiegand, and G. J. Sullivan, “Performance comparison of video coding standards using Lagrangian coder control,” in Proc. IEEE ICIP, Sept. 2002, vol. 2, pp. 501-504.
[3] Z. Zhou, M. T. Sun and Y. F. Hsu, “Fast variable block-size motion estimation algorithm based on merge and split procedures for H.264/MPEG-4 AVC,” in Proc. IEEE ISCAS, May 2004, vol. 3, pp. 725-728.
[4] K. C. Hou, M. J. Chen and C. T. Hsu, “Fast motion estimation by motion vector merging procedure for H. 264,” in Proc. IEEE ICME, July 2005, pp. 1444–1447.
[5] P. Yin, H. Y. Cheong, A. M. Tourapis and J. Boyce, “Fast mode decision and motion estimation for JVT/H.264,” in Proc. IEEE ICIP, Sept. 2003, vol. 3, pp. 853-856.
[6] Z. Zhou and M. T. Sun, “Fast macroblock inter mode decision and motion estimation for H.264/MPEG-4 AVC,” in Proc. IEEE ICIP, Oct. 2004, vol. 2, pp. 789-792.
[7] D. Wu, F. Pan, K. P. Lim, S. Wu, Z. G. Li, X. Lin, S. Rahardja and C. C. Ko, “Fast intermode decision in H.264/AVC video coding,” IEEE Trans. Circuits Syst. Video Technol., vol.15, no. 6, pp. 953-958, July 2005.
[8] X. Jing and L. P. Chau, “An efficient inter mode decision approach for H.264 video coding,” in Proc. IEEE ICME, July 2004, pp. 1111-1114.
[9] Y. M. Lee and Y. Lin, “Zero-block mode decision algorithm for H.264/AVC”, IEEE Trans. Image Processing, vol. 18, no. 3, pp. 524-533, Mar. 2009.
[10] A. C. Yu, “Efficient block-size selection algorithm for inter-frame coding in H.264/MPEG-4 AVC,” in Proc. IEEE ICASSP, May 2004, vol. 3, pp. 169-172.
[11] C. Kim and C. C. J. Kuo, “A feature-based approach to fast H.264 intra/inter mode decision,” in Proc. IEEE ISCAS, May 2005, vol. 1, pp. 308–311.
[12] G. Bailo, M. Bariani, I. Barbieri and M. Raggio, ”Search window size decision for motion estimation algorithm in H.264 video coder,” in Proc. IEEE ICIP, Oct. 2004, vol. 3, pp. 1453–1456.
[13] S. H. Ri, Y. Vatis and J. Ostermann, “Fast Inter-mode decision in an H.264/AVC encoder using mode and Lagrangian cost function,” IEEE Trans. Circuits Syst. Video Technol., vol.19, no. 2, pp. 302-306, Feb. 2009.
[14] H Zeng, C. Cai and K. K. Ma, “Fast mode decision for H.264/AVC based on macroblock motion activity,“ IEEE Trans. Circuits Syst. Video Technol., vol.19, no. 4, pp. 1-10, April 2009.
[15] L. A. Sousa, “General method for eliminating redundant computations in video coding,” Electron. Lett., vol. 36, pp. 306-307, Feb. 2000.
[16] Y. H. Moon, G. Y. Kim and J. H. Kim, ”An improved early detection algorithm for all-zero-blocks in H.264 video encoding,” IEEE Trans. Circuits Syst. Video Technol., vol. 15, pp. 1053–1057, Aug. 2005.
[17] H. Wang, S. Kwong and C.W. Kok, “An efficient mode decision algorithm for H.264/AVC encoding optimization,” IEEE Trans. Multimedia, vol. 9, No. 4, pp. 882-888, June 2007.
[18] D.Wu, K.P. Lim, T.K. Chiew, J.Y. Tham and K.H. Goh, “An adaptive thresholding technique for the detection of all-zeros blocks in H.264, in Proc. IEEE ICIP, Sept. 2007, pp. V-329-V-332.
[19] H. Wang, S. Kwong and C.W. Kok, “Efficient prediction algorithm of integer DCT coefficients for H.264/AVC optimization,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, No.4, pp.547-552, Apr. 2006.
[20] Z. Xie, Y. Liu, J. Liu and T. Yang, “A general method for detecting all-zero-blocks prior to DCT and quantization,” IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 2, Feb. 2007.
[21] W. Y. Chiu, Y. M. Lee and Y. Lin, “Efficient zero-block mode decision algorithm for High bit-rate coding in H.264/AVC,” in Proc. IEEE ISCAS, May 2010, vol. 3, pp. 4189-4192.
[22] C. W. Ting, L. M. Po and C. H. Cheung, “Centre-based frame selection algorithms for fast multi-frame motion estimation in H.264,” in Proc. IEEE ICNNSP, Dec. 2003, pp. 14-17.
[23] C. W. Ting, W. H. Lam and L. M. Po, “Fast block-matching motion estimation by recent-biased search for multiple reference frames,” in Proc. IEEE ICIP, Oct. 2004, pp. 1445-1448.
[24] M. E. Al-Mulla, C. N. Canagarajah and D. R. Bull, “Simplex minimization for single- and multiple-reference motion estimation,” IEEE Trans. Circuits Syst. Video Technol., vol. 11, no. 12, pp. 1209-1220, Dec. 2001.
[25] C. K. Chiang and S. H. Lai, “Fast multi-reference frame motion estimation via downhill simplex search,” in Proc. IEEE ICME, Jul. 2006, pp. 121-124.
[26] C. J. Duanmu, M. O. Ahmad and M. N. S. Swamy, “A continuous tracking algorithm for long-term memory motion estimation,” in Proc. IEEE ISCAS, May 2003, pp.356-359.
[27] M. J. Chen, Y. Y. Chiang, H. J. Li and M. C. Chi, “Efficient multi-frame motion estimation algorithm for MPEG-4 AVC/JVT/H.264,” in Proc. IEEE ISCAS, May 2004, pp. III-737-40.
[28] Y. H. Hsiao, T. H. Lee and P. C. Chang, “Short/long-term motion vector prediction in multi-frame video coding system,” in Proc. IEEE ICIP, Oct. 2004, pp. 1449-1452.
[29] M. J. Chen, G. L. Li, Y. Y. Chiang and C. T. Hsu, “Fast multiframe motion estimation algorithms by motion vector composition for the MPEG-4/AVC/ H.264 standard,” IEEE Trans. Multimedia, vol. 8, no. 3, pp. 478-487, Jun. 2006.
[30] Y. Su and M. T. Sun, “Fast multiple reference frame motion estimation for H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 3, pp. 447-452, Mar. 2006.
[31] S. D. Kim, and M. H. Sunwoo, “Efficient frame selection schemes for multi-frame and variable block size motion estimation, “in Proc. IEEE ICME, Jun. 2008, pp. 733-736.
[32] A. Chang, O. C. Au and Y. M. Yeung, “A novel approach to fast multi-frame selection for H.264 video coding,” in Proc. IEEE ISCAS, May 2003, pp.704-707.
[33] X. Li, E. Q. Li and Y. K. Chen, “Fast multi-frame motion estimation algorithm with adaptive search strategies in H.264,” in Proc. IEEE ICASSP, May 2004, pp. 369-372.
[34] Y. W. Huang, B. Y. Hsieh, S. Y. Chien, S. Y. Ma and L. G. Chen, “Analysis and complexity reduction of multiple reference frames motion estimation in H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 7, pp. 507-522, Apr. 2006.
[35] L. Shen, Z. Liu, Z. Zhang and G. Wang, “An adaptive and fast multiframe selection algorithm for H.264 video coding”, IEEE Signal Process. Letter, vol. 14, no. 11, pp. 836-839, Nov. 2007.
[36] Y. M. Lee, Y. F. Wang, J. R. Wang and Y. Lin, “An adaptive and efficient selective multiple reference frames motion estimation for H.264 video coding,” IEEE Pacific-Rim Symposium on Image and Video Technology (PSIVT), Jan. 2009, pp. 509-518.
[37] T. Y. Kuo and H. J. Lu, “Efficient reference frame selector for H.264”, IEEE Trans. Circuits Syst. Video Technol., vol. 18, no. 3, pp. 400-405, Mar. 2008.
[38] R. E. Blahut, Principles and Practice of Information Theory, Reading MA: Addison-Wesley, 1991
[39] T. Berger, Rate Distortion Theory, Prentice-Hall, Inc., N. J., 1971.
[40] H. Kim and Y. Altunbasak, “Low-complexity macroblock mode selection for H.264/AVC encoder,” in Proc. IEEE ICIP, 2004, pp. 765-768.
[41] C. Kim, H. H. Shih, and C. C. J. Kuo, “Feature-based intra-prediction mode decision for H.264,” in Proc. IEEE ICIP, 2004, pp. 769-772.
[42] C. H. Tseng, H. M. Wang, and J. F. Yang, “Enhanced intra-4x4 mode decision for H.264/AVC coders,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 8, pp. 1027-1032, Aug. 2006.
[43] M. G. Sarwer, and L. M. Po, “Bit rate estimation for cost function decision of H.264/AVC,” in Proc. ICME, July 2007, pp. 1579-1581.
[44] Y. K. Tu, J. F. Yang, and M.T. Sun, “Efficient Rate-distortion modeling for efficient H.264/AVC encoding,” IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 5, pp. 530-543, May 2007.
[45] C. L. Yang, L. M. Po, and W. H. Lam, “A fast H.264 intra prediction algorithm using macroblock properties,” in Proc. IEEE ICIP, pp. 461-464, 2004.
[46] F. Pan, X. Lin, S. Rahardja, K. P. Lim, Z. G. Li, D. Wu, and S. Wu, “Fast mode decision algorithm for intraprediction in H.264/AVC video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 15, no. 7, pp. 813-822, July 2005.
[47] C. Huang, S. Zhung, and S. H. Lai, “Efficient intra mode selection using image structure tensor for H.264/AVC,” in Proc. IEEE ICIP, Sept. 2007, pp. V-289-V-292.
[48] B. La, M. Eom, and Y. Choe, “Fast mode decision for intra prediction in H.264/AVC encoder,” in Proc. IEEE ICIP, pp. V-321-V-324, Sept. 2007.
[49] J. F. Wang, J. C. Wang, J. T. Chen, A. C. Tsai, and A. Paul,” A novel fast algorithm for intra mode decision in H.264/AVC encoders,” in Proc. IEEE ISCAS, July 2006, pp. 3498-3501.
[50] J. C. Wang, J. F. Wang, J. F. Yang, and J. T. Chen, “A fast mode decision algorithm and its VLSI design for H.264/AVC intra-prediction,” IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 10, pp. 1414-1422, Oct. 2005.
[51] A. C. Tsai, A. Paul, J. C. Wang, J. F. Wang,” Intensity gradient technique for efficient intra-prediction in H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 18, no. 5, pp. 694-698, May 2008.
[52] Z. Wei, K. N. Ngan, and H. Li, “An efficient intra-mode selection algorithm for H.264 based on edge classification and rate-distortion estimation”, Signal Processing: Image Communication, vol. 23, no. 9, pp. 699-710, Sept. 2008.
[53] H. Li, K. N. Ngan, and Z. Wei, “Fast and efficient method for block edge classification and its application in H.264/AVC video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 18, no. 6, pp. 756-768, June 2008.
[54] L. L. Wang and W. C. Siu, “H.264 fast intra mode selection algorithm based on direction different measure in the pixel domain,” in Proc. IEEE ICASSP, Apr. 2009, pp. 1037-1040.
[55] Z. He and S. K. Mitra, “A linear source model and a unified rate control algorithm for DCT video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 12, no. 11, pp. 970-982, Nov. 2002.
[56] Z. He and S. K. Mitra, “Optimum bit allocation and accurate rate control for video coding via ρ-domain source modeling,” IEEE Trans. Circuits Syst. Video Technol., vol. 12, no. 10, pp. 840-849, Oct. 2002.
[57] R. W. Buccigrossi and E. P. Simoncelli, “Image compression via joint statistical characterization in the wavelet domain,” IEEE Trans. Image Processing, vol. 8, no. 12, pp. 1688-1701, Dec. 1999.
[58] A. J. Viterbi and J. K. Omura, Principles of Digital Communication and Coding, New York: McGraw-Hill, 1979.
[59] A. N. Netravali and B. G. Haskell, Digital Pictures: Representation and Compression, New York: Plenum, 1988.
[60] Reference Software version 12.2, http://iphome.hhi.de/suehring/tml/download/
[61] G. Bjontegaard, “Calculation of average PSNR difference between RD curves”, ITU-T Q.6/16, Doc. VCEG-M33, Apr. 2001.