| 研究生: |
許哲維 Jhe-wei Syu |
|---|---|
| 論文名稱: |
H.264可調式視訊編碼之空間層際移動估測快速演算法 Fast Inter-Layer Motion Estimation Algorithm on Spatial Scalability in H.264/AVC Scalable Extension |
| 指導教授: |
張寶基
Pao-chi Chang |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 通訊工程學系 Department of Communication Engineering |
| 畢業學年度: | 97 |
| 語文別: | 中文 |
| 論文頁數: | 81 |
| 中文關鍵詞: | 快速演算法 、移動估測 、層際間預測 、可調式視訊編碼 |
| 外文關鍵詞: | motion estimation, inter-layer prediction, scalable video coing, fast algorithm |
| 相關次數: | 點閱:7 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
由於視訊壓縮編碼、網路以及儲存能力的發展,使得多媒體系統的應用越來越普及與廣泛,因此,如何有效率的提供資訊給不同條件限制的使用者變得非常重要,可調式視訊的概念也就此誕生。
H.264可調式視訊編碼標準(SVC)是目前最新的可調式視訊編碼技術,它以H.264為核心,並加入了層際間預測的概念,大幅改善了以往視可調式訊編碼標準的缺點,儘管如此,運算複雜度太過龐大一直是SVC面臨的最大問題,特別在開啟層際間預測之空間可調架構下,增強層移動估測複雜度幾乎占了整體複雜度的90%以上。
有鑑於此,本論文利用了增強層兩種預測移動向量間的關係,以及增強層各模式間最佳移動估測方向的相關性,提出了一套應用在開啟層際間預測之空間可調架構下的移動估測快速演算法。實驗結果顯示,與原始JSVM 9.12全域搜尋相比,最多可節省67.4%的運算複雜度,而最多只犧牲不到0.05dB的視訊品質。
With the improvements of video coding technology, network infrastructures, storage capacity, and CPU computing capability, the applications of multimedia systems become wider and more popular. Therefore, how to efficiently provide video sequences to users under different constraints is very important, and scalable video coding is one of the best solutions to this problem.
H.264 scalable extension (SVC) that is constructed based on H.264/AVC is the most recent scalable video coding standard. SVC utilizes the inter-layer prediction to substantially improve the coding efficiency comparing with the prior scalable video coding standards. Nevertheless, this technique results in extremely large computation complexity which obstructs it from practical use. Especially on spatial scalability, the complexity of the enhancement layer motion estimation occupies above 90% of the total complexity. The main objective of this work is to reduce the computation complexity while maintaining both the video quality and the bit-rate.
This thesis proposes a fast inter-layer motion estimation algorithm on temporal and spatial scalabilities for SVC. We utilize the relation between two motion vector predictors from the base layer as well as the enhancement layer respectively and the correlation between all the modes to reduce the number of search times. The simulation results show that the proposed algorithm can save the computation complexity up to 67.4% compared to JSVM9.12 with less than 0.0476dB video quality degradation.
[1] Advanced Video Coding for Generic Audiovisual Services, ITU-T Rec. H.264 and ISO/IEC 14496-10 (MPEG-4 AVC), ITU-T and ISO/IEC JTC 1, Version 1: May 2003, Version 2: May 2004, Version 3: Mar. 2005, Version 4: Sept. 2005, Version 5 and Version 6: June 2006, Version 7: Apr. 2007, Version 8 (including SVC extension): Consented in July 2007.
[2] T. Wiegand, G. J. Sullivan, G. Bjontegaard, and A. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Trans. Circuits Syst. Video Tech., vol. 13, no. 7, pp. 560-576, Jul. 2003.
[3] K. P. Lim, G. Sullivan, and T. Wiegand, “Text Description of Joint Model Reference Encoding Methods and Decoding Concealment Methods,” Joint Video Team, Doc. JVT-N046, Jan. 2005.
[4] Generic Coding of Moving Pictures and Associated Audio Information-Part 2: Video, ITU-T Rec. H.262 and ISO/IEC 13818-2 (MPEG-2 Video), ITU-T and ISO/IEC JTC 1, Nov. 1994.
[5] Video Coding for Low Bit Rate communication, ITU-T Rec. H.263, ITU-T, Version 1: Nov. 1995, Version 2: Jan. 1998, Version 3: Nov. 2000.
[6] Coding of audio-visual objects-Part 2: Visual, ISO/IEC 14492-2 (MPEG-4 Visual), ISO/IEC JTC 1, Version 1: Apr. 1999, Version 2: Feb. 2000, Version 3: May 2004.
[7] H. Schwarz, D. Marpe, and T. Wiegand, “Hierarchical B Pictures,” Joint Video Team, Doc. JVT-P014, Jul. 2005.
[8] H. Schwarz, D. Marpe, and T. Wiegand, “Analysis of hierarchical B-pictures and MCTF,” in Proc. ICME, Toronto, ON, Canada, Jul. 2006, pp. 1929-1932
[9] M. Wien, H. Schwarz, and T. Olbaum, “Performance Analysis of SVC,” IEEE Trans. Circuits Syst. Video Tech., vol. 17, no. 9, pp. 1194-1203, Sep. 2007.
[10] H. Schwarz, D. Marpe, and T. Weigand, “Overview of the Scalable Video Coding Extension of the H.264/AVC Standard,” IEEE Trans. Circuits Syst. Video Tech., vol. 17, no. 9, pp. 1103-1120, Sep. 2007.
[11] C. A. Segall and Gary J. Sullivan, “Spatial Scalability Within the H.264/AVC Scalable Video Coding Extension,” IEEE Trans. Circuits Syst. Video Tech., vol. 17, no. 9, pp. 1121-1135, Sep. 2007.
[12] H. C. Huang, W. H. Peng, T. Chiang, and H. M. Hang, “Advances in the Scalable Amendment of H.264/AVC,” IEEE Communications Magazine, vol. 45, no. 1, pp. 68-77, Jan. 2007.
[13] H. Schwarz and M. Wien, “The Scalable Video Coding Extension of the H.264/AVC Standard,” IEEE Signal Processing Magazine, vol. 25, no. 2, pp. 135-141, Mar. 2008.
[14] E. Francois, J. Viéron, G. Marquant, N. Burdin, and P. Lopez, “Generic Extended Spatial Scalability,” Joint Video Team, Doc. JVT-O041, Apr. 2005.
[15] J. Viéron, E. Francois, and N. Burdin, “CE10: Unified Motion Upsampling in Extended Spatial Scalability,” Joint Video Team, Doc. JVT-P019, Jul. 2005.
[16] G. Marquant, E. Francoise, N. Burdin, P. Lopez, and J. Viéron, “Extended Spatial Scalability for Nondyadic Video Formats: From SDTV to HDTV,” presented at the SPIE VCIP, Beijing, China, Jul. 2005.
[17] H. Li, Z. G. Li, and C. Wen, “Fast Mode Decision for Temporal Scalable Video Coding,” in Proc. Picture Coding Symp., Beijing, China, Apr. 2006.
[18] H. Li and Z. G. Li, “Fast Mode Decision Algorithm for Inter-frame Coding in Fully Scalable Video Coding,” IEEE Trans. Circuits Syst. Video Tech., vol. 16, no. 7, pp. 889-895, Jul. 2006.
[19] S. Lim, L. Yang, and B. Jeon, “Fast Coding Mode Decision for Scalable Video Coding,” International Conference on Advanced Communication Technology(ICACT), vol. 3, pp. 1897-1900, Feb. 2008.
[20] H. Li, Z. G. Li, C. Wen, and L. P. Chau, “Fast Mode Decision for Spatial Scalable Video Coding,” IEEE International Symposium on Circuits and Systems, pp. 3005-3008, May. 2006.
[21] H. Li, Z. G. Li, and C. Wen, “Fast Mode Decision for Coarse Grain SNR Scalable Video Coding,” IEEE International Conference on Acoustics, Speech and Signal Processing(ICSSP), vol. 2, pp. 545-548, May. 2006.