Online Statistics Based Fast Mode Decision for Scalable Video Coding with CGS


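Graduate student:	Bo-Zong Wu (吳柏宗)
Thesis title:	Online Statistics Based Fast Mode Decision for Scalable Video Coding with CGS (以線上統計為基礎應用於CGS可調式編碼器之快速模式決策)
Advisor:	Chih-Wei Tang (唐之瑋)
Oral defense committee:
Degree:	Master
Department:	College of Electrical Engineering and Computer Science - Department of Communication Engineering
Graduation academic year:	98 (ROC calendar, i.e. academic year 2009-2010)
Language:	Chinese
Number of pages:	69
Chinese keywords:	快速模式決策 (fast mode decision), 貝氏分類器 (Bayesian classifier), 線上統計 (online statistics), 可調式視訊編碼 (scalable video coding)
English keywords:	fast mode decision, Bayesian classifier, online statistics, scalable video coding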

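Scalable video coding (SVC) greatly improves coding efficiency through inter-layer prediction, but it also greatly increases computational complexity. Reducing encoding complexity across different video sequences and coding parameters is key to making SVC more widely applicable. For CGS scalability, this thesis proposes a fast mode decision algorithm based on online statistics, consisting of a training stage and an acceleration stage. In the training stage, we collect online statistics on the inter-layer mode distribution, the inter-layer RD cost increase, and the intra-layer RD cost increase correlation, so that the statistics adapt to different coding settings and different video content. These statistics are then used to design a Bayesian classifier and an early termination algorithm that together achieve fast mode decision. In the acceleration stage, the loss functions of the Bayesian classifier are used to reduce the number of candidate modes and to order them, and the early termination algorithm further reduces the number of modes tested. Experiments show that the proposed algorithm saves about 73% of the encoding time; if only the acceleration stage is considered, it saves up to about 78% of the encoding time, with almost no PSNR drop or bitrate increase across different quantization parameters (QP) and video content.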
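The abstract does not give the exact form of the loss functions, so the following is only a sketch of how a Bayesian classifier could rank candidate modes, under stated assumptions. The symbols are hypothetical: $b$ is the mode of the co-located base-layer macroblock, $k$ ranges over the possible optimal enhancement-layer modes, $P(k \mid b)$ is the inter-layer mode distribution estimated during the training stage, and $\lambda(m, k)$ is the average RD cost increase observed when mode $m$ is chosen although $k$ is optimal.

```latex
% Conditional risk (expected RD cost increase) of testing mode m,
% given the base-layer mode b. All symbols are illustrative assumptions.
R(m \mid b) = \sum_{k} \lambda(m, k)\, P(k \mid b),
\qquad \lambda(k, k) = 0 .

% Candidate modes are then tested in order of increasing risk,
% and modes whose risk exceeds a chosen bound T may be skipped:
\mathcal{C}(b) = \{\, m : R(m \mid b) \le T \,\}.
```

Under this reading, a mode that is rarely optimal can still stay in the candidate set if wrongly excluding it would be expensive, which matches the abstract's point that both the mode distribution and the RD cost increase are taken into account.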

Scalable video coding (SVC) has high coding complexity because RD optimization (RDO) tests all candidate modes. In this thesis, we propose an online statistics based fast mode decision algorithm for SVC with CGS scalability. The algorithm consists of a training stage and an acceleration stage. In the training stage, the encoder collects the correlations of inter-layer mode distributions and of intra-layer and inter-layer RD cost increases. In this way, we consider not only the probability distributions of the optimal modes but also the RD cost increase caused by selecting a non-optimal mode. In the acceleration stage, a Bayesian classifier and an early termination algorithm are applied to reduce the number of candidate modes in RDO. Because the statistics are gathered online, the encoder can adapt its decision rule to limit the RD performance loss for different sequences and QP sets. The experimental results show that the proposed algorithm saves about 73% of the encoding time. Counting the acceleration stage alone, it saves up to 78% of the encoding time with negligible RD performance loss.
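The two stages described above can be illustrated with a minimal Python sketch. It is not the thesis implementation: the mode set `MODES`, the class `OnlineModeStats`, the functions `candidate_modes` and `fast_mode_decision`, the `keep` limit, and the `early_stop_cost` threshold are all hypothetical names and parameters chosen for illustration. In particular, the thesis derives its early termination rule from inter-layer RD cost increase statistics, whereas this sketch uses a plain fixed threshold.

```python
from collections import defaultdict

# Illustrative mode set; the real encoder's candidate list may differ.
MODES = ["SKIP", "16x16", "16x8", "8x16", "8x8", "INTRA"]


class OnlineModeStats:
    """Training-stage statistics, gathered while the encoder runs full RDO.

    Assumption: during training every mode in MODES is tested, so each
    (tested mode, optimal mode) pair that matters has been observed.
    """

    def __init__(self):
        # counts[b][k]: how often enhancement-layer mode k was optimal when
        # the co-located base-layer macroblock used mode b.
        self.counts = defaultdict(lambda: defaultdict(int))
        # Accumulated RD cost increase for choosing m when k was optimal.
        self.loss_sum = defaultdict(float)
        self.loss_n = defaultdict(int)

    def update(self, base_mode, rd_costs):
        """Record one macroblock; rd_costs maps each tested mode to its RD cost."""
        best = min(rd_costs, key=rd_costs.get)
        self.counts[base_mode][best] += 1
        for mode, cost in rd_costs.items():
            self.loss_sum[(mode, best)] += cost - rd_costs[best]
            self.loss_n[(mode, best)] += 1

    def posterior(self, base_mode):
        """Empirical P(optimal mode = k | base-layer mode)."""
        total = sum(self.counts[base_mode].values())
        if total == 0:
            return {k: 1.0 / len(MODES) for k in MODES}
        return {k: self.counts[base_mode][k] / total for k in MODES}

    def loss(self, mode, best):
        """Mean RD cost increase of picking `mode` when `best` was optimal."""
        n = self.loss_n[(mode, best)]
        return self.loss_sum[(mode, best)] / n if n else 0.0

    def risk(self, base_mode):
        """Expected RD cost increase of each mode given the base-layer mode."""
        post = self.posterior(base_mode)
        return {m: sum(self.loss(m, k) * post[k] for k in MODES) for m in MODES}


def candidate_modes(stats, base_mode, keep=3):
    """Order modes by increasing risk and keep the `keep` lowest-risk ones."""
    if not stats.counts.get(base_mode):
        return list(MODES)  # no statistics yet: fall back to full search
    risk = stats.risk(base_mode)
    return sorted(MODES, key=risk.get)[:keep]


def fast_mode_decision(stats, base_mode, rd_cost_of, early_stop_cost):
    """Acceleration stage: test ranked candidates, stop early if cost is low."""
    best_mode, best_cost = None, float("inf")
    tested = []
    for mode in candidate_modes(stats, base_mode):
        cost = rd_cost_of(mode)  # one RDO evaluation for this mode
        tested.append(mode)
        if cost < best_cost:
            best_mode, best_cost = mode, cost
        if best_cost <= early_stop_cost:
            break  # good enough: skip the remaining candidates
    return best_mode, tested


if __name__ == "__main__":
    stats = OnlineModeStats()
    sample = {"SKIP": 100, "16x16": 120, "16x8": 150,
              "8x16": 150, "8x8": 200, "INTRA": 400}
    for _ in range(8):
        stats.update("SKIP", sample)
    print(candidate_modes(stats, "SKIP"))
```

Ranking by expected RD cost increase rather than by probability alone means the first mode tested is the one least likely to hurt RD performance, which is also what makes an early exit after one or two evaluations reasonable.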

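Abstract (Chinese)	i
Abstract (English)	ii
Table of Contents	iv
List of Figures	vi
List of Tables	viii
Chapter 1	Introduction	1
1	Preface	1
2	Research Motivation	2
3	Research Method	3
4	Thesis Organization	3
Chapter 2 Introduction to the H.264 Scalable Video Coder	4
1 Introduction to the Scalable Video Coder	4
2 Scalable Video Coding Architecture	5
2.1 Temporal Scalability	6
2.2 Spatial Scalability	7
2.3 SNR Scalability	8
3 Introduction to Base-Layer Coding Techniques	8
4 Tools for Improving Scalable Video Coding Efficiency	10
4.1 Inter-Layer Intra Prediction	11
4.2 Inter-Layer Motion Prediction	12
4.3 Inter-Layer Residual Prediction	12
5 Summary	13
Chapter 3 State of the Art in Fast Algorithms for Scalable Video Coders	14
1 Fast Motion Estimation Algorithms	14
2 Fast Mode Decision Algorithms	15
3 Early Termination Mode Decision Algorithms	17
4 Summary	18
Chapter 4	The Fast Macroblock Mode Algorithm Proposed in This Thesis	19
1 Online Statistics Based Fast Macroblock Mode Decision Algorithm	19
2 Bayesian Classifier Based Fast Macroblock Mode Decision	20
2.1 Inter-Layer Macroblock Correlation	20
2.2 Intra-Layer Block Mode Distribution Correlation	31
2.3 Intra-Layer RD Cost Increase Correlation	32
2.4 Applying the Bayesian Classifier to Fast Mode Decision	34
3 Early Termination Macroblock Mode Decision Algorithm	36
3.1 Inter-Layer RD Cost Increase Correlation	36
3.2 Applying Inter-Layer RD Cost Increase to the Early Termination Block Mode Decision Algorithm	38
Chapter 5 Experimental Results	43
1 Experimental Environment and Parameter Settings	43
2 Fast Mode Decision Algorithm with the Bayesian Classifier	44
3 Fast Mode Decision Algorithm Combining the Bayesian Classifier and Early Termination	47
4 Summary	53
Chapter 6 Conclusions and Future Work	54
References	55
Publications	57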


[1] T. Wiegand, G. Sullivan, and A. Luthra, “Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification (ITU-T Rec. H.264 | ISO/IEC 14496-10 AVC),” ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, JVT-G050r1, 2003.
[2] T. Wiegand, G. Sullivan, J. Reichel, H. Schwarz, and M. Wien, “Joint Draft 10 of SVC Amendment,” ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, JVT-W201, 2007.
[3] H. Schwarz, D. Marpe, and T. Wiegand, “Overview of the scalable video coding extension of the H.264/AVC standard,” IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 9, pp. 1103–1120, Sept. 2007.
[4] H.-C. Huang, W.-H. Peng, T. Chiang, and H.-M. Hang, “Advances in the scalable amendment of H.264/AVC,” IEEE Communications Magazine, vol. 45, no. 1, pp. 68-77, Jan. 2007.
[5] H. Schwarz, D. Marpe, and T. Wiegand, “Hierarchical B pictures,” Joint Video Team, Doc. JVT-P074, Jul. 2005.
[6] M. Wien, H. Schwarz, and T. Oelbaum, “Performance analysis of SVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 9, pp. 1194-1203, Sept. 2007.
[7] H. Schwarz, D. Marpe, and T. Wiegand, “Analysis of hierarchical B-pictures and MCTF,” in Proceedings of IEEE International Conference on Multimedia and Expo, pp. 1929-1932, 2006.
[8] Generic Coding of Moving Pictures and Associated Audio Information-Part 2: Video, ITU-T Rec. H.262 and ISO/IEC JTC 1, 2009.
[9] Video Coding for Low Bit Rate Communication, ITU-T Rec. H.263, ITU-T, Version 1: Nov. 1994, Version 2: Jan. 1998, Version 3: Nov. 2000.
[10] Coding of Audio-Visual Objects-Part 2: Visual, ISO/IEC 14496-2 (MPEG-4 Visual), ISO/IEC JTC 1, Version 1: Apr. 1999, Version 2: Feb. 2000, Version 3: May 2004.
[11] Y.-D. Wu and C.-W. Tang, “The motion directed fast mode decision for spatial and CGS scalable video coding,” in Proceedings of IEEE International Conference on Broadband Multimedia Systems and Broadcasting, Apr. 2008.
[12] H.-C. Lin, W.-H. Peng, and H.-M. Hang, “Low-complexity macroblock mode decision algorithm for combined CGS and temporal scalability,” document JVT-W029.doc, ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, San Jose, CA, Apr. 2007.
[13] H. Li, Z.-G. Li, and C. Wen, “Fast mode decision for coarse grain SNR scalable video coding,” in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. 545-548, 2006.
[14] H. Li, Z. G. Li, C. Wen, and S. Xie, “Fast mode decision for coarse granular scalability via switched candidate mode sets,” in Proceedings of IEEE International Conference on Multimedia and Expo, pp. 1323-1326, Jul. 2007.
[15] H. Li and Z. G. Li, “Fast mode decision algorithm for inter-frame coding in fully scalable video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 7, pp. 889-895, July 2006.
[16] S.-T. Kim, K. Konda, C.-S. Park, C.-S. Cho, and S.-J. Ko, “Fast mode decision algorithm for inter-layer coding in scalable video coding,” IEEE Trans. Consumer Electronics, vol. 55, no. 3, pp. 1572-1580, 2009.
[17] C.-H. Yeh, K.-J. Fan, M.-J. Chen, and G.-L. Li, “Fast mode decision algorithm for scalable video coding using Bayesian theorem detection and Markov process,” IEEE Trans. Circuits Syst. Video Technol., vol. 20, no. 4, Apr. 2010.
[18] J. Ren and N. Kehtarnavaz, “Fast adaptive early termination for mode selection in H.264 scalable video coding,” in Proceedings of IEEE International Conference on Image Processing, pp. 2464-2467, 2008.
[19] C.-S. Park, B.-K. Dan, H. Choi, and S.-J. Ko, “A statistical approach for fast mode decision in scalable video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 19, no. 12, pp. 1915-1920, Dec. 2009.
[20] S.-W. Jung, S.-J. Beak, C.-S. Park, and S.-J. Ko, “Fast mode decision using all zero block detection for fidelity and spatial scalable video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 20, no. 2, pp. 201-206, Feb. 2010.
[21] H.-C. Lin, W.-H. Peng, and H.-M. Hang, “Fast context-adaptive mode decision algorithm for scalable video coding with combined coarse-grain quality (CGS) and temporal scalability,” IEEE Trans. Circuits Syst. Video Technol., vol. 20, no. 5, pp. 732-748, May 2010.
[22] J. Reichel, H. Schwarz, and M. Wien, Joint Scalable Video Model JSVM-9, document JVT-V202.doc, ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, Jan. 2007.
