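Fast Reference Picture List Decision Algorithm for Multi-view Video Coding | National Central University Electronic Theses and Dissertations System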


Graduate student:	Yi-Cin Dong (董宜芩)
Thesis title:	Fast Reference Picture List Decision Algorithm for Multi-view Video Coding (多視角視訊編碼之快速參考畫面方向決策)
Advisor:	Chih-Wei Tang (唐之瑋)
Oral defense committee:
Degree:	Master
Department:	College of Electrical Engineering and Computer Science - Department of Communication Engineering
Academic year of graduation:	98 (ROC calendar)
Language:	Chinese
Number of pages:	81
Keywords (Chinese):	多視角視訊編碼、快速演算法、參考畫面方向、預測方向
Keywords (English):	multi-view video coding, fast algorithm, reference picture list, prediction direction

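Compared with single-view video, multi-view video offers users richer information, but it also comes with a considerable amount of data. Single-view video coding exploits only inter-frame similarity in the temporal direction to reduce redundancy, whereas multi-view video coding also exploits the correlation between frames in the view direction. How to exploit data redundancy in both the temporal and view directions, while reducing computational complexity more effectively and without sacrificing image quality, is therefore a research topic worth investigating. The fast algorithm proposed in this thesis combines two parts: fast inter-frame prediction direction decision (FIPD) and fast reference picture list decision (FRPL). In enhancement views, FIPD refers to the motion characteristics of the corresponding blocks in temporally neighboring pictures and in neighboring views to decide quickly whether the current block uses only the temporal prediction direction. FRPL can further be applied to both the base view and the enhancement views: based on the minimum rate-distortion cost of motion or disparity estimation for the 16×16 block mode, it obtains the reference picture list and inter-frame prediction direction (temporal or inter-view) that the current block is likely to prefer, and then quickly decides the reference picture list and inter-frame prediction direction for the remaining block modes of that block, which reduces the number of searches. The proposed algorithm can be combined with other fast mode decision algorithms to achieve higher coding efficiency. Experimental results with the JMVC 6.0 encoder show that the proposed algorithm saves about 55% to 62% of encoding time, with a negligible drop in PSNR and a bit rate that stays within an acceptable range.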

Multi-view video can provide users with rich, entertaining experiences. Because it comes with a huge amount of data, the development of multi-view video coding (MVC) has been stimulated. Although MVC exploits inter-view correlation, it also increases computational complexity. How to develop a fast algorithm without losing image quality is therefore an important issue for MVC. In this thesis, we propose a novel algorithm that combines a fast predictor of inter-frame prediction direction (FIPD) with a fast decision of the reference picture list for motion and disparity estimation (FRPL). In enhancement views, the fast inter-frame direction predictor may select temporal prediction only, according to the motion characteristics of the block to be coded. Next, in both the base view and the enhancement views, the reference picture list of each inter mode candidate of the current block follows the decision made for the 16×16 macroblock mode of the same block. The inter-frame prediction of one macroblock is thus reduced to a single prediction direction with a single reference picture list. Experimental results show that the proposed scheme reduces encoding time by up to 62%, with negligible PSNR loss and bit-rate increase, compared with the original JMVC 6.0.
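The decision flow described in the abstract can be illustrated with a minimal sketch. It is not the thesis's JMVC implementation: the function name `frpl_search`, the candidate set (two directions times two reference picture lists), the list of smaller block modes, the `temporal_only` flag standing in for the FIPD outcome, and the toy cost table are all assumptions made for illustration. The sketch evaluates every allowed candidate for the 16×16 mode, keeps the one with the minimum rate-distortion cost, and then tests the remaining block modes with that single candidate only.

```python
from typing import Callable, Dict, List, Tuple

# A candidate is (prediction direction, reference picture list).
# Both the labels and the candidate set are illustrative assumptions.
Candidate = Tuple[str, str]

ALL_CANDIDATES: List[Candidate] = [
    ("temporal", "L0"), ("temporal", "L1"),
    ("inter_view", "L0"), ("inter_view", "L1"),
]
SMALLER_MODES: List[str] = ["16x8", "8x16", "8x8"]  # assumed remaining modes


def frpl_search(rd_cost: Callable[[str, Candidate], float],
                temporal_only: bool = False):
    """Return (best_mode, best_candidate, best_cost, number_of_searches).

    rd_cost(mode, candidate) is a hypothetical callback that runs motion or
    disparity estimation and returns the rate-distortion cost.
    temporal_only mimics an FIPD decision to skip inter-view prediction.
    """
    candidates = [c for c in ALL_CANDIDATES
                  if not temporal_only or c[0] == "temporal"]
    searches = 0

    # Step 1: search every allowed candidate for the 16x16 mode.
    costs: Dict[Candidate, float] = {}
    for cand in candidates:
        costs[cand] = rd_cost("16x16", cand)
        searches += 1
    preferred = min(costs, key=costs.get)
    best = ("16x16", preferred, costs[preferred])

    # Step 2: the other modes reuse the preferred direction and list only.
    for mode in SMALLER_MODES:
        cost = rd_cost(mode, preferred)
        searches += 1
        if cost < best[2]:
            best = (mode, preferred, cost)
    return (*best, searches)


def toy_cost(mode: str, cand: Candidate) -> float:
    """Made-up costs, used only to exercise the sketch."""
    base = {"16x16": 100.0, "16x8": 95.0, "8x16": 98.0, "8x8": 110.0}[mode]
    penalty = {("temporal", "L0"): 0.0, ("temporal", "L1"): 5.0,
               ("inter_view", "L0"): 12.0, ("inter_view", "L1"): 20.0}[cand]
    return base + penalty


print(frpl_search(toy_cost))                      # 7 searches
print(frpl_search(toy_cost, temporal_only=True))  # 5 searches
```

With these toy numbers, an exhaustive search over four modes and four candidates would need 16 cost evaluations, while the sketch needs 7, or 5 when inter-view prediction is skipped. The counts only illustrate where the savings come from and say nothing about the encoding-time figures reported in the thesis.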

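Abstract (Chinese)
Abstract
Acknowledgments
Table of Contents
List of Figures
List of Tables
Chapter 1	Introduction
1.	Preface
2.	Research Motivation
3.	Research Method
4.	Thesis Organization
Chapter 2	Introduction to Multi-view Video Coders
1.	Overview of Single-view Video Coders
1.1.	Picture Prediction Methods
1.2.	B-picture Characteristics
1.3.	Optimal Prediction Mode Decision
2.	Current State of Multi-view Video Coding
3.	Introduction to Multi-view Video Coding
3.1.	Multi-view Video Coder Architecture
3.2.	Prediction Direction Selection Based on B-pictures
4.	Summary
Chapter 3	Existing Fast Algorithms for Multi-view Video Coding
1.	Fast Inter-frame Coding Algorithms
2.	Fast Algorithms for Inter-frame Prediction Direction Decision and Block Mode Decision
3.	Fast Algorithms for Inter-frame Reference Picture List Decision and Block Mode Decision
4.	Summary
Chapter 4	Proposed Fast Multi-view Video Coding Algorithm
1.	Multi-view Video Coding Prediction Structure Adopted in This Thesis
2.	Fast Reference Picture List Decision Algorithm (FRPL)
2.1.	Probability Analysis of the Optimal Block Mode
2.1.1.	Probability Analysis of the Optimal Block Mode under the Overall Prediction Structure
2.1.2.	Probability Analysis of the Optimal Block Mode for Different Prediction Directions
2.1.3.	Probability Analysis of Optimal Temporal and Inter-view Prediction
2.2.	Analysis of Prediction Direction and Reference Picture List
2.3.	Proposed Fast Reference Picture List Decision Algorithm
3.	Combining Fast Reference Picture List Decision (FRPL) and Fast Inter-frame Prediction Direction Decision (FIPD)
3.1.	Fast Inter-frame Prediction Direction Decision Algorithm
3.2.	Proposed Combination of Fast Reference Picture List Decision (FRPL) and Fast Inter-frame Prediction Direction Decision (FIPD)
4.	Summary
Chapter 5	Experimental Results
1.	Simulation Environment and Experimental Parameters
2.	Experimental Results of the Fast Reference Picture List Decision Algorithm (FRPL)
3.	Experimental Results of the Combination with the Fast Inter-frame Prediction Direction Decision (FIPD) Algorithm
4.	Summary
Chapter 6	Conclusions and Future Work
References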

                                

