| 研究生: |
董宜芩 Yi-Cin Dong |
|---|---|
| 論文名稱: |
多視角視訊編碼之快速參考畫面方向決策 Fast Reference Picture List Decision Algorithm for Multi-view Video Coding |
| 指導教授: |
唐之瑋
Chih-Wei Tang |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 通訊工程學系 Department of Communication Engineering |
| 畢業學年度: | 98 |
| 語文別: | 中文 |
| 論文頁數: | 81 |
| 中文關鍵詞: | 多視角視訊編碼 、快速演算法 、參考畫面方向 、預測方向 |
| 外文關鍵詞: | multi-view video coding, fast algorithm, reference picture list, prediction direction |
| 相關次數: | 點閱:10 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
相較於單視角視訊,多視角視訊可提供使用者更豐富的資訊,但也伴隨著可觀的資料量。單視角視訊編碼僅開發時間方向之畫面間相似性,以降低多餘的運算量,而多視角視訊編碼則開發視角方向之畫面間的關聯性。因此,如何同時開發時間與視角方向的資料多餘性,並更有效降低計算複雜度,不失影像品質是值得探討的研究課題。本論文提出之快速演算法結合兩部分,分別為快速畫面間預測方向決策(FIPD)與快速參考畫面方向決策(FRPL)。快速畫面間預測方向決策在enhancement views中,可參考時間方向相鄰畫面與相鄰視角之對應區塊的運動特性,快速決定目前區塊是否僅採取時間預測方向。而快速參考畫面方向決策則進而可應用於base view與enhancement views,以16×16區塊模式之運動或視角估測之最小rate distortion cost為依據,取得目前區塊可能較偏好之參考畫面方向與畫面間預測方向(時間或視角),快速決策該區塊其餘不同區塊模式的參考畫面方向與畫面間預測方向,以降低搜尋次數。此提出之演算法可與其他快速模式決策演算法互相結合,以達到更高之編碼效能。由JMVC 6.0編碼器之實驗結果顯示,在PSNR品質可忽略些許降低且位元率也在可接受範圍的情況下,本論文所提出之演算法在可節省約55%至62%的編碼時間。
Multi-view videos can provide users rich entertaining experiences. Since they accompany with a huge amount of data, the development of multi-view video codings is stimulated. Although multi-view video coding explores the inter-view correlation, the computational complexity is also increased. Therefore, how to develop a fast algorithm without losing image quality is an important issue of MVC. In this thesis, we propose a novel algorithm by combining the fast predictor of inter-frame prediction direction (FIPD) and the fast decision of reference picture list of motion and disparity estimation (FRPL). In enhancement views, the fast inter frame direction predictor may prefer temporal prediction according to the motion characteristics of the block to be coded. Next, the reference picture list of each inter mode candidate of the current block follows the decision of the 16×16 macroblock mode of the same block in base view and enhancement views. Therefore, the inter frame prediction of one macroblock is reduced to a single prediction direction with a single reference picture list. The experimental results show that the proposed scheme reduces up to 62% of encoding time with negligible loss of PSNR and bit-rate increase compared with the original JMVC 6.0.
[1] Video codec for audiovisual services at p_64 kbit/s, ITU-T recommendation H.261, ITU-T, version 1: November 1990, version 2: March 1993.
[2] Generic coding of moving pictures and associated audio information—part 2: Video, ITU-T recommendation H.262 and ISO/IEC 13818-2 (MPEG-2 Video), ITU-T and ISO/IEC JTC 1, November 1994.
[3] Video coding for low bit rate communication, ITU-T Rec. H.263, ITU-T, version 1: November 1995, version 2: January 1998, version 3: November 2000.
[4] T. Wiegand, G. Sullivan, G. Bjontegaard, and A. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, no. 7, pp. 560-576, July 2003.
[5] J.-P. Lin and C.-W. Tang, “A fast direction predictor of inter frame prediction for multi-view video coding,” in Proceedings of IEEE International Symposium on Circuits and Systems, Taiwan, May 2009.
[6] M. Flierl and B. Girod, “Generalized B pictures and the draft H.264/AVC video compression standard,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, pp. 587-597, July 2003.
[7] C. Sullivan and T. Wiegand, “Rate-distortion optimization for video compression,” IEEE Signal Processing Magzine, vol. 15, pp. 74-90, November 1998.
[8] T. Wiegand, H. Schwarz, A. Joch, F. Kossentini, and G. J. Sullivan, “Rate-constrained coder control and comparison of video coding standards,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, pp. 688-703, July 2003.
[9] Y. Chen, Y.-K. Wang, K. Ugur, M. Hannuksela, J. Lainema, and M. Gabbouj, “The emerging MVC standard for 3D video services,” EURASIP Journal on Advances in Signal Processing, vol. 2009, March 2009.
[10] M. Tanimoto, “Free viewpoint television—FTV,” in Proceedings of IEEE International Conference on Picture Coding Symposium, San Francisco, December 2004.
[11] ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, “Comparative study of MVC prediction structures,” JVT-P014, Marrakech, Morocco, January 2007.
[12] P. Merkle, A. Smolic, K. M‥uller, and T. Wiegand, “Efficient prediction structures for multiview video coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 11, pp. 1461-1473, November 2007.
[13] H. Schwarz, D. Marpe, and T. Wiegand, “Hierarchical B pictures,” JVT-P014, Poznan, Poland, July 2005.
[14] ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, “Joint multiview video model (JMVM) 1.0” JVT-T209, Klagenfurt, Austria, July 2006.
[15] ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, “JMVC 1.0 software” JVT-AA212, Geneva, April 2008.
[16] L. Shen, Z. Liu, T. Yan, Z. Zhang, and P. An, “View-adaptive motion estimation and disparity estimation for low complexity multiview video coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no. 6, pp. 925–930, June 2010.
[17] P. He, M. Yu, Z. Peng, and G. Jiang, “Fast mode selection and disparity estimation for multiview video coding,” in Proceedings of 3rd IEEE International Symposium on Intelligent Information Technology Application Workshops, pp. 109-121, November 2009.
[18] W. Zhu, W. Jiang, and Y. Chen, “A fast inter mode decision for multiview video coding,” in Proceedings of IEEE International Conference on Information Engineering and Computer Science, pp. 1-4, December 2009.
[19] L. Shen, Z. Liu, S. Liu, Z. Zhang and P. An , “Selective disparity estimation and variable size motion estimation based on motion homogeneity for multi-view coding,” IEEE Transactions on Broadcasting, vol. 55, no.4, pp. 761-766, December 2009.
[20] J.-C. Chiang, L.-M. Liu, and W.-N. Lie, “A hierarchical two-stage neural-classifier for mode decision of H.264/AVC stereo video encoding,” in Proceedings of IEEE International Conference on 3DTV, pp. 317-320, May 2008.
[21] Z. Peng, M. Yu, and G. Jiang, “A fast encoding algorithm for multiview video coding, ” in Proceedings of IEEE International Symposium on Intelligent Information Technology Applications, vol. 1, pp. 497-500, November 2009.
[22] Y. Si, M. Yu, Z. Peng, and G. Jiang, “New multi-reference frame selection for multiview video coding,” in Proceedings of International Symposium on Intelligent Information Systems and Applications, pp. 39-42, November 2009.
[23] Z.-Y. Chen, J.-W. Syu, and P.-C. Chang, “Fast inter-layer motion estimation algorithm on spatial scalability in H.264/AVC scalable extension,” accepted by IEEE International Conference on Multimedia & Exposition, July 2010.
[24] D.-H. Han and Y.-L. Lee, “Fast mode decision using global disparity vector for multiview video Coding,” in Proceedings of IEEE International Conference on Future Generation Communication and Networking Symposia, December 2008.
[25] ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, “Common test conditions for multiview video coding”, JVT-T207, Klagenfurt, Austria, July, 2006.
[26] A. Vetro, M. McGuire, W. Matusik, A. Behrens, J. Lee, H. Pfister, and Mitsubishi Electric Research Labs(USA), “Multiview video test sequences from MERL,” ISO/IEC JTC1/SC29/WG11 m12077, Busan, Korea, April 2005.