| 研究生: |
林函萱 Han-hsuan Lin |
|---|---|
| 論文名稱: |
基於遮蔽及色彩資訊之適應性深度視訊編碼 Adaptive |
| 指導教授: |
唐之瑋
Chih-wei Tang |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 通訊工程學系 Department of Communication Engineering |
| 論文出版年: | 2014 |
| 畢業學年度: | 102 |
| 語文別: | 中文 |
| 論文頁數: | 78 |
| 中文關鍵詞: | 3D high-efficiency video coding (3D-HEVC) 、multi-view video coding (MVC) 、合成視角 、遮蔽 、垂直紋理複雜度 、rate-distortion optimization (RDO) |
| 外文關鍵詞: | 3D high-efficiency video coding (3D-HEVC), multi-view video coding (MVC), synthesized view, occlusion, vertical texture complexity, rate-distortion optimization |
| 相關次數: | 點閱:14 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
多視角視訊轉播漸成為趨勢,而自由視角雖可提供多視角供使用者選擇觀賞,但合成視角畫面品質仍有改進空間,此可由3D視訊編碼端提升合成畫面品質。因此,本論文提出針對深度視訊編碼,提升合成視角畫面中高複雜度垂直紋理及遮蔽之部分區域品質,其設計共包含兩個部分。從先前的研究成果得知,合成視角的失真除了受深度圖的品質影響,也和色彩視訊的垂直紋理複雜度有關聯性,因此第一部分參考色彩視訊區塊之垂直紋理複雜度,判斷深度視訊之目前區塊是否為易造成合成視角影像失真區域,以決定適用的深度視訊編碼模式決策之RD cost,此部分採用多視角視訊編碼架構MVC (multi-view video coding)。由於立體視覺影像中,遮蔽區域的深度資訊不確定性會影響合成視角畫面品質,因此在第二部分,本論文利用left right check (LRC)找出左右視角所對應的遮蔽圖,再結合由色彩視訊的重建畫面垂直紋理複雜度資訊,將區塊分成不同等級,對rate-distortion optimization (RDO)之RD cost採取不同的Lagrange參數調整,此部分採用3D-HEVC (3D high-efficiency video coding)編碼架構。由修改3D-HEVC之參考軟體HTM6.0的深度視訊編碼器實驗結果顯示,本論文所使用的演算法在第一部份實驗BDBR平均上升0.25%,而BDPSNR平均下降0.01dB,第二部份的實驗結果顯示平均BDBR少0.22%,合成視角的平均BDPSNR下降約0.01dB,而遮蔽區域且垂直紋理複雜度高之區域其合成視角畫面品質提升。
The tendency of TV service is broadcasting with multi-view videos. Although free-view TV provides optional views to users, there is still the room to improve the quality of color videos synthesized from 3D reconstructed videos. Thus, this thesis proposes adaptive depth video coding to improve the quality of regions which are occluded in the stereopsis and have highly vertical texture in the reconstruct color video. The research is composed of two parts. From the previous research result, we know distortions in synthesized views depend on not only coding distortions in the depth video but also the vertical textures in the reconstructed color video. Consequently, the first part consults the vertical complexity of textures in the reconstructed color video to detect regions with more distortions in the synthesized view. The appropriate RD cost function for depth video coding is selected. The implementation is based on multi-view video coding (MVC). Since the uncertainty of occluded regions in the stereoscopy reduces the quality of synthesized view, this thesis applies left-right check (LRC) based occlusion detection. The information of the occluded map and the vertical texture complexity map is fused to select proper Lagrange parameter in the RD cost function for each largest coding unit (LCU) in the left and right views. The implementation is based on 3D high-efficiency video coding. Our experimental results show that the first part of the proposed scheme has 0.25% BDBR increment and only loss 0.01dB BDPSNR in virtual views compared with the original JMVC 6.0.3. In the experiment of the second part of the proposed scheme, the result shows that the average BDBR decrease is 0.22% and loss of BDPSNR is 0.01dB, with better synthesized viewing quality in the blocks with high vertical complexity and occlusion.
[1] Y.-S. Ho and K.-J. Oh, “Overview of multi-view video coding,” in Proceedings of IEEE International Workshop on Systems, Signals and Image Processing, pp. 5-12, June 2007.
[2] K. Muller, H. Schwarz, D. Marpe, C. Bartnik, S. Bosse , H. Brust, T. Hinz, H. Lakshman, P. Merkle, F.H. Rhee, G. Tech, M. Winken, and T. Wiegand, “ 3D high-efficiency video coding for multi-view video and depth data,” IEEE Transactions on Image Processing, Vol. 22, No. 9, pp. 3366-3378, Sept. 2013.
[3] K.-J. Oh, A. Vetro, and Y.-S. Ho, “Depth coding using a boundary reconstruction filter for 3-D video system,” IEEE Transactions on Circuits and Systems for Video Technology, Vol. 21, No. 3, pp. 350-359, Mar. 2011.
[4] V. A. Nguyen, M. Dongbo, and M.N. Do, “ Efficient techniques for depth video compression using weighted mode filtering,” IEEE Transactions on Circuits and Systems for Video Technology, Vol. 23, No. 2, pp. 189-202, Feb. 2013.
[5] B. T. Oh, J. Lee, and D.-S. Park, “ Depth map coding based on synthesized view distortion function,” IEEE Journal of Selected Topics in Signal Processing, Vol. 5, No. 7, pp. 1344-1352, Nov. 2011.
[6] 呂志宏,以高品質合成視角為導向之快速深度視訊編碼模式決策,國立中央大學通訊工程學系,碩士論文,民國101年6月。
[7] G. Egnal and R.P. Wildes, “Detecting binocular half-occlusions: empirical comparisons of five approaches,” IEEE Transections on Pattern Analysis and Matching Intelligence, Vol. 25, No. 8, pp. 1127-pp.1133, Nov. 2002.
[8] G.-J. Sullivan, J. Ohm, W.-J. Han, and T. Wiegand,“Overview of the high efficiency video coding (HEVC) standard,” IEEE Transactions on Circuits and Systems for Video Technology, Vol. 22, No. 12, pp. 1649 – 1668, Dec. 2012.
[9] H. Schwarz, D. Marpe, and T. Wiegand, “Analysis of hierarchical B pictures and MCTF,” in Proceedings of IEEE International Conference on Multimedia and Expo, pp. 1929-1932, July 2006.
[10] T. Wiegand, G. Sullivan, G. Bjøntegaard, and A. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Transactions on Circuits and Systems for Video Technology, Vol. 13, No. 7, pp. 560-576, July 2003.
[11] X. Shen and Y. Lu, “CU splitting early termination based on weighted SVM,” EURASIP Journal on Image and Video Processing, Vol. 2013, No. 1,pp. 1-11, Dec. 2013.
[12] ISO/IEC JTC1/SC29/WG11, “High Efficiency Video Coding (HEVC) Test Model 10 (HM10) Encoder Description,” Doc JCTVC-L1002_v3, Geneva, Jan. 2013.
[13] ISO/IEC JTC1/SC29/WG11, “3D-HEVC Test Model 3,” Doc JCT3V-C1005_d0, Geneva, Jan. 2013.
[14] K. Muller, P. Merkle, G. Tech ,and T. Wiegand, “3D video coding with depth modeling modes and view synthesis optimization,” in Proceedings of Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 1-4, Hollywood, Canada, Dec. 2012.
[15] W. Hu, O. C. Au, L. Sun, W. Sun, L. Xu, and Y. Li, “Adaptive depth map filter for blocking artifact removal and edge preserving,” in Proceedings of IEEE International Symposium on Circuits and Systems, pp. 369-372, Seoul, May 2012.
[16] D. De Silva, W. Fernando, H. Kodikaraarachchi, S. Worrall, and A. Kondoz, “A depth map post-processing framework for 3D-TV systems based on compression artifact analysis,” IEEE Journal of Selected Topics in Signal Processing, No. 99, Aug. 2011.
[17] V. A. Nguyen, D. Min, and M.N. Do, “ Efficient techniques for depth video compression using weighted mode filtering,” IEEE Transactions on Circuits System for Video Technology, Vol. 23, No. 2, pp. 189-202, Feb. 2013.
[18] C. Lee and Y.-S. Ho, “ View synthesis using depth map for 3D video,” in Proceedings of Asia-Pacific Signal and Information Processing Association on Annual Summit and Conference, pp. 350-357, Oct. 2009.
[19] G. Cernigliaro, M. Naccari, F. Jaureguizar, J. Cabrera, and N. Garcia, “Depth perceptual video coding for free viewpoint video based on H.264/AVC,” in Proceedings of Picture Coding Symposium (PCS), pp. 153-156, Krakow, May 2012.
[20] S. Ince and J. Konrad, “Geometry-based estimation of occlusions from video frame pairs,” IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp.933-936, March 2005.
[21] C.-J. Tsai, C.-W. Tang, C.-H. Chen, and Y.-H. Yu, “Adaptive rate-distortion optimization using perceptual hint,” IEEE International Conference on Multimedia and Expo, Vol. 1, pp. 667-670, Taipei, June 2004.
[22] ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, “JMVC 1.0 software,” JVT-AA212, Geneva, April 2008.
[23] ISO/IEC MPEG & ITU-T VCEG, “Multi-view Video plus Depth (MVD) Format for Advanced 3D Video System,” Doc. JVT-W-100, April 2007.
[24] ISO/IEC JTC1/SC29/WG11, “View synthesis algorithm in view synthesis reference software 2.0(VSRS2.0),” Doc. M16090, Feb. 2009.
[25] ISO/IEC JTC1/SC29/WG11, “Common Test Conditions of 3DV Core Experiments,” Doc. JCT3V-C1100, Geneva, Jan. 2013.
[26] Z. Peng, M. Yu, G. Jiang, Y. Si, and F. Chen, “Virtual view synthesis oriented fast depth video encoding algorithm,” in Proceedings of IEEE International Conference on Industrial and Information System, Vol. 1, pp. 204-207, Dalian, July 2010.
[27] G. Bjontegaard, “Calculation of average PSNR difference between RD-curves,” ITU-T Q6/SG16, Doc. VCEG-M33, April 2001.
[28] ISO/IEC JTC1/SC29/WG11, “JCT-3V AHG Report: MV-HEVC and 3D-HEVC Software Integration (AHG5),” Doc. JCT3V-H0005, Valencia, March 2014.