| 研究生: |
林加典 Chia-Tien Lin |
|---|---|
| 論文名稱: |
H.264降解析度轉換編碼中模式決策與移動再估測之研究 Mode Decision and Motion Re-estimation for H.264/AVC Video Downscaling Transcoding |
| 指導教授: |
林銀議
Yinyi Lin |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 通訊工程學系 Department of Communication Engineering |
| 畢業學年度: | 98 |
| 語文別: | 中文 |
| 論文頁數: | 104 |
| 中文關鍵詞: | 零區塊決策 、離散餘弦轉換 、模式決策 、移動估測 、預測移動向量 、H.264/AVC 、視訊轉換編碼 、降解析度 |
| 外文關鍵詞: | zero-block decision, predictive motion vector, discrete cosine transform, video transcoding, H.264/AVC, downsizing(downscaling), motion estimation, mode decision |
| 相關次數: | 點閱:21 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
由於目前使用者網路頻寬不同的限制,要將廣播視訊傳送到不同解析度的用戶端設備是一項重要議題,而降解析度轉換編碼是一種將已編碼視訊串流的位元率或畫面大小經過轉換編碼後更能適應傳輸網路的限制與不同設備之解析度的技術,並可以有效提供這方面的應用。由於H.264中加入很多了高複雜度的預測和壓縮技術,使其可以提供比以往的視訊壓縮標準更佳的壓縮效能,但同時也因此讓H.264編碼端複雜度過高,所以將H.264的視訊串流解碼後再進完整編碼(被稱為複雜型串接式像素域轉換編碼)是相當耗時的,若要將之運用於即時之轉換編碼系統是有難度的。由於直接利用原始解碼端之預測資訊進行再編碼,雖然能使轉換在編碼時間大幅下降,但其編碼效能並不好。在本篇論文,即是針對H.264轉換編碼架構中,探討如何在降低解析度之應用下以最快的速度進行轉換編碼,所以我們提出了降解析度轉換編碼中快速區塊模式決策與移動再估測演算法,能在維持一定的畫面品質效能下,有效的加速轉換編碼時間。
Through the network limitation of client devices, it is a challenging matter to transmit broadcasting videos to different resolutions terminal devices. Down-scaling video transcoding is a technique for adapting the bit rate or frame size of the encoded video to suit better the constraints of transmission network and target devices, then it can offer effient solutions for this type of application. H.264 supports many advanced compression techniques that can achieve better coding performance than the previous standards. However, the compression techniques require high computational complexity, so if we fully decoding and re-encoding the video bitstream in H.264 process, referred to as complex cascaded pixel domain transcoding (CCPDT), is quite involved and time consuming, which would be difficult to achieve real time implementation.
Because of direct reusing information existing in the original incoming video stream can reduces computational complexity, but the coding efficiency is not good. In this thesis, we will explore how to perform H.264 downscaling transcoding as fast as possible, so we propose fast mode decision and motion re-estimation algorithms for downscaling transcoding. The experimental results show that our propose transcoding algorithms can maintain the coding performance and speed up transcoding process simultaneously.
【1】“Generic coding of moving pictures and associated audio information,”ISO/IEC 13818-2: Video (MPEG-2), May 1996.
【2】“Video coding for low bit rate communication, version 1,”ITU-T recommendation H.263, 1995.
【3】“Coding of audio-visual objects - Part 2: Visual,” in ISO/IEC 14496-2(MPEG-4 Visual Version 1), Apr. 1999.
【4】“Draft ITU-T recommendation and final draft international standard of joint video specification (ITU-T Rec. H.264 | ISO/IEC 144496-10 AVC),” Joint Video Team of ISO/IEC and ITU-T, March 2003.
【5】“Advanced video coding for generic audiovisual services”(ITU-T Rec. H.264| ISO/IEC 144496-10 AVC), Joint Video Team of ISO/IEC and ITU-T, March 2005.
【6】T. Wiegand, G. J. Sullivan, G. Bjontegaard, and A. Luthra, “Overview of the H.264/AVC video coding standard,”IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, pp. 560-576, July 2003.
【7】M. Ravas, M. Mattavelli and C. Clerc, “A computational complexity comparison of MPEG4 and JVT codecs,” Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG, Austria, 22-26 July, 2002.
【8】A. T. Connie, P. Nasiopoulos, V. C. M. Leung and Y. P. Fallah, “Video packetization techniques for enhancing H.264 video transmission over 3G networks,” Proceeding of IEEE Consumer Communications and Networking Conference, Jan. 2008, pp. 800-804.
【9】J. Youn, M.T. Sun and J. Xin, “Video transcoder architectures for bit rate scaling of H.263 bit streams,” Proceeding of the seventh ACM international conference on Multimedia, Nov. 1999, pp. 243-250.
【10】C. D. Wu and Y. Lin, “Efficient inter/intra mode decision for H.264/ AVC inter frame transcoding,” Proceeding of IEEE International Conference on Image Processing, Nov. 2009, pp. 3697-3700.
【11】M. J. Chen, M. C. Chu and C. W. Pan, “Efficient motion estimation algorithm for reduced frame rate video transcoder,” IEEE Transactions on Circuits and System for Video Technology, vol. 12, pp. 269-275, Apr. 2002.
【12】J. Xin, M. T. Sun, B. S. Choi and K. W. Chun, “An HDTV to SDTV Spatial transcoder,” IEEE Transactions on Circuits and System for Video Technology, vol. 12, no. 11, pp. 998-1008, Nov. 2002.
【13】G. Fernandez-Escribano, H. Kalva, P. Cuenca, L. Orozco-Barbosa and A. Garrido, “A fast MB mode decision algorithm for MPEG-2 to H.264 P-frame transcoding,” IEEE Transactions on Circuits and System for Video Technology, vol. 18, no. 2, pp. 172-185, Feb. 2008.
【14】A. Vectro, C. Christopoulos and H. Sun, “Video transcoding archi- tectures and techniques: An overview,” IEEE Signal Processing Magazine, vol. 20, pp. 18-29, March 2003.
【15】J. Xin, C. W. Lin and M. T. Sun, “Digital Video Transcoding,” Proceedings of the IEEE, vol. 93, no. 1, Jan. 2005, pp. 84-97.
【16】K. T. Fung and W. C. Siu , “Low Complexity H.263 to H.264 Video Transcoding Using Motion Vector Decomposition,” Proceeding of IEEE International Symposium on Circuits and Systems, vol. 2, May 2005, pp. 908-911.
【17】S. F. Chang and D. G. Messerschmitt, “Manipulation and Compositing of MC-DCT Compressed Video,” IEEE Journal on Selected Areas in Communications, vol. 13, no. 1, pp. 1–11, Jan. 1995.
【18】H. Shen, X. Sun, F. Wu, H. Li and S. Li, “A fast downsizing video transcoder for H.264/AVC with rate-distortion optimal mode decision,” Proceeding of IEEE International Conference on Multimedia and Expo, July 2006, pp.2017-2020.
【19】M. von dem Knesebeck and P. Nasiopoulos, “A fast mode decision algorithm for downscaled transcoding of H.264 preencoded video,” Proceeding of Digest of Technical Papers International Conference on Consumer Electronics, Jan. 2010, pp. 87-88.
【20】P. Zhang, Y. Lu, Q. Huang and W. Gao, “Mode mapping method for H.264/AVC spatial downscaling transcoding,” Proceeding of IEEE International Conference on Image Processing, vol. 4, Oct. 2004, pp. 2781-2784.
【21】C. T. Lin, W. C. Teng, and Y. Lin, “Efficient inter/intra mode decision for H.264/AVC video downscaling transcoding,” to be presented in PCM 2010, Shanghai, China, Sep. 2010.
【22】Y. Lee and Y. Lin, “Zero-block mode decision algorithm for H.264/ AVC,” IEEE Transactions on Image Processing, vol. 18, no. 3, pp. 524-533, Mar. 2009.
【23】K. T. Fung and W. C. Siu, “DCT-based video downscaling transcoder using split and merge technique,” IEEE Transactions on Image Processing, vol. 15, no. 2, pp. 394-403, Feb. 2006.
【24】J. Wang, E. H. Yang and X. Yu, “An efficient motion estimation method for H.264-based video transcoding with spatial resolution conversion,” Proceeding of IEEE International Conference on Multimedia and Exp, July 2007, pp. 444-447.
【25】 B. Shen, I. K. Sethi and B. Vasudev, “Adaptive motion-vector resamling for compressed video downscaling,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, no. 6, pp. 929-936, Sept. 1999.
【26】Y. P. Tan and H. Sun, “Fast motion re-estimation for arbitrary down- sizing video transcoding using H.264/AVC standard,” IEEE Transactions on Consumer Electronics, vol. 50, No. 3, pp. 887-894, Aug. 2004.
【27】 S. H. Yang, B. Y. Chen and K. H. Wang, “H.264 fast inter-mode selection based on coded block patterns,” IEICE Transactions on Information and Systems, vol. E92-D, no.6, pp. 1324-1327, June 2009.
【28】Z. Xuan, Y. Zhenghua and Y. Songyu, “Method for detecting all-zero DCT coefficients ahead of discrete cosine transformation and quantization,” Electronics Letters, vol. 34, no. 19, pp. 1839-1840, Sep. 1998.
【29】 W. Y. Chiu, Y. M. Lee and Y. Lin, “Efficient zero-block mode decision algorithm for high bit-rate coding in H.264/AVC,” Proceeding of IEEE International Symposium on Circuits and Systems, May 2010, pp. 4189-4192.
【30】I. Richardson, H.264 and MPEG-4 video compression, Wiley Press, Dec. 2003.
【31】K. C. Hou, M. J. Chen and C. T. Hsu, “Fast motion estimation by motion vector merging procedure for H.264,” Proceeding of IEEE International Conference on Multimedia & Expo, July 2005, pp. 1444-1447.
【32】K. L. Chung and L. C. Chang, “A new predictive search area approach for fast block motion estimation,” IEEE Transactions on Image Processing, vol. 12, no. 6, pp. 648-652, June 2003.
【33】Joint Video Team software JM12.2 http://iphome.hhi.de/suehring/tml/download/