| 研究生: |
吳靖宇 Ching-Yu Wu |
|---|---|
| 論文名稱: |
基於內容適應式模型之H.264/AVC快速演算法與位元率控制 Model-Based Fast Algorithm and Rate Control for H.264/AVC |
| 指導教授: |
蘇柏齊
Po-Chyi Su |
| 口試委員: | |
| 學位類別: |
博士 Doctor |
| 系所名稱: |
資訊電機學院 - 資訊工程學系 Department of Computer Science & Information Engineering |
| 論文出版年: | 2014 |
| 畢業學年度: | 102 |
| 語文別: | 英文 |
| 論文頁數: | 129 |
| 中文關鍵詞: | H.264/AVC 、畫面內預測 、位元率控制 、品質控制 、失真-量化 、位元率-量化 、感興趣區域 |
| 外文關鍵詞: | H.264/AVC, Intra Prediction, Rate Control, Quality Control, Distortion-Quantization, Rate-Quantization, Region of Interest |
| 相關次數: | 點閱:22 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
H.264/AVC影片編碼已經成為目前最常被使用的編碼標準,許多研究也因此致力於追求更高效率的影片編碼,以及維持良好的位元率-失真表現。在本篇論文中,我們研發了許多內容適應性的模型來加快H.264/AVC的編碼速度,以及達到更好的位元率和品質控制。
本論文由三個部分組成。首先,我們提出了一個畫面內預測的快速演算法。區塊邊緣的重建像素以及區塊內的內容,會先經由投影的方式產生兩組向量,再進一步計算出預測區塊冗餘。此一特徵值能夠有效地先行刪除一些較可能被位元-失真最佳化所刪除的預測模式,來提升編碼速度。根據預測區塊冗餘以及其他在編碼過程中擷取的資訊,本研究提出了更進一步的跳躍方法來跳過某些編碼模式和區塊種類,使得畫面內編碼能進一步加速。
再者,增加了畫面內編碼的效率之後,我們探討畫面內編碼的位元-量化關係。如何適當地分配量化參數給I-畫面,對於影片編碼來說相當重要。我們提出了一個內容適應性的位元-量化模型,來預測I-畫面的的位元使用量。藉由分析量化參數以及區塊複雜度之間的關係,在目標位元率決定之後,決定一個適合的量化參數來進行編碼。由於提出的模型是建構在巨區塊層,感興趣區域可以藉此使用較多的位元、以及較低的量化參數來編碼,進而達到提升人眼的視覺品質。
最後,我們藉由估測失真與量化參數間的關係,進一步探討位元率控制以及品質控管問題。我們提出了一個內容適應性的失真-量化模型,來預測畫面或區塊的失真程度。和之前的位元-量化模型類似,該模型只有一個可使用巨區塊內容調整的參數,並且能在每張畫面被編碼之前,就先行預測該畫面的失真程度。在由訊噪比所定義的畫面品質被設定之後,該模型能夠幫助計算出適合的量化參數。藉由此模型,我們進一步探討兩個恆定畫質的影片編碼應用,希望能幫助相關專業應用,例如影片的儲存以及編輯,達到更好的品質與效果。
H.264/AVC has become the most frequently used video codec nowadays. A lot of efforts have been made to pursue highly efficient video coding and to maintain good rate-distortion performances of video compression. In this dissertation, several content-adaptive models are developed to increase the speed of video encoding and to achieve better rate/quality control in H.264/AVC. The dissertation consists of three major parts. First, an efficient intra-prediction mode decision mechanism is presented. A projection-based approach, which utilizes the reconstructed surrounding pixels and block content to compute the predicted block residuals (PBR), can effectively eliminate the less probable modes from the computation of Rate Distortion Optimization. According to the PBR and coding information acquired during the encoding process, some prediction modes and macroblock types can be further skipped to accelerate the intra coding. Then, after considering the efficiency of intra coding, we research the issue of Rate-Quantization (R-Q) in the intra coding of H.264/AVC. Assigning an appropriate Quantization Parameter (QP) to the intra-coded frames is very important to the video coding. A content-adaptive R-Q model is thus presented to predict the bit usage of intra-coded frames. The relationship between the QP of a macroblock and the block complexity is derived so that a suitable QP can be determined under a target bit-rate. Since the proposed model is built on macroblocks, Region of Interest (ROI) coding can also be achieved. By adjusting the QP value at the macroblock level, more bits can be assigned to the ROI to better preserve its perceptual quality. Finally, we tackle the problem of rate/quality control for regular video encoding by estimating the resultant quality or distortion associated with QP. A Distortion-Quantization (D-Q) model is proposed to predict the distortion level, which is defined as the difference between the original video frame and the decoded one in the sum of squared errors. As in the R-Q model, the proposed D-Q model also has only one adjustable parameter related to the macroblock content and provides a mapping between QP and the corresponding distortion before the exact encoding process. Given a targeted frame quality measured in peak signal to noise ratio (PSNR), this model helps to assign a suitable QP value to each frame. Two applications are then considered, i.e., the single-pass constant frame PSNR coding and the two-pass coding with the additional bitrate or storage constraint, both of which can facilitate such applications of video archiving and editing.
[1] T. Wiegand, G. Sullivan, G. Bjntegaard, and A. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, no. 7, pp. 560–576, Sep. 2003.
[2] A. Kondoz, Visual Media Coding and Transmission. Wiley Publishing, 2010.
[3] C.-Y. Wu and P.-C. Su, “Fast intra-coding for H.264/AVC by using projection-based predicted block residuals,” IEEE Transactions on Multimedia, vol. 15, no. 5, pp. 1083 – 1093, Aug. 2013.
[4] C.-Y. Wu, P.-C. Su, and H.-C. Hsu, “A joint content adaptive rate-quantization model and region of interest intra coding of H.264/AVC,” in IEEE International Conference on Multimedia and Expo (ICME 2014), July 2014.
[5] C.-Y.Wu and P.-C. Su, “A content-adaptive distortion-quantization model for H.264/AVC and its applications,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 24, no. 1, pp. 113 – 126, Jan. 2014.
[6] ——, “A content-adaptive distortion-quantization model for intra coding in H.264/AVC,” in The 20th IEEE international Conference on Computer Communication Networks (ICCCN 2011), July 2011, pp. 1 – 6.
[7] C.-Y. Wu, P.-C. Su, L.-W. Huang, and C.-Y. Chiou, “Constant frame quality control for H.264/AVC,” APSIPA Transactions on Signal and Information Processing, vol. 2.
[8] F. Pan, X. Lin, S. Rahardja, K. P. Lim, Z. G. Li, D. Wu, and S. Wu, “Fast mode decision algorithm for intraprediction in H.264/AVC video coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 15, no. 7, pp. 813–822, July 2005.
[9] J.-C. Wang, J.-F. Wang, J.-F. Yang, and J.-T. Chen, “A fast mode decision algorithm and its vlsi design for H.264/AVC intra-prediction,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 10, pp. 1414–1422, Oct. 2007.
[10] K. Bharanitharan, B.-D. Liu, J.-F. Yang, and W.-C. Tsai, “A low complexity detection of discrete cross differences for fast H.264/AVC intra prediction,” IEEE Transactions on Multimedia, vol. 10, no. 7, pp. 1250–1260, November 2008.
[11] A.-C. Tsai, A. Paul, J.-C. Wang, and J.-F. Wang, “Intensity gradient technique for efficient intra-prediction in H.264/AVC,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 18, no. 5, pp. 694–698, May 2008.
[12] Z. Wei, K. N. Ngan, and H. Li, “An efficient intra-mode selection algorithm for H.264 based on edge classification and rate-distortion estimation,” Signal Processing: Image Communication, vol. 23, no. 9, pp. 699 – 710, Oct. 2008.
[13] S. Milani, “Fast H.264/AVC FRExt intra coding using belief propagation,” IEEE Transactions on Image Processing, vol. 20, no. 1, pp. 121–131, Jan. 2011.
[14] Y.-H. Huang, T.-S. Ou, and H. H. Chen, “Fast decision of block size, prediction mode, and intra block for H.264 intra prediction,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no. 8, pp. 1122–1132, August 2010.
[15] H. Zeng, K.-K. Ma, and C. Cai, “Hierarchical intra mode decision for H.264/AVC,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no. 6, pp. 907–912, June 2010.
[16] Y.-M. Lee, Y.-T. Sun, and Y. Lin, “SATD-based intra mode decision for H.264/AVC video coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no. 3, pp. 463–469, March 2010.
[17] C.-H. Tseng, H.-M. Wang, and J.-F. Yang, “Enhanced intra-44 mode decision for H.264/AVC coders,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 16, no. 8, pp. 1027–1032, August 2006.
[18] G. J. Sullivan. and T. Wiegand, “Rate-distortion optimization for video compression,” IEEE Signal Processing Magazine, vol. 15, no. 6, pp. 74–90, Nov 1998.
[19] Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG, “H.264/AVC reference software Joint Model 15.1.”
[20] G. J. Sullivan, P. Topiwala, and A. Luthra, “The H.264/AVC advanced video coding standard: Overview and introduction to the fidelity range extensions,” in SPIE Conference on Applications of Digital Image Processing XXVII, August 2004.
[21] “Corel stock photo library 3,” Corel Corporation, Ontario, Canada.
[22] A. C. Cameron and F. A. G.Windmeijer, “An R-squared measure of goodness of fit for some common nonlinear regression models,” Journal of Econometrics, vol. 77, no. 2, pp. 329–342, April 1997.
[23] T.-K. Tan, G. Sullivan, and T. Wedi, “Recommended simulation common conditions for coding efficiency experiments: Revision 1,” in ITU-T Video Coding Experts Group (VCEG) Meeting, document VCEG-AE010r1, Jan. 2007.
[24] G. Bjontegaard, “Calculation of average PSNR differences between RDcurves,” in ITU-T VCEG Meeting, document VCEG-M33, March 2001.
[25] G. J. Sullivan, H. Yu, S.-I. Sekiguchi, H. Sun, T. Wedi, S. Wittmann, Y.-L. Lee, A. Segall, and T. Suzuki, “New standardized extensions of MPEG4-AVC/H.264 for professional-quality video applications,” in IEEE International Conference on Image Processing, vol. 1, 2007, pp. I–13 – I–16.
[26] Panasonic Broadcast, “AVC-Intra (H.264 Intra) Compression,” ftp://ftp.panasonic.com/pub/Panasonic/Drivers/PBTS/papers/WP_AVC-Intra.pdf, Sep. 2007, [Online; accessed 15-February-2014].
[27] Y. G. Lee and B. C. Song, “An intra-frame rate control algorithm for ultralow delay H.264/Advanced Video Coding (AVC),” IEEE Transactions on Circuits and Systems for Video Technology, vol. 19, no. 5, pp. 747–752, May 2009.
[28] J. Li and E. Abdel-Raheem, “Efficient rate control for H.264/AVC intra frame,” IEEE Transactions on Consumer Electronics, vol. 56, no. 2, pp. 1043–1048, May 2010.
[29] Y. Chang and M. Kim, “Hybrid codec-based intra-frame joint rate control for stereoscopic video,” IEEE Signal Processing Letters, vol. 18, no. 10, pp. 543–546, 2011.
[30] L. Tiana, Y. Zhoua, and Y. Sun, “Novel rate control scheme for intra frame video coding with exponential rate-distortion model on H.264/AVC,” Journal of Visual Communication and Image Representation, vol. 23, no. 6, pp.
873–882, Aug. 2012.
[31] P.-C. Su and C.-Y. Wu, “A joint watermarking and ROI coding scheme for annotating traffic surveillance videos,” EURASIP Journal on Advances in Signal Processing, vol. 2010, no. 1, p. 658328, 2010.
[32] W.-J. Tsai and T.-L. Chou, “Scene change aware intra-frame rate control for H.264/AVC,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no. 12, pp. 1882 – 1886, Dec. 2010.
[33] X. Jing, L.-P. Chau, and W.-C. Siu, “Frame complexity-based rate-quantization model for H.264/AVC intraframe rate control,” IEEE Signal Processing Letters, vol. 15, pp. 373–376, 2008.
[34] L. Itti, C. Koch, and E. Niebur, “A model of saliency-based visual attention for rapid scene analysis,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, no. 11, pp. 1254–1259, Nov. 1998.
[35] Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG, “H.264/AVC reference software Joint Model 17.1.”
[36] Z. Chen and K. N. Ngan, “Recent advances in rate control for video coding,” Signal Processing: Image Communication, vol. 22, no. 1, pp. 19–38, Jan. 2007.
[37] N. Cherniavsky, G. Shavit, M. F. Ringenburg, R. E. Ladner, and E. A. Riskin, “Multistage: A MINMAX bit allocation algorithm for video coders,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 1, pp. 59–67, Jan. 2007.
[38] M. Tagliasacchi, G. Valenzise, and S. Tubaro, “Minimum variance optimal rate allocation for multiplexed H.264/AVC bitstreams,” IEEE Transactions on Image Processing, vol. 17, no. 7, pp. 1129–1143, July 2008.
[39] A. Ninassi, O. L. Meur, and P. L. C. D. Barba, “Considering temporal variations of spatial visual distortions in video quality assessment,” IEEE Journal of Selected Topics in Signal Processing, vol. 3, no. 2, pp. 253–265, April 2009.
[40] H.Wu and K. Rao, Eds., Digital Video Image Quality and Perceptual Coding. CRC Press, 2006.
[41] A. B. Watson, Ed., Digital Images and Human Vision. MIT Press, 1993.
[42] M. Kunt and C. van den Branden Lambrecht, Eds., Special Issue on Image and Video Quality Metrics, Signal Processing, vol. 70, no. 3, Nov. 1998.
[43] G.-M. Muntean, G. Ghinea, P. Frossard, M. Etoh, F. Speranza, and H. R. Wu, Eds., Special Issue: Quality Issues on Mobile Multimedia Broadcasting, IEEE Transactions on Broadcasting, vol. 54, no. 3, part II, Sep. 2008.
[44] L. J. Karam, T. Ebrahimi, S. S. Hemami, T. N. Pappas, R. J. Safranek, Z. Wang, and A. B. Watson, Eds., Special Issue on Visual Media Quality Assessment, IEEE Journal of Selected Topics in Signal Processing, vol. 3, no. 2, April 2009.
[45] W. Lin, T. Ebrahimi, P. C. Loizou, S. Moller, and A. R. Reibman, Eds., Special Issue on New Subjective and Objective Methodologies for Audio and Visual Signal Processing, IEEE Journal of Selected Topics in Signal Processing, vol. 6, no. 6, Oct. 2012.
[46] Q. Huynh-Thu and M. Ghanbari, “The accuracy of PSNR in predicting video quality for different video scenes and frame rates,” Telecommunication Systems, vol. 49, no. 1, pp. 35–48, Jan. 2012.
[47] S. Ma, W. Gao, and Y. Lu, “Rate-distortion analysis for H.264/AVC video coding and its application to rate control,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 15, no. 12, pp. 1533–1544, Dec. 2005.
[48] H. Wang and S. Kwong, “Rate-distortion optimization of rate control for H.264 with adaptive initial quantization parameter determination,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 18, no. 1, pp. 140–144, Jan. 2008.
[49] B. Yan and M. Wang, “Adaptive distortion-based intra-rate estimation for H.264/AVC rate control,” IEEE Signal Processing Letters, vol. 16, no. 3, pp. 145–148, Mar. 2009.
[50] N. Kamaci, Y. Altunbasak, and R. M. Mersereau, “Frame bit allocation for the H.264/AVC video coder via Cauchy-density-based rate and distortion models,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 15, no. 8, pp. 994–1006, August 2005.
[51] L. Guo, O. C. Au, M. Ma, Z. Liang, and P. H. W. Wong, “A novel analytic quantization-distortion model for hybrid video coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 19, no. 5, pp. 627–641, May 2009.
[52] H.-M. Hu, B. Li, W. Lin, W. Li, and M.-T. Sun, “Region-based rate control for H.264/AVC for low bit-rate applications,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 22, no. 11, pp. 1564–1576, Nov. 2012.
[53] Y. H. Tan, C. Yeo, and Z. Li, “Single-pass rate control with texture and nontexture rate-distortion models,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 22, no. 8, pp. 1236–1245, Aug. 2012.
[54] F. D. Vito and J. C. D. Martin, “PSNR control for GOP-level constant quality in H.264 video coding,” in Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, Dec. 2005,
pp. 612 –617.
[55] T. Ozcelebi, F. D. Vito, A. M. Tekalp, M. R. Civanlar, M. O. Sunay, and J. C. D. Martin, “An analysis of constant bitrate and constant PSNR video encoding for wireless networks,” in IEEE International Conference on Communications, vol. 11, June 2006, pp. 5301 –5306.
[56] K.-L. Huang and H.-M. Hang, “Consistent picture quality control strategy for dependent video coding,” IEEE Transactions on Image Processing, vol. 18, no. 5, pp. 1004–1014, May 2009.
[57] D. Zhang, K. N. Ngan, and Z. Chen, “A two-pass rate control algorithm for H.264/AVC high definition video coding,” Signal Processing:Image Communication, vol. 24, no. 5, pp. 357–367, May 2009.
[58] B. Han and B. Zhou, “VBR rate control for perceptually consistent video quality,” IEEE Transactions on Consumer Electronics, vol. 54, no. 4, pp. 1912–1919, Nov. 2008.
[59] J. Huang, J. Sun, and W. Gao, “A novel two-pass VBR coding algorithm for the H.264/AVC video coder based on a new analytical R-D model,” in Proceedings of Picture Coding Symposium, 2007.
[60] J. Sun, Y. Duan, J. Li, J. Liu, and Z. Guo, “Rate-distortion analysis of deadzone plus uniform threshold scalar quantization and its application-part II: Two-pass VBR coding for H.264/AVC,” IEEE Transactions on Image Processing, vol. 22, no. 1, pp. 215–228, Jan. 2013.
[61] X. M. Zhang, A. Vetro, Y. Q. Shi, and H. Sun, “Constant quality constrained rate allocation for FGS-coded video,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, no. 2, pp. 121–130, Feb. 2003.
[62] M. Dai, D. Loguinov, and H. M. Radha, “Rate-distortion analysis and quality control in scalable internet streaming,” IEEE Transactions on Multimedia, vol. 8, no. 6, pp. 1135–1146, Dec. 2006.
[63] C.-W. Seo, J.-K. Han, and T. Q. Nguyen, “Rate control scheme for consistent video quality in scalable video codec,” IEEE Transactions on Image Processing, vol. 20, no. 8, pp. 2166–2176, Aug. 2011.
[64] N. Kamaci and G. Al-Regib, “Impact of video parameters on the DCT coefficient distribution for H.264-like video coders,” in SPIE Procceding, Visual Information Processing and Communication III, vol. 8305.
[65] D.-K. Kwon, M.-Y. Shen, and C.-C. J. Kuo, “Rate control for H.264 video with enhanced rate and distortion models,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 5, pp. 517–529, May 2007.
[66] K. Sayood, Introduction to Data Compression. San Francisco, CA: Morgan Kaufmann Publishers, 2006.
[67] J. Lee, I. Shin, and H. Park, “Adaptive intra-frame assignment and bit-rate estimation for variable GOP length in H.264,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 16, no. 10, pp. 1271–1279, Oct. 2006.
[68] S.-C. Hsia and S.-H. Wang, “Adaptive video coding control for real-time H.264/AVC encoder,” Journal of Visual Communication and Image Representation, vol. 20, no. 7, pp. 463–477, Oct. 2009.
[69] K. Chang, B. Yang, and J. Luo, “A novel rate control scheme for H.264/AVC based on scene change,” in IEEE/ACIS International Conference on Computer and Information Science, Los Alamitos, CA, USA, 2009, pp. 1085–1089.
[70] A. E. Mohr, “Bit allocation in sub-linear time and the multiple-choice knapsack problem,” in Proceedings of Data Compression Conference, 2002, pp. 352–361.
[71] A. Ortega, “Variable bit-rate video coding,” in Compressed Video over Networks, M.-T. Sun and A. Reibman, Eds. Marcel Dekker, New York, 2000, pp. 343–382.
[72] T.-K. Tan, G. Sullivan, and T. Wedi, “Recommended simulation common conditions for coding efficiency experiments,” in ITU-T Video Coding Experts Group (VCEG) Meeting, document VCEG-AA010, Oct. 2005.
[73] G. J. Sullivan, J.-R. Ohm, W.-J. Han, and T. Wiegand, “Overview of the high efficiency video coding (HEVC) standard,” Circuits and Systems for Video Technology, IEEE Transactions on, vol. 22, no. 12, pp. 1649–1668, Dec 2012.
[74] T.-J. Liu, Y.-C. Lin, W. Lin, and C.-C. J. Kuo, “Visual quality assessment: recent developments, coding applications and future trends,” APSIPA Transactions on Signal and Information Processing, vol. 2.