| 研究生: |
李昱潔 Yu-chieh Li |
|---|---|
| 論文名稱: |
基於平穩主觀視覺之H.264時間可調性視訊編碼位元率分配機制 H.264/SVC Rate Allocation based on Graceful Degradation of Subjective Quality in Frame Rate Switching |
| 指導教授: |
張寶基
Pao-chi Chang |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 通訊工程學系 Department of Communication Engineering |
| 畢業學年度: | 98 |
| 語文別: | 中文 |
| 論文頁數: | 108 |
| 中文關鍵詞: | 主觀視覺 、時間可調性 、位元率分配 |
| 外文關鍵詞: | subjective quality, temporal scalability, rate allocation |
| 相關次數: | 點閱:21 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
H.264可調式視訊編碼是以 H.264/AVC 為基底來進行可調式視訊編碼,且提供三種可調適架構包含時間、空間與品質可調性,使系統不需經過重新轉碼即可因應不同使用者的軟/硬體及網路環境條件。其中時間層可調性架構可同時提供不同畫面播放率之視訊資料供使用者切換。傳統上使用H.264/SVC時間層上量化參數的建議設定,在切換畫面率的時後會造成不小的視覺品質差距,因此在實際的應用上,如何有效率的分配位元率給不同畫面播放率的時間層以達到縮減連續兩層之間的主觀視覺品質,是十分重要的議題。
本論文提出一套在網路資源有限的情況下,考量使用者主觀視覺感受的H.264可調式視訊編碼時間層位元率分配機制,本論文所提出的方法可以讓視訊在頻寬變動而需切換畫面播放率的情況下,降低不同畫面播放率主觀視訊品質之差質。實驗結果顯示,所提方法可有效率的分配位元率使各畫面播放率時間層達到較為接近的主觀視覺品質,且對於不同視訊在不同的頻寬限制下均表現良好。與傳統建議的量化參數設定相比,最多可使原本主觀視覺差異從4.03dB降至2.8dB。
H.264 scalable extension (SVC), which is constructed based on H.264/AVC, is a new scalable video coding standard. H.264/SVC incorporates temporal, spatial, and SNR scalability so that it has not only high compression efficiency but also the capability to be adapted to heterogeneous user/network environments. Temporal scalability that can support multiple display frame rates is an efficient tool to work with a wide range of bitrates. With the JVT (Joint Video Team) recommended QP setting for H.264/SVC temporal layers, a large subjective quality gap between different layers occurs in frame rate switching. Thus how to reduce the difference of subjective quality by efficient rate allocation among multiple temporal layers for a given total bitrate is an important issue.
This work proposes a rate allocation method for SVC temporal scalability based on perceptual quality metric. The proposed method gracefully lowers video quality in frame rate switching under the circumstance of bandwidth fluctuation. In simulations, several video sequences with various total rate constraints are experimented. The proposed method can efficiently allocate the rate for each temporal layer with closer subjective video quality when the bandwidth is insufficient. Compared with the JVT recommended method, the difference of subjective quality in frame rate switching is reduced from 4.03dB to 2.8dB.
[1] Advanced Video Coding for Generic Audiovisual Services, ITU-T Rec. H.264 and ISO/IEC 14496-10 (MPEG-4 AVC), ITU-T and ISO/IEC JTC 1, Version 1: May 2003, Version 2: May 2004, Version 3: Mar. 2005, Version 4: Sept. 2005, Version 5 and Version 6: June 2006, Version 7: Apr. 2007, Version 8 (including SVC extension): Consented in July 2007.
[2] X. Li, E.Q. Li and Y.K. Chen, “Fast Multi-frame motion Estima-tion Algorithm with Adaptive Search Strategies in H.264,” IEEE International Conference Acoustics, Speech, and Signal Processing (ICASSP ''04), VOL. 3, May 2004, pp. iii- 369-72.
[3] H. Schwarz, D. Marpe, and T. Wiegand, “Overview of the scalable video coding extension of the H.264/AVC Standard,” IEEE Transactions on Circuits and Systems for Video Technology, VOL. 17, NO. 9, pp. 1103–1120, Sep.2007.
[4] A. Segall and G. J. Sullivan, “Spatial Scalability Within the H.264/AVC Scalable Video Coding Extension,” IEEE Transactions on Circuits and Systems for Video Technology, VOL. 17, NO. 9, pp. 1121–1135, Sep. 2007.
[5] T.Wiegand, G. J. Sullivan, J. Reichel, H. Schwarz, and M.Wien, Joint Draft 11 of SVC Amendment, Joint Video Team, Doc. JVT-X201, Jul.2007.
[6] H. Schwarz, D. Marpe, and T. Wiegand, “Comparison of MCTF and Closed-loop Hierarchical-B Pictures,” JVT-P059, July, 2005.
[7] M. Wien, H. Schwarz, and T. Olbaum, “Performance Analysis of SVC,” IEEE Transactions on Circuits and Systems for Video Tech-nology, pp.1194~1203, VOL.17, NO.9, Sep. 2007.
[8] H. Schwarz, D. Marpe, and T. Wiegand, “Hierarchical B Pictures,” Poznan, Poland, Doc. JVT-P014, July 2005.
[9] R. Feghali, F. Speranza, D. Wang, and A. Vincent, “Video quality metric for bit rate control via joint adjustment of quantization and frame rate,” IEEE Trans. on Broadcasting, VOL. 53, NO. 1, pp. 441–446,Mar. 2007.
[10] ITU-R BT.500 “Methodology for the Subjective Assessment of the Quality for Television Pictures”, ITU-R Std., Rev. 11, June 2002.
[11] ITU-T Rec. P.910, “Subjective video quality assessment methods for multimedia applications, ” Sept. 1999.
[12] D. Wang, F. Speranza, A. Vincent, T. Martin, and P. Blanchfield, “Towards optimal rate control: a study of the impact of spatial resolution, frame rate, and quantization on subjective Video quality and bit rate,” in Proceedings of Visual Communications and Image Processing 2003, Lugano, Switzerland, July 8–11, 2003, pp. 198–209.
[13] C. S. Kim, D. Suh, T. M. Bae, Y. M. Ro,“ Quality Metric for H.264/AVC Scalable Video Coding with Full Scalability,” in Proc. SPIE, VOL. 6492, pp.64921P-1-64921P-12, San Jose, California USA, Jan. 28-Feb. 1, 2007.
[14] S. Winkler, “A perceptual distortion metric for digital color video”, in Proc. SPIE, VOL. 3644, May 1999, pp.175-184.
[15] J. Lubin, and D. Fibush, “Sarnoff JND vision model”, T1A1.5Working group Document, T1 Standards Committee, 1997.
[16] A.B. Watson, J. Hu, and J.F. McGowan III, “Digital video quality metric based on human vision”,Journal of Electronic imaging, VOL. 10, NO. 1, Jan 2001, pp. 20-29.
[17] Z. Wang, and A.C. Bovik, “A universal image quality index”, IEEE Signal Processing Letters, VOL. 9, NO. 3, Mar. 2002, pp. 81-84.
[18] L. Czuni, G. Csaszar, and A. Licsar, “Estimating the Optimal Quantization Parameter in H.264”, IEEE International Conference on Pattern Recognition(ICPR), Sep. 2006, pp. 330-333.
[19] Y. Cho, J. Liu, D.-K. Kwon and C.-C. J. Kuo,” H.264/SVC Tem-poral Bit Allocation With Dependent Distortion Model”, ICASSP 09, Taipei, Taiwan, April 2009, pp. 641 – 644.
[20] Y. F. Ou, T. Liu, Z. Zhao, Z. Ma and Y. Wang, “Modeling the im-pact of frame rate on perceptual quality of video,” IEEE ICIP 2008, San Diego, Oct. 2008, pp. 689–692.