| Graduate student: | 崔博翔 Po-Hsiang Tsui |
|---|---|
| Thesis title: | 以ResNet演算法應用於HEVC畫面內解碼端後處理 Post-Processing for HEVC Intra Prediction with ResNet algorithm |
| Advisor: | 林銀議 Yin-Yi Lin |
| Oral examination committee: | |
| Degree: | Master |
| Department: | College of Electrical Engineering and Computer Science - Department of Communication Engineering |
| Publication year: | 2022 |
| Academic year of graduation: | 110 |
| Language: | Chinese |
| Pages: | 122 |
| Chinese keywords: | HEVC, 畫面內預測, 影像後處理, 高斯遮罩, ResNet |
| English keywords: | HEVC, Intra Prediction, Image post-processing, Gaussian mask, ResNet |
With the rapid development of technology, people's lives have become inseparable from technological products, and expectations for image quality keep rising. As image resolution increases, however, so does the volume of data that must be transmitted.

To compress such images more effectively, HEVC (High Efficiency Video Coding) uses compression techniques that achieve roughly twice the compression ratio of the previous-generation standard. Encoding nevertheless introduces irreversible distortion into the image. The focus of this research is how to bring the distorted image as close as possible to the original while keeping processing time low.

In recent years, many studies have applied deep learning to HEVC to enhance image quality. This thesis proposes two post-processing methods for HEVC intra prediction. The first uses a Gaussian mask to provide additional information to the CNN model; compared with the HEVC reference software HM-16.0, it increases BDPSNR by 0.285 dB and reduces BDBR by 5.16%. The second adopts a ResNet architecture to improve model performance further, increasing Y-BDPSNR by 0.319 dB and reducing Y-BDBR by 5.79%.
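The gains above are reported as BDPSNR and BDBR, the Bjøntegaard delta PSNR and delta bitrate. The following is a minimal sketch of how such figures are commonly computed, not necessarily the exact tool used in the thesis. It assumes four rate/PSNR points per codec and fits a cubic through them in the log10-bitrate domain. The function names (`fit_cubic`, `integrate`, `bd_psnr`, `bd_rate`) and the sample numbers in the usage note are hypothetical.

```python
import math


def fit_cubic(xs, ys):
    """Return [c0, c1, c2, c3] of the cubic passing through four (x, y) points."""
    n = 4
    # Augmented Vandermonde matrix: each row is [1, x, x^2, x^3 | y].
    rows = [[x ** k for k in range(n)] + [y] for x, y in zip(xs, ys)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(rows[r][col]))
        rows[col], rows[pivot] = rows[pivot], rows[col]
        for r in range(col + 1, n):
            factor = rows[r][col] / rows[col][col]
            for c in range(col, n + 1):
                rows[r][c] -= factor * rows[col][c]
    # Back substitution.
    coeffs = [0.0] * n
    for i in reversed(range(n)):
        acc = rows[i][n] - sum(rows[i][j] * coeffs[j] for j in range(i + 1, n))
        coeffs[i] = acc / rows[i][i]
    return coeffs


def integrate(coeffs, lo, hi):
    """Definite integral of the polynomial with the given coefficients over [lo, hi]."""
    def antiderivative(x):
        return sum(c * x ** (k + 1) / (k + 1) for k, c in enumerate(coeffs))
    return antiderivative(hi) - antiderivative(lo)


def bd_psnr(rates_ref, psnr_ref, rates_test, psnr_test):
    """Average PSNR difference (dB) over the overlapping log-bitrate range."""
    lr_ref = [math.log10(r) for r in rates_ref]
    lr_test = [math.log10(r) for r in rates_test]
    lo = max(min(lr_ref), min(lr_test))
    hi = min(max(lr_ref), max(lr_test))
    c_ref = fit_cubic(lr_ref, psnr_ref)
    c_test = fit_cubic(lr_test, psnr_test)
    return (integrate(c_test, lo, hi) - integrate(c_ref, lo, hi)) / (hi - lo)


def bd_rate(rates_ref, psnr_ref, rates_test, psnr_test):
    """Average bitrate difference (%) over the overlapping PSNR range."""
    lr_ref = [math.log10(r) for r in rates_ref]
    lr_test = [math.log10(r) for r in rates_test]
    lo = max(min(psnr_ref), min(psnr_test))
    hi = min(max(psnr_ref), max(psnr_test))
    # Here the roles are swapped: log-bitrate is fitted as a function of PSNR.
    c_ref = fit_cubic(psnr_ref, lr_ref)
    c_test = fit_cubic(psnr_test, lr_test)
    avg_diff = (integrate(c_test, lo, hi) - integrate(c_ref, lo, hi)) / (hi - lo)
    return (10 ** avg_diff - 1) * 100
```

As a sanity check on made-up data: calling `bd_psnr([1000, 2000, 4000, 8000], [32.0, 35.0, 37.5, 39.5], [1000, 2000, 4000, 8000], [32.3, 35.3, 37.8, 39.8])` compares two curves that differ by a constant 0.3 dB at identical bitrates, so the result is 0.3 dB up to floating-point rounding. A positive BDPSNR and a negative BDBR both indicate that the test codec outperforms the reference.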