| 研究生: |
林學易 Hsueh-Yi Lin |
|---|---|
| 論文名稱: |
基於家庭多媒體環境之整合影像處理系統 Integrated Video Processing System for Household Multimedia Environment |
| 指導教授: |
蔡宗漢
Tsung-Han Tsai |
| 口試委員: | |
| 學位類別: |
博士 Doctor |
| 系所名稱: |
資訊電機學院 - 電機工程學系 Department of Electrical Engineering |
| 論文出版年: | 2012 |
| 畢業學年度: | 101 |
| 語文別: | 英文 |
| 論文頁數: | 90 |
| 中文關鍵詞: | 可延伸處理 、畫面更新率提昇技術 、基於粒子特性之方法 、混合式 、數位家庭環境 、單向動態估測 、雙向動態估測 |
| 外文關鍵詞: | Scalable video processing, Particle based approach, Digital home environment, JPEG 2000, M-coder, Bi-directional ME, Unidirectional ME |
| 相關次數: | 點閱:9 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
為了解決多媒體裝置間大量的連接線路,因此多種家用無線傳 輸系統被提出,如wireless HD或者WHDI等。然而現有無線傳輸 系統主要專注於點對點高速數據傳輸,而且關於傳輸距離部分也 有嚴格限制,因此在使用上會有需多限制。因此一個整合網路機 制、無線傳輸、以及多媒體壓縮之系統被提出,稱之為WHDVI- m。在此系統中有兩個重要模組,分別為畫面更新率提昇技術以及 可延伸編解碼。
首先在基於粒子特性之畫面更新率提昇技術中,此論文藉由多 階層動態估測以及基於粒子特性之軌跡校正來產生所需之內插影 像。另外除了利用軌跡與現有內插方塊之距離外,第一階段產生 之絕對誤差和與第二階段產生之雙向絕對誤差和亦聯合考量用以 最小化誤差。根據實驗結果,本提出之演算法比起現有演算法, 其畫質高出0.9 ̃3.9 dB,其中所包含之比較對象包含單項估測以及 雙向估測之演算法。此實驗結果證實使用單項動態估測所得到之 動態向量比起雙向估測更為精準。另外也證實基於軌跡之動態估 測也能有效的選擇正確動態。
接下來在混合型畫面更新率提昇技術(Hybrid FRUC)中,將動態 估測衡量標準做進一步的強化,用以加強原來基於例子特性之畫 面更新率提昇技術中之第二階段,也就是軌跡取得部份。另外, 原始演算法中重複運算利用記憶體存取的方式化簡,讓執行時間 更加快速。最後更在大尺寸畫面中使用較大的搜尋方塊。加大大 畫面之搜尋方塊不但畫質不會下降,且複雜度會急遽降低。由實 驗結果發現,其畫質比起現有方法高出0.3 dB以上,而且複雜度降低了29 ̃90%,可以證實此演算法之效率以及畫面提昇的效果。
最後在可延伸編解碼方面,提出一多訊源可延伸編解碼之邊解 碼核心,稱之為畫面及畫質可延伸之圖片影像邊解碼技術。在此演算法中,我們採用DWT以及H.264中CABAC之M-coder用以達 到可延伸編解碼需求,並讓其適用於硬體實現。另外並設計出三 個context的基於bit平面之串流輸出演算法。經過實驗結果發現,我們的速度快了三倍,而壓縮率卻只有低0.04,證實我們提出的方 法不但快速,而且以壓縮能力來說並不遜色於JPEG2000。
To solve huge amount of interconnection between multimedia de- vices, wireless digit home systems, such as wireless HD, WHDI, and UWB, are built to meet the digit home requirement. How- ever, traditional wireless system focus on point-to-point high-speed transmission, while the distance and the data rate is severely limited. Therefore, a unified wireless digit home environment is pro- posed, named wireless high definition video interface-mobility, multiple access, and multimedia (WHDVI-m). In this thesis, an integrated scalable video processing system is proposed, inspired by the WHDVI-m system.
First, visual-quality particle based frame rate up conversion algorithm is proposed (named particle based frame rate up conversion) by combining the hierarchical MV field construction stage and PMTC. Estimated MVs from the hierarchical MV field construction stage are memorized and passed to PMTC to retrieve true MVs. Beside distance minimization procedure, joint evaluation of cost-function prevents untrue MVs from. According to the experiment results, proposed PFRUC outperforms existing works by 0.9 ̃3.9 dB, including bi-directional and unidirectional approach. The experiment results prove the necessity of unidirectional motion. On the other hand, PMTC, inherited from the bi-directional FRUC concept, is efficient in differentiating the true MVs from untrue ones.
Second, new cost evaluation is formulated in hybrid frame rate up conversion to replacement original cost evaluation in particle based motion trajectory calibration (the second stage of PFRUC). Repeated operation in particle based frame rate up conversion is removed by initial motion vector assignment. Subsequent motion vector mapping is designed for three purposes, including noise minimization, search range expansion, and computation complexity reduction. According to the experiment results, the visual quality is enhanced by 0.3 dB, compared with state of the art unidirectional motion estimation based frame rate up conversion, while those process higher visual quality compared with bi-directional based approaches. On the other hand, the computation complexity of hybrid frame rate up conversion is less than existing works by over 29% and 90% in processing CIF and full-HD videos.
Finally, a new lossless compression codec is proposed, named multiple- source scalable video compressor. Overall system contains the dis- crete wavelet transform, quality driven bitplane sequencer, and bi- nary arithmetic coder with some specific considerations. In binary arithmetic coder, M-coder is adopted to enhance the hardware realization performance, while the compression is slightly enhanced. Besides, two-pass quality driven bitplane sequencer is proposed based on the distribution of zeros and ones in significance layer. According to the experiment results, the computation time of pro- posed scalable image/video compressor is one third of openJPEG (JEPG 2000) while the average compression ratio of the proposed work is only 0.04 behind.
[1] S.-J. Kang, K.-R. Cho, and Y. H. Kim, “Motion compensated frame rate up-conversion using extended bilateral motion estimation,” IEEE Transactions on Consumer Electronics, vol. 53, no. 4, pp. 1759 –1767, 2007.
[2] T.-H. Tsai and H.-Y. Lin, “High visual quality particle based frame rate up conversion with acceleration assisted motion trajectory cali- bration,” IEEE Journal of Display Technology, vol. 8, no. 6, pp. 341 –351, june 2012.
[3] T.-H. Tsai, H.-Y. Lin, and H.-G. Chen, “Overlapped block-based adap- tive bilateral motion estimation,” in Asia Pacific Conference on Cir- cuits and Systems, 2010, to be published.
[4] “Wireless HD homepage,” Website, January 2008, http://www.wirelesshd.org/.
[5] “WHDI homepage,” Website, http://www.whdi.org/.
[6] P. Haavisto, J. Juhola, and Y. Neuvo, “Fractional frame rate up- conversion using weighted median filters,” IEEE Transactions on Con- sumer Electronics, vol. 35, no. 3, pp. 272–278, Aug. 1989.
[7] L. Zong and N. Bourbakis, “Digital video and digital TV: a compar- ison and the future directions,” in 1999 International Conference on Information Intelligence and Systems, 1999, pp. 470–481.
[8]“Cathode ray tube–from Wikipedia,” Website, 2011. [Online]. Available: http://en.wikipedia.org/wiki/Cathode ray tube
[9] Y.-F. Chen, C.-C. Chen, and K.-H. Chen, “Mixed color sequential technique for reducing color breakup and motion blur effects,” Journal of Display Technology, vol. 3, no. 4, pp. 377 –385, dec. 2007.
[10] L.-Y. Liao, C.-W. Chen, and Y.-P. Huang, “Local blinking HDR LCD systems for fast MPRT with high brightness LCDs,” Journal of Display Technology, vol. 6, no. 5, pp. 178 –183, may 2010.
[11] SERIES H: AUDIOVISUAL AND MULTIMEDIA SYSTEMS Infras- tructure of audiovisual services - Coding of moving video: Video coding for low bit rate communication, ITU-T Recommendation H.263 Std.
[12] B.-D. Choi, J.-W. Han, C.-S. Kim, and S.-J. Ko, “Motion-compensated frame interpolation using bilateral motion estimation and adaptive overlapped block motion compensation,” IEEE Transactions on Cir- cuits and Systems for Video Technology, vol. 17, no. 4, pp. 407 –416, 2007.
[13] B.-T. Choi, S.-H. Lee, and S.-J. Ko, “New frame rate up-conversion using bi-directional motion estimation,” IEEE Transactions on Con- sumer Electronics, vol. 46, no. 3, pp. 603 –609, Aug. 2000.
[14] J. Zhai, K. Yu, J. Li, and S. Li, “A low complexity motion compensated frame interpolation method,” in 2005 IEEE International Symposium on Circuits and Systems (ISCAS’05), May 2005, pp. 4927 – 4930 Vol. 5.
[15] B.-W. Jeon, G.-I. Lee, S.-H. Lee, and R.-H. Park, “Coarse-to-fine frame interpolation for frame rate up-conversion using pyramid struc- ture,” IEEE Transactions on Consumer Electronics, vol. 49, no. 3, pp. 499 – 508, 2003.
[16] S.-J. Kang, D.-G. Yoo, S.-K. Lee, and Y. Kim, “Multiframe-based bi- lateral motion estimation with emphasis on stationary caption process- ing for frame rate up-conversion,” IEEE Transactions on Consumer Electronics, vol. 54, no. 4, pp. 1830–1838, 2008.
[17] A. Akin, M. Cetin, B. Erbagci, O. Karakaya, and I. Hamzaoglu, “An adaptive bilateral motion estimation algorithm and its hardware archi- tecture,” in Proc. 18th IEEE/IFIP VLSI System Chip Conf. (VLSI- SoC), 2010, pp. 207–212.
[18] D. Wang, L. Zhang, and A. Vincent, “Motion-compensated frame rate up-conversion—part I: Fast multi-frame motion estimation,” IEEE Transactions on Broadcasting, vol. 56, no. 2, pp. 133–141, 2010.
[19] D. Wang, A. Vincent, P. Blanchfield, and R. Klepko, “Motion- compensated frame rate up-conversion—part II: New algorithms for frame interpolation,” IEEE Transactions on Broadcasting, vol. 56, no. 2, pp. 142–149, 2010.
[20] S.-C. Tai, Y.-R. Chen, Z.-B. Huang, and C.-C. Wang, “A multi-pass true motion estimation scheme with motion vector propagation for frame rate up-conversion applications,” Journal of Display Technology, vol. 4, no. 2, pp. 188 –197, june 2008.
[21] H. Liu, R. Xiong, D. Zhao, S. Ma, and W. Gao, “Multiple hypothe- ses bayesian frame rate up-conversion by adaptive fusion of motion- compensated interpolations,” Circuits and Systems for Video Tech- nology, IEEE Transactions on, vol. 22, no. 8, pp. 1188 –1198, aug. 2012.
[22] N. Jacobson, Y.-L. Lee, V. Mahadevan, N. Vasconcelos, and T. Nguyen, “A novel approach to fruc using discriminant saliency and frame segmentation,” Image Processing, IEEE Transactions on, vol. 19, no. 11, pp. 2924 –2934, nov. 2010.
[23] T.-S. Wang, K.-S. Choi, H.-S. Jang, A. Morales, and S.-J. Ko, “En- hanced frame rate up-conversion method for uhd video,” Consumer Electronics, IEEE Transactions on, vol. 56, no. 2, pp. 1108 –1114, may 2010.
[24] H.-S. Oh and H.-K. Lee, “Block-matching algorithm based on an adaptive reduction of the search area for motion estimation,” Real-Time Imaging, vol. 6, no. 5, pp. 407 – 414, 2000. [Online]. Available: http://www.sciencedirect.com/science/article/ B6WPR-45BCPGY-F/2/375fce36233fd84b1414b0544a602a35
[25] J. Vanne, E. Aho, T. Hamalainen, and K. Kuusilinna, “A high- performance sum of absolute difference implementation for motion estimation,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 16, no. 7, pp. 876 –883, 2006.
[26] JPEG 2000 Part I Final Committee Draft Version 1.0, ISO/IEC Std., March 2000.
[27] “OpenJPEG version 1.4 — an open-source JPEG 2000 codec,” Web- site, January 2011, http://www.openjpeg.org/.
[28] W. Pearlman, A. Islam, N. Nagaraj, and A. Said, “Efficient, low- complexity image coding with a set-partitioning embedded block coder,” Circuits and Systems for Video Technology, IEEE Transac- tions on, vol. 14, no. 11, pp. 1219 – 1235, nov. 2004.
[29] A. Said and W. Pearlman, “A new, fast, and efficient image codec based on set partitioning in hierarchical trees,” Circuits and Systems for Video Technology, IEEE Transactions on, vol. 6, no. 3, pp. 243 –250, jun 1996.
[30] S.-T. Hsiang, “Embedded image coding using zeroblocks of sub- band/wavelet coefficients and context modeling,” in Data Compression Conference, 2001. Proceedings. DCC 2001., 2001, pp. 83 –92.
[31] T.-H. Tsai, Y.-H. Lee, and Y.-Y. Lee, “Design and analysis of high- throughput lossless image compression engine using VLSI-oriented FELICS algorithm,” Very Large Scale Integration (VLSI) Systems, IEEE Transactions on, vol. 18, no. 1, pp. 39 –52, jan. 2010.
[32] R. Castagno, P. Haavisto, and G. Ramponi, “A method for motion adaptive frame rate up-conversion,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 6, no. 5, pp. 436–446, Oct. 1996.
[33] J. R. Bergen, P. Anandan, T. J. Hanna, and R. Hingorani, “Hierarchical model-based motion estimation,” in Computer Vision— ECCV’92, ser. Lecture Notes in Computer Science, G. Sandini, Ed. Springer-Verlag, 1992, vol. 588, pp. 237–252. [Online]. Available: http://dx.doi.org/10.1007/3-540-55426-2 27
[34] “Color space–from wikipedia,” Website, 2011. [Online]. Available:
http://en.wikipedia.org/wiki/Color space
[35] T.-H. Tsai, H.-G. Chen, and H.-Y. Lin, “Frame rate up-conversion using adaptive bilateral motion estimation,” in WSEAS International Conference on Multimedia Systems and Signal Processing, 2009. [On- line]. Available: http://www.wseas.us/e-library/conferences/2009/ hangzhou/ACACOS/ACACOS31.pdf
[36] S.-J. Kang, D.-G. Yoo, S.-K. Lee, and Y. H. Kim, “Design and imple- mentation of median filter based adaptive motion vector smoothing for motion compensated frame rate up-conversion,” in IEEE 13th Inter- national Symposium on Consumer Electronics (ISCE ’09), May 2009, pp. 745 –748.
[37] SERIES H: AUDIOVISUAL AND MULTIMEDIA SYSTEMS Infras- tructure of audiovisual services - Coding of moving video: Advanced video coding for generic audiovisual services, ITU-T Recommendation H.264 Std.
[38] “Golomb coding,” Website, http://en.wikipedia.org/wiki/Golomb coding.