| Graduate student: | 賴學翰 Hsueh-Han Lai |
|---|---|
| Thesis title: | 應用於車內視訊之光線適應性視訊壓縮編碼器設計 An Illumination Adaptive Video Coding Scheme for In-vehicle Video Applications |
| Advisor: | 唐之瑋 Chih-Wei Tang |
| Oral defense committee: | |
| Degree: | 碩士 Master |
| Department: | 資訊電機學院 - 通訊工程學系 Department of Communication Engineering |
| Academic year of graduation: | 96 (ROC calendar) |
| Language: | Chinese |
| Pages: | 75 |
| Chinese keywords: | 車內視訊 (in-vehicle video), 光線 (illumination), 視訊壓縮 (video compression) |
| Foreign-language keywords: | in-vehicle, video coding, illumination |
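This thesis designs a video communication system that improves convenience for vehicle occupants. It can be used for video communication between a driver or passenger in a vehicle and people in an office or in another vehicle. Because wireless transmission bandwidth is limited, the system uses video coding that takes the characteristics of human vision into account and reduces the amount of data spent on non-face regions. In addition, illumination often changes with the environment, which significantly changes the content of in-vehicle video. In this thesis we therefore propose an illumination-adaptive video encoder for in-vehicle video with three stages: illumination correction, face detection, and video compression based on human visual attention. The illumination correction scheme integrates Single Scale Retinex and interval weighted histogram separation. The experimental results show that our illumination correction scheme effectively raises the face detection rate for in-vehicle video. Moreover, our video compression system reduces the amount of data while maintaining good video quality.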
With the advance of intelligent vehicle systems, drivers and passengers can interact with people in fixed offices or in other vehicles through visual communication. However, illumination variations caused by changes in the environment or weather conditions may significantly change the appearance of in-vehicle video. As a result, compression efficiency drops considerably, while the bandwidth of such wireless communication is already quite limited. Little previous work has been designed for efficient in-vehicle video compression. Thus, in this thesis, we propose an illumination adaptive video coding scheme for in-vehicle video applications. Since human faces are usually the most visually attended regions in such applications, the scheme consists of illumination correction, face detection, and a visual attention based video codec. The proposed illumination correction strategy combines the advantages of single-scale Retinex (SSR) and interval weighted histogram separation (IWHS). The experimental results show that our illumination correction strategy effectively improves face detection performance on in-vehicle video. Moreover, the proposed scheme delivers better subjective visual quality than H.264 with rate control, because it allocates bits according to human visual characteristics.
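To make the illumination correction stage more concrete, the following is a minimal sketch of generic single-scale Retinex: the log of the image minus the log of its Gaussian-blurred surround. It is not the thesis's implementation. The function names, the `sigma` and `eps` parameters, and the edge-padded separable blur are assumptions chosen for illustration, and the abstract does not say how SSR is combined with IWHS, so that step is not shown.

```python
import numpy as np

def gaussian_kernel(sigma):
    """1-D Gaussian kernel truncated at about 3 sigma, normalized to sum to 1."""
    radius = max(1, int(3 * sigma))
    x = np.arange(-radius, radius + 1, dtype=np.float64)
    k = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    return k / k.sum()

def gaussian_blur(img, sigma):
    """Separable Gaussian blur of a 2-D array with edge-replicated borders."""
    k = gaussian_kernel(sigma)
    r = len(k) // 2
    padded = np.pad(img, r, mode="edge")
    # Filter along rows, then along columns; 'valid' strips the padding again.
    rows = np.apply_along_axis(lambda v: np.convolve(v, k, mode="valid"), 1, padded)
    return np.apply_along_axis(lambda v: np.convolve(v, k, mode="valid"), 0, rows)

def single_scale_retinex(img, sigma=15.0, eps=1.0):
    """Generic SSR on one channel: log(I) - log(Gaussian surround of I).

    sigma (surround scale) and eps (offset that avoids log(0)) are
    illustrative defaults, not values taken from the thesis.
    """
    img = img.astype(np.float64) + eps
    surround = gaussian_blur(img, sigma)
    return np.log(img) - np.log(surround)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frame = rng.uniform(10, 200, size=(12, 10))   # stand-in for a luminance frame
    corrected = single_scale_retinex(frame, sigma=2.0)
    print(corrected.shape)
```

Because the output is a ratio in the log domain, scaling the whole frame by a constant brightness factor leaves the result unchanged when `eps` is zero, which is the property that makes Retinex-style processing attractive ahead of face detection under changing light.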