基於H.264視訊編碼資訊之即時性移動鏡頭前景分割

簡易檢索 / 詳目顯示

回結果列表

研究生：	洪偉迪 Wei-Di Hong
論文名稱：	基於H.264視訊編碼資訊之即時性移動鏡頭前景分割 Foreground segmentation for moving camera based on H.264 video coding information
指導教授：	張寶基 Pao-Chi Chang
口試委員:
學位類別：	碩士 Master
系所名稱：	資訊電機學院 - 通訊工程學系 Department of Communication Engineering
畢業學年度：	95
語文別：	英文
論文頁數：	56
中文關鍵詞：	前景分割、移動鏡頭、視訊
外文關鍵詞：	foreground segmentation, video coding, H.264, moving camera
相關次數：	點閱：18 下載：0
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

對於視訊畫面之前景物體切割在視訊監控、辨識、檢索以及視訊編碼等方面一直是重要的一環。針對移動鏡頭所進行之切割相較於固定鏡頭有更廣泛的應用。由於視訊資料量龐大，在儲存及傳輸前皆需經過壓縮處理。直接利用壓縮資訊對移動鏡頭視訊內容進行移動物體切割便可省去對於原始影像之處理，為一適合即時應用之演算法。
近年視訊壓縮標準不斷演進，可用以進行切割的資訊也隨之改變。H.264標準中除了傳統的移動向量，還多出了七種模式的區塊分割可提供額外資訊。在前人的移動鏡頭前景切割演算法中，首先利用運動向量以最小平方法求得背景相對於鏡頭之移動，再將與背景移動差異較大之區塊標記為前景區塊。在上述過程中我們利用區塊模式的不同，可以在逼近背景相對運動時取得較具參考價值之運動向量，並且在決定前景區塊時提升判斷之精確度。再加以我們設計之低通濾波器進行空間域與時間域之雜訊消除，最後以可調適性的門檻值來決定切割出的前景區塊。
利用移動向量能即時處理移動鏡頭之視訊內容，較許多空間域前景切割方法有更高之實用性。實驗結果顯示利用區塊模式資訊幫助前景分割，相較於只用移動向量可獲得較準確之結果，並且運算量低，可應用於即時系統中。

Foreground segmentation for video frames has been an important role in video surveillance, pattern recognition, video indexing, and video coding. Due to the large amount of video data, videos have to be compressed before storage and transmission. Foreground segmentation based on compression information saves the processing to the original frame, therefore, is an algorithm suitable for real-time applications.
In recent years, video compression standards had been promoted rapidly. In the H.264/AVC video coding standard, in addition to motion vectors, there are also seven-mode block partitions which can provide extra information for segmentation. In former algorithms for moving object segmentation for video acquired by moving cameras, they first approximated the relative global motion model using all motion vectors, than marked the blocks with motion vectors differed from the global motion by an amount as foreground blocks. During the procedure described above, according to the different block partition modes, we can choose the preferable MVs for estimating the global motion, moreover, improve the accuracy of the judgments on blocks as being foreground or background. Finally, we refine the results with spatial and temporal filters of our design and segment foreground with proposed adaptive threshold.
With the use of motion vectors, we can process the video data from moving cameras. It makes this algorithm more practical than many object segmentation methods using spatial domain information. It also reduces the computational costs and can be used for real-time systems.

Chapter 1 Introduction	1
Chapter 2 Related works and H.264 coding information for segmentation	4
2.1 Motion-based segmentation	5
2.2 Change-based segmentation	6
2.3 Spatio-temporal segmentation	8
2.4 H.264/AVC standard and coding information used for foreground segmentation	10
Chapter 3 Proposed foreground segmentation scheme for moving camera based on H.264 coding information	14
3.1 Flowchart of the proposed foreground segmentation scheme for video data from moving camera	15
3.2 Approximating global motion model with motion vectors	17
3.3 Enhanced moving object segmentation with H.264 partition modes	30
3.4 Spatio-temporal refinement	34
3.5 Adaptive Thresholding for foreground segmentation	38
Chapter 4 Experimental results and discussion	42
4.1 Segmentation Results	42
4.2 Discussions for real-time applications	55
Chapter 5 Conclusion and future works	56
References	57

                                

[1] W. Zeng, J. Du, W. Gao, and Q. M. Huang, “Robust moving object segmentation on H.264/AVC compressed video using the block-based MRF model,”Real-Time Imaging, pp. 290–299, 2005.
[2] J. Wang, N. Patel, and W. Grosky, “Moving camera moving object segmentation in an MPEG-2 compressed video sequence,” Proceedings of the Conference on Multimedia Content Analysis, Management, and Retrieval (IS&T/SPIE Symposium on Electronic Imagery), pp. 372–379 2006.
[3] F. Arnell and L. Petersson, “Fast object segmentation from a moving camera,” Proceedings of IEEE, Intelligent Vehicles Symposium, pp. 136–141, 2005.
[4] K. Toyama, J. Krumm, B. Brumitt and B. Meyers, “Wallflower: Principles and practice of background maintenance,” Proceedings of the 1999 7th IEEE International Conference on Computer Vision (ICCV''99), pp. 255–261, 1999.
[5] A. Makarov, “Comparison of background extraction based intrusion detection algorithms,” Proceedings of the 1996 IEEE International Conference on Image Processing, ICIP''96. Part 1 (of 3), pp. 521–524, 1996.
[6] S. Pumrin, “A Framework for Dynamically Measuring Mean Vehicle Speed Using Un-Calibrated Cameras,” General Examination Report, Intelligent Transportation Systems Laboratory Department of Electrical Engineering University of Washington, December 12, 2001.
[7] J. Y. Wang, A. E. Adelson, “Spatio-temporal segmentation of video data,” Proceedings of SPIE on Image and Video Processing II, Vol. 2182, pp. 120–131, San Jose, February 1994.
[8] R. Wang, H. Zhang, and Y. Zhang, “A confidence measure based moving object extraction system built for compressed domain,” Proceedings of IEEE Int. Symp. Circuits and Systems, vol. 5, pp. 21–24, 2000.
[9] R. Cucchiara, A. Prati, and R. Vezzani, “Object segmentation in videos from moving camera with MRFs on color and motion features,” IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp 405–410, 2003.
[10] Y.-P. Tan, D. D. Saur, S. R. Kulkarni, and P. J. Ramadge, “Rapid Estimation of Camera Motion from Compressed Video with Application to Video annotation”, IEEE Trans. on Circ.and Syst. for Video Techn., vol. 10, pp. 133–146, 2000.
[11] S. Fotedar, and R. J. P. deFigueiredo, “Determination of Motion Parameters of a Moving Object. from Moving Camera Data,” Technical Report EE8708, Dept. of Electrical and Computer Engineering,. Rice University, 1988.
[12] B.K.P. Horn; B.G. Schunck., “Determining optical flow,” AI Memo 572. Massachusetts Institue of Technology, 1980.
[13] S. Kamijo, Y. Matsushita, K. Ikeuchi and M. Sakauchi, “Traffic monitoring and accident detection at intersections,” Proceedings IEEE Conference on Intelligent Transportation Systems, pp. 703–708, 1999.
[14] D. Gutchess, M. Trajkovic, E. Cohen-Solal, D. Lyons and A. K. Jain, “A background model initialization algorithm for video surveillance,” in 8th International Conference on Computer Vision, Jul 9-12 2001, 2001, pp. 733–740.
[15] R. Cucchiara, C. Grana, M. Piccardi and A. Prati, “Detecting moving objects, ghosts, and shadows in video streams,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 25, pp. 1337–1342, 2003.
[16] H. Lin, T. Liu and J. Chuang, “A probabilistic SVM approach for background scene initialization,” in International Conference on Image Processing (ICIP''02), Sep 22-25 2002, 2002, pp. 893–896.
[17] E. Durucan and T. Ebrahimi, “Change detection and background extraction by linear algebra,” Proceedings of the IEEE, vol. 89, pp. 1368–1381, 2001.
[18] C. Kim and J. Hwang, “Fast and automatic video object segmentation and tracking for content-based applications,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 12, pp. 122–129, 2002.
[19] Y. Sugaya and K. Kanatani, “Extracting moving objects from a moving camera video sequence,” Proceedings of the 10th Symposium on Sensing via Imaging Information, pp. 279–284, June 2004.
[20] “Draft ITU-T recommendation and final draft international standard of joint video specification(ITU-T Rec. H.264/ISO/IEC 14496-10 AVC,)” in Joint Video Team(JVT) of ISO/IEC MPEG and ITU-T VCEG, JVTG050,2003.
[21] T. Wiegand, G. J. Sullivan, G. Bjontegaard and A. Luthra, "Overview of the H.264/AVC video coding standard," IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, pp. 560–576, 2003.
[22] A. Smolic; J. R. Ohm, ”Robust global motion estimation using a simplified M-estimator approach,” Image Processing, 2000. Proceedings. 2000 International Conference, Volume: 1 , 2000.
[23] Y. P. Tan, S. R. Kulkarni, and P. J. Ramadge, “A new method for camera motion parameter estimation,” Proceeding of 1995 IEEE International Conference on Image Processing, pp. 406–409, 1995.
[24] H. Jeon, A. Basso, and P. F. Driessen, “Enhanced video coding based on camera motion and image synthesis,” Communications, Computers and signal Processing, 2003, PACRIM, 2003 IEEE, vol. 1, pp. 241–244, 2003.

簡易檢索 / 詳目顯示

相關論文