跳到主要內容

簡易檢索 / 詳目顯示

研究生: 吳鼎汶
Hendrik
論文名稱: 航空監控影像之區域切割與分類
Region Segmentation and Labeling in Aerial Surveillance Images
指導教授: 鄭旭詠
Hsu-Yung Cheng
口試委員:
學位類別: 碩士
Master
系所名稱: 資訊電機學院 - 資訊工程學系
Department of Computer Science & Information Engineering
畢業學年度: 100
語文別: 英文
論文頁數: 59
中文關鍵詞: 空照監控區域分割影像分類
外文關鍵詞: image labeling, region segmentation, aerial surveillance
相關次數: 點閱:15下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 近年來空照監控影像的需求不斷增加,其應用範圍廣泛,主要用在軍事與交通運輸上。在交通運輸方面,空照監控影像不但能讓警察掌握交通狀況還能協助道路秩序之管理。為了能自動化地處理空照影像,必須先對影像中的物件與區域進行分割與標記的動作,如物件偵測、事件偵測、等。
    本論文提出一套自動化空照影像之區域分割與分類系統。任何影像分割的演算法都可以套入本系統中。其中比較了兩種分割演算法,分別是分水嶺(Watershed)與均值偏移(Mean-shift)分割演算法,但由於上述演算法會導致區域被過度切割,故本系統將過小的區域進行合併。其合併方式是根據八連通的方式建構一個無向圖(undirected graph),將影像中的每個區域視為一個頂點(vertex),而連接兩區域的邊(edge)會給定一個權重(weight),該權重代表兩區域的相異程度,再根據該權重對鄰近的小區域進行合併。而對於每個區域,本系統會擷取其顏色與紋理特徵,並使用支持向量機(Support Vector Machine)將每個區域進行分類的動作,最後再將同類的鄰近區域合併,獲得最終的分類結果。經實驗證實,本論文所提出的方法能有效的對各種空照影像進行分割與分類。


    The demand for aerial surveillance video keeps growing in recent years. It has been proved to be an effective way to collect information for a wide range of applications, such as intelligence transportation or military applications. In intelligence transportation applications, aerial surveillance not only provides traffic monitoring but also assists traffic management. Manual labeling of objects and regions in aerial videos is a tedious task. Objects and regions in aerial videos need to be segmented and labeled to enable automated video processing, such as object detection, event detection, automated aerial videos understanding, etc.
    In this thesis we propose an automatic image segmentation and labeling system for aerial surveillance images. Any kind of segmentation algorithm can be applied in our system. In this work, we compare watershed and mean-shift segmentation algorithms. Because the above mentioned segmentation algorithms might lead to over-segmentation, small regions need to be merged. We then construct an undirected-graph based on 8-connected local neighborhood, where each region is a vertex and the weight of edge connecting two regions is the dissimilarity measure of this two regions. Adjacent small regions are merged according to the weights of the edges. For each region we extract low-level features and use Support Vector Machine (SVM) classifier to label the region. Based on the output of the SVM classifier adjacent regions with the same label will be further merged to obtain the final labeling result. The experimental results have shown that our proposed system can effectively segment and label various aerial images.

    CONTENTS ABSTRACT i 摘要 ii CONTENTS iii LIST OF FIGURES v LIST OF TABLES vii CHAPTER 1 INTRODUCTION 1 1.1 Motivation 2 1.2 Related Work 3 1.3 Thesis Organization 9 CHAPTER 2 REVIEW OF RELEVANT TECHINQUES 10 2.1 Gradient Magnitude 10 2.2 Watershed Segmentation 12 2.3 Mean-Shift Segmentation 15 2.4 Support Vector Machine (SVM) 19 CHAPTER 3 PROPOSED SYSTEM 22 3.1 Graph Construction 23 3.2 Feature Extraction 26 3.2.1 Color features 27 3.2.2 Texture Features 29 3.3 Classification 30 CHAPTER 4 EXPERIMENTAL RESULTS 32 4.1 System Environment and Dataset 32 4.2 Recall, Precision and F-Score 33 4.3 Experimental results by using Texture Features 33 4.4 Experimental Results by Using Color Features 35 4.5 Experimental Results by Combining Color and Texture Features 36 4.6 Discussion of experimental results 38 4.7 Performance Analysis 40 CHAPTER 5 CONCLUSIONS 44 REFERENCES 46

    [1] Lin.Renjun, X. Cao, Y. Xu, C. Wei and H. Qiao, "Airborne moving vehicle detection for video," in Intelligent Transportation Systems, 2008.
    [2] J. Choi and Y. Yang, "Vehicle detection from aerial images using local shape information," in PSIVT, 2009.
    [3] E. Pakizeh and M. Palhang, "Building detection from aerial images using Hough transform and intensity information," in 18th Iranian Conference on Electrical Engineering, 2010.
    [4] H. Cheng, D. Butler and C. Basu, "ViTex: Video To Tex and Its Application in Aerial Video Surveillance," in IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2006.
    [5] O. O. Karadag and F. T. Y. Vural, "HANOLISTIC: A hierarchical automatic image annotation system using holistic approach," in Computer Vision and Pattern Recognition Workshops, 2009.
    [6] P. Duygulu, K. Barnard, J. De Freitas and D. Forsyth, "Object recognition as machine translation : Learning a lexicon for a fixed image vocabulary," in Proceedings of the 7th European Conference on Computer Vision-Part IV, 2002.
    [7] J. Jeon, V. Lavrenko and R. Manmatha, "Automatic image annotation and retrieval using cross-media relevance models," in Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval, 2003.
    [8] L. V, R. Manmatha and J. Jeon, "A Model for learning the semantics of pictures," in Advances in Neural Information Processing Systems 16, 2004.
    [9] E. Akbas and F. T. Y. Vural, "Automatic image annotation by ensemble of visual descriptors," in Computer Vision and Pattern Recognition CVPR, 2007.
    [10] J. Canny, "A computational approach to edge detection," IEEE Transactions on Pattern Analysis and Machine Inteligence, vol. 8, no. 6, pp. 679-698, 1986.
    [11] N. Kanopoulos, N. Vasanthavada and R. L. Baker, "Design of an image edge detection filter using the Sobel operator," IEEE Journal of Solid-State Circuits, vol. 23, no. 2, pp. 358-367, 1988.
    [12] L. G. Shapiro and G. C. Stockman, Computer Vision, Prentice-Hall, 2001.
    [13] N. Otsu, "A threshold selection method from gray-level histogram," IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS, vol. 9, no. 1, pp. 62-66, 1979.
    [14] K. S. Tan and N. A. Mat Isa, "Color image segmentation using histogram thresholding – Fuzzy C-means hybrid approach," Pattern Recognition, vol. 44, no. 1, pp. 1-15, 2011.
    [15] R. C. Gonzalez and R. E. Woods, Digital Image Processing, 2 ed., 2002.
    [16] Y. Deng, B. S. Manjunath and S. Hyundoo, "Color image segmentation," in Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1999.
    [17] P. F. Felzenszwalb and D. P. Huttenlocher, "Efficient graph-based image segmentation," International Journal of Computer Vision, vol. 59, 2004.
    [18] H. Cheng and D. Butler, "Segmentation of Aerial Surveillance Video Using a Mixture of Experts," in Digital Image Computing : Techniques and Applications, 2005.
    [19] H. Cheng and C. A. Bouman, "Multiscale Bayesian segmentation using a trainable context model," IEEE Transactions on Image Processing, vol. 10, no. 4, pp. 511-525, 2001.
    [20] D. Comaniciu and P. Meer, "Mean shift: a robust approach toward feature space analysis," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 5, pp. 603-619, 2002.
    [21] R. Kumar, H. S. Sawhney, J. C. Asmuth, A. Pope and S. Hsu, "Registration of video to geo-referenced imagery," in Proceedings International Conference on Pattern Recognition, 1998.
    [22] H. S. Sawhney and R. Kumar, "True multi-image alignment and its application to mosaicing and lens distortion correction," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 21, no. 3, pp. 235-243, 1999.
    [23] T. Zhao, M. Aggarwal, R. Kumar and H. Sawhney, "Real-time wide area multi-camera stereo tracking," in IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005.
    [24] H. Tao, H. S. Sawhney and R. Kumar, "Object tracking with Bayesian estimation of dynamic layer representations," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 1, pp. 75-89, 2002.
    [25] C. Cusano, R. Scettini and G. Ciocca, "Image annotation using SVM," in Proceedings of Internet Imaging IV, 2004.
    [26] V. Risojevic and Z. Babic, "Aerial image classification using structural texture similarity," in IEEE International Symposium on Signal Processing and International Technology, 2011.
    [27] P. Ahmadi and S. Sadri, "A two-step approach for surface type classification of aerial images," in IEEE International Conference on Communication and Software Networks, 2011.
    [28] L. A. Vese and T. F. Chan, "A Multiphase Level Set Framework for Image Segmentation Using the Mumford and Shah Model," International Journal of Computer Vision, vol. 50, no. 3, pp. 271-293, 2002.
    [29] F. Meyer, "Topographic distance and watershed lines," Mathematical Morphology and its Applications to Signal Processing, vol. 38, no. 1, pp. 113-125, 1994.
    [30] R. O. Duda, P. E. Hart and D. G. Stork, Pattern classification, Wiley, 2001.
    [31] K. Fukunaga and L. Hostetler, "The estimation of the gradient of a density function, with applications in pattern recognition," IEEE Transactions on Information Theory, vol. 21, no. 1, pp. 32-40, 1975.
    [32] Y. Cheng, "Mean shift, mode seeking, and clustering," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 17, no. 8, pp. 790-799, 1995.
    [33] C. Cortes and V. Vapnik, "Support vector networks," Machine Learning, vol. 20, no. 3, pp. 273-297, 1995.
    [34] "RGB color space," [Online]. Available: http://en.wikipedia.org/wiki/RGB_color_space.
    [35] "HSV Color Space," [Online]. Available: http://www.blackice.com/colorspaceHSV.htm.
    [36] C.-C. Chang and C.-J. Lin, "LIBSVM: A library for support vector machines," ACM Transactions on Intelligent Systems and Technology, vol. 2, no. 3, pp. 27:1--27:27, 2011.
    [37] G. Bradski, "The OpenCV Library," Dr. Dobb''s Journal of Software Tools, 2000.
    [38] MPEG (Moving Picture Experts Group), "MPEG-7 Overview," [Online]. Available: http://mpeg.chiariglione.org/standards/mpeg-7/mpeg-7.htm.
    [39] M. Bastan, H. Cam, U. Gudukbay and O. Ulusoy, "Bilvideo-7: an MPEG-7- compatible video indexing and retrieval system," IEEE Multimedia, vol. 17, no. 3, pp. 62-73, 2010.
    [40] M. Bastan, "MPEG-7 Feature Extraction Library," [Online]. Available: http://www.cs.bilkent.edu.tr/~bilmdg/bilvideo-7/Software.html.

    QR CODE
    :::