跳到主要內容

簡易檢索 / 詳目顯示

研究生: 蔡雨芳
Yu-Fang Tsai
論文名稱: 基於偏最小平方法之靜態手勢辨識
Hand Posture Recognition Using Partial Least Squares
指導教授: 唐之瑋
Chih-Wei Tang
口試委員:
學位類別: 碩士
Master
系所名稱: 資訊電機學院 - 通訊工程學系
Department of Communication Engineering
論文出版年: 2013
畢業學年度: 101
語文別: 中文
論文頁數: 92
中文關鍵詞: 手勢辨識偏最小平方法梯度方向直方圖支持向量機降維降低訓練時間
外文關鍵詞: Hand Posture Recognition, Partial Least Squares, Histogram of Oriented Gradients, Support Vector Machine, Dimension Reduction, Training Time Reduction
相關次數: 點閱:17下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 手勢辨識之應用可滿足人機互動的需求,故基於影像之手勢辨識為重要的研究議題。以影像為基礎之手勢辨識,易受到環境及人為影響,例如:手勢大小、光線變化及視角變化。此外,手勢辨識系統應降低訓練及測試時間,以因應遇到影像資料量大和特徵維度高之問題。
    在本論文中提出以偏最小平方法(partial least squares, PLS)降低Histogram of Oriented Gradients (HOG)的維度,再結合支持向量機(Support Vector Machine, SVM)並採用RBF核函式,於訓練階段對降維後的HOG徵描述子進行分類器之訓練。於測試階段,對新進影像擷取HOG特徵描述子,並使用在訓練階段所得的降維係數,進行對測試資料的降維,再將降維後的資料輸入SVM分類器內,以OAA分類法則決定最終辨識結果,此既能減少系統處理時間,又能有效增進辨識率。


    The needs of applications of hand posture recognition meets the demand for human-computer interaction, thus the vision based hand posture recognition is an important research topic. However, vision based hand posture recognition is vulnerable to environmental changes or human impacts, i.e. size of hand posture, lighting and view variations. In addition , a hand posture recognition system must process efficiently in training and testing stage to handle the large amount of image samples and high dimensional descriptors.
    In this paper, we propose to apply partial least squares method (partial least squares, PLS) on dimensions of Histogram of Oriented Gradients (HOG) descriptors, and to train support vector machines (Support Vector Machine, SVM) with RBF kernel function. At the training stage, we perform dimension reduction HOG descriptors and then to train SVM classifiers. In the testing phase, after obtaining of HOG feature descriptors on testing images, and then we use the weight matrix from PLS for dimension reduction. To obtain the final classification result, we use SVM classifiers, followed by OAA decision rule. With these helps , this system can not only reduce both the training and testing time, but also improve the recognition rate.

    摘要 i Abstract ii 誌謝 iii 目錄 iv 圖目錄 vi 表目錄 viii 第一章 緒論 1 1.1前言 1 1.2研究動機 1 1.3 研究方法 3 1.4 論文架構 3 第二章 以視覺為基礎的手勢辨識 5 2.1 手勢辨識系統架構 5 2.2 手勢辨識文獻回顧 6 2.2.1建立自由度(Degree of Freedom, DOF)手勢模型的手勢辨識演算法現況 6 2.2.2 基於影像特徵為主的手勢辨識演算法現況 9 2.2.3 採行降維技術的手勢辨識方案 22 2.3 分類器(Classifier) 23 2.3.1 支持向量機(Support Vector Machine,SVM) 23 2.3.1.1 訓練階段 25 2.3.1.2 測試階段 28 2.4 總結 29 第三章 特徵擷取與降維 30 3.1 特徵擷取 (Feature Extraction) 30 3.1.1特徵擷取演算法回顧 30 3.1.2 Scale Invariant Feature Transform (SIFT) 30 3.1.3 Histogram of Oriented Gradients (HOG) 32 3.1.3.1 C-HOG 33 3.1.3.2 R-HOG 33 3.2 偏最小平方法(Partial Least Squares, PLS) 35 3.3 總結 39 第四章 本論文所提出之以SVM為基礎採用PLS對HOG進行降維之手勢辨識演算法 41 4.1系統流程概述 41 4.2訓練階段 42 4.2.1 Hand Segmentation 42 4.2.2 使用雙立方內插法Bicubic Interpolation縮放ROI 43 4.2.3 HOG特徵擷取 45 4.2.4 PLS降維 46 4.2.5 SVM 分類器kernel參數選定 50 4.3 測試階段 52 4.4 總結 53 第五章 實驗結果與討論 54 5.1實驗環境及測試影像資料庫 54 5.2辨識系統效能評估 56 5.2.1 辨識系統的準確率分析 56 5.2.1.1 HOG 區塊參數設定對準確率之影響 56 5.2.1.2 SVM與PLS降低的維度之比較 58 5.2.2 時間複雜度評估 69 5.2.2.1 PLS降維 69 5.3 訓練及測試時間分析 69 5.4 總結 73 第六章 結論與未來展望 74 參考文獻 75

    [1] X. Shen, G. Hua, L. Williams and Y. Wu, “Dynamic hand gesture recognition: An exemplar-based approach from motion divergence fields, ” Image and Vision Computing, vol. 30,no. 3, pp. 227-235, March 2012.
    [2] A. Just and S. Marcel, “A comparative study of two state-of-the-art sequence processing techniques for hand gesture recognition,” Computer Vision and Image Understanding, vol. 113, no. 4, pp. 532-543, April 2009.
    [3] S.S. Ge *, Y. Yang and T.H. Lee, “Hand gesture recognition and tracking based on distributed locally linear embedding,” Image and Vision Computing, vol. 26, no. 12, pp.1607–1620, December 2008.
    [4] S. Mitra, and T. Acharya, “Gesture Recognition: A Survey,” in Proceedings of IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, vol. 37, no. 3, pp.311-324, May 2007.
    [5] SIL.org. 2013. Documentation for ISO 639 identifier: ase. [ONLINE] Available at: http://www-01.sil.org/iso639-3/documentation.asp?id=ase. [Accessed 17 May 13].
    [6] Lifeprint.com, (2004), The Fingerspelled Alphabet [ONLINE]. Available at: http://lifeprint.com/asl101/fingerspelling/images/abc1280x960.png [Accessed 18 May 13].
    [7] H. Abdi,and L.J.Williams, “Principal component analysis,” Wiley Interdisciplinary Reviews: Computational Statistics, vol. 2, no. 4, pp. 433-459, 30 June 2010.
    [8] M. Borga, “Canonical Correlation a Tutorial,” Department of Biomedical Engineering, Linköping University, 12 January 2001.
    [9] I. Helland, “On the structure of partial least squares regression," Communications in Statistics-Simulation and Computation, vol. 17, no. 2, pp. 581-607, 1988.
    [10] J. Henseler, C.M. Ringle, and Rudolf R. Sinkovics, “The use of partial least squares path modeling in international marketing, ” Advances in International Marketing, vol. 20, no.1, pp. 277-319, 2009.
    [11] C. Nölker and H. Ritter, “Visual Recognition of Continuous Hand Postures,” in Proceedings of IEEE Transactions on Neural Networks, vol. 13, no. 4, pp. 983-994, July 2002.
    [12] M. M. Hasan and P. K. Mishra, “Hand Gesture Modeling and Recognition using Geometric Features: A Review,” in Proceedings of Canadian Journal on Image Processing and Computer Vision, vol. 3 no. 1, pp. 12-26, March 2012.
    [13] J. J. Kuch and T.S. Huang, “Vision Based Hand Modelin and Tracking for Virtual Teleconferencing and Telecollaboration,” in Proceedings of IEEE International Conference on Computer Vision, pp. 666-671, 20-23 June 1995.
    [14] H. Ritter, “Parametrized self-organizing maps,” Artificial Neural Networks, vol. 3, pp. 568–577, 13–16 September 1993.
    [15] T. Kohonen, “The self-organizing map,” Proceedings of the IEEE, vol. 78, no. 9, pp. 1464–1480, September 1990.
    [16] Y. Chuang, L. Chen, G. Zhao, and G. Chen, “Hand Posture Recognition and Tracking Based on Bag-of-Words for Human Robot Interaction,” in Proceedings of IEEE International Conference on Robotics and Automation, pp. 1050-4729, 9-13 May 2011.
    [17] C-C Wang,and K-C Wang, “Hand Posture recognition using Adaboost with SIFT for Human Robot Interaction”, in Proceedings of IEEE International Conference on Advanced Robotics, pp.317-329, August 2007.
    [18] C-C Wang,and K-C Wang, “Hand Gesture Recognition Using Adaboost with SIFT, ” National Taiwan University Master Thesis, 2007.
    [19] Linda Shapiro. 2007. MSER Operator. [ONLINE] Available at:http://www.cs.washington.edu/education/courses/cse576/07sp/notes/MSER_white.pdf. [Accessed 17 April 13].
    [20] Y. Song, D. Demirdjian, and Randall Davis, “Tracking Body and Hands for Gesture Recognition: NATOPS Aircraft Handling Signals Database, ” in Proceedings of IEEE International Conference on Automatic Face &Gesture Recognition and Workshops, pp. 500-506, 21-25 March 2011.
    [21] A. Misra, A. Takashi, T. Okatani, and K. Deguchi, “Hand Gesture Recognition using Histogram of Oriented Gradients and Partial Least Squares Regression, ” in Proceedings of IAPR Conference on Machine Vision Applications, pp.479-482, 13-15 June 2011.
    [22] D. Knight, M.Tang, H. Dahlkamp, and C. Plagemann, “A Framework for Recognizing Hand Gestures, ” CS229 Final Project Paper, 2010.
    [23] S. Liwicki and M. Everingham, “Automatic Recognition of Finger spelled Words in British Sign Language," in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 07 July 2009.
    [24] A. Bosch, A. Zisserman, and X. Munoz, “Representing shape with a spatial pyramid kernel,” in Proceedings of the 6th ACM International Conference on Image and Video Retrieval, pp. 401-408, 09-11 July 2007.
    [25] B.E. Boser, I. Guyon, and V. Vapnik, “A training algorithm for optimal margin classifiers," in Proceedings of ACM Conference on Learning Theory, pp. 144-152, July 1992.
    [26] N.Cristianini and J. Shawe-Taylor, “An Introduction to Support Vector Machines and Other Kernel-based Learning Methods,” NY Cambridge University Press, 2000.
    [27] Bishop, Christopher M. Pattern recognition and machine learning. vol. 1. pp. 326-328, Springer, New York, 2006.
    [28] J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid, “Local Feature and Kernels for Classification of Texture and Object Categories: A Comprehensive Study,” Internal Journal of Computer Vision, vol. 73, no. 2, pp.213-238, June 2007.
    [29] D.G. Lowe, “Distinctive image features from scale-invariant keypoints,” International Journal of Computer Vision, vol. 60, no. 2, pp.91-110, November 2004.
    [30] H. Bay, A. Ess, T. Tuytelaars, and L. V. Gool, “SURF: speeded up robust features,” Journal of Computer Vision and Image Understanding, vol. 110, no. 3, pp. 346-359, May 2008.
    [31] K. Mikolajczyk and C. Schmid, “A performance evaluation of local descriptors,” in Proceedings of IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 10, no. 27, pp.1615-1630, October 2005.
    [32] N. Dalal and B.Triggs, “Histograms of Oriented Gradients for Human Detection,” in Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, vol. 1, pp.886-893, June 2005.
    [33] H. Abdi, “Partial least squares regression and projection on latent structure regression (PLS-Regression),” Wiley Interdisciplinary Reviews: Computational Statistics, vol. 2, no. 1, pp.97–106, 06 January, 2010.
    [34] M. Haenlein and A. M. Kaplan, “A Beginner’s Guide to Partial Least Squares Analysis,” Understanding Statistics, vol. 3, no. 4, pp.283-297, 2004.
    [35] R. Rosipal and N. Krämer, “Overview and Recent Advances in Partial Least Squares,” Proceedings of the international conference on Subspace, Latent Structure and Feature Selection, vol. 3940, pp 34-51, 2006.
    [36] A. Höskuldsson, “PLS regression methods,” Journal of Chemometrics, vol. 2, no.3 pp.211-228, June 1988.
    [37] S. De Jong, “SIMPLS:an alternative approach to partial least squares regression,” Chemometrics and Intelligent Laboratory Systems, vol. 18, pp. 251–263, March 1993.
    [38] D. Chai and K. N. Ngan, “Face segmentation using skin-color map in videophone applications,” in Proceedings of IEEE Trans. Circuits and Systems for Video Technology, vol. 9, no. 4, pp.551-564, June 1999.
    [39] G. Hoffmann, “Interpolations for Image Warping,” University of Applied Sciences in Emden. 2013.
    [40] Chih-Chung Chang and Chih-Jen Lin, LIBSVM : a library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1--27:27, 2011. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
    [41] C.-W. Hsu, C.-C. Chang, C.-J. Lin. “A practical guide to support vector classification.” April 2010.
    [42] Jochen Triesch and Christoph von der Malsburg, “Robust Classification of Hand Postures against Complex Backgrounds,” Proceedings of the Second International Conference on Automatic Face and Gesture Recognition, pp 170-175, IEEE Computer Society Press, Killington, Vermont, USA, 14-16 October 1996.
    [43] Y. Tang, Y-Q. Zhang, N.V. Chawla, and S. Krasser, “SVMs modeling for highly imbalanced classification,” in Proceedings of IEEE Transactions on Systems, Man, and Cybernetics, vol. 39, no. 1, pp. 281-288, February 2009.

    QR CODE
    :::