| 研究生: |
蔡雨芳 Yu-Fang Tsai |
|---|---|
| 論文名稱: |
基於偏最小平方法之靜態手勢辨識 Hand Posture Recognition Using Partial Least Squares |
| 指導教授: |
唐之瑋
Chih-Wei Tang |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 通訊工程學系 Department of Communication Engineering |
| 論文出版年: | 2013 |
| 畢業學年度: | 101 |
| 語文別: | 中文 |
| 論文頁數: | 92 |
| 中文關鍵詞: | 手勢辨識 、偏最小平方法 、梯度方向直方圖 、支持向量機 、降維 、降低訓練時間 |
| 外文關鍵詞: | Hand Posture Recognition, Partial Least Squares, Histogram of Oriented Gradients, Support Vector Machine, Dimension Reduction, Training Time Reduction |
| 相關次數: | 點閱:19 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
手勢辨識之應用可滿足人機互動的需求,故基於影像之手勢辨識為重要的研究議題。以影像為基礎之手勢辨識,易受到環境及人為影響,例如:手勢大小、光線變化及視角變化。此外,手勢辨識系統應降低訓練及測試時間,以因應遇到影像資料量大和特徵維度高之問題。
在本論文中提出以偏最小平方法(partial least squares, PLS)降低Histogram of Oriented Gradients (HOG)的維度,再結合支持向量機(Support Vector Machine, SVM)並採用RBF核函式,於訓練階段對降維後的HOG徵描述子進行分類器之訓練。於測試階段,對新進影像擷取HOG特徵描述子,並使用在訓練階段所得的降維係數,進行對測試資料的降維,再將降維後的資料輸入SVM分類器內,以OAA分類法則決定最終辨識結果,此既能減少系統處理時間,又能有效增進辨識率。
The needs of applications of hand posture recognition meets the demand for human-computer interaction, thus the vision based hand posture recognition is an important research topic. However, vision based hand posture recognition is vulnerable to environmental changes or human impacts, i.e. size of hand posture, lighting and view variations. In addition , a hand posture recognition system must process efficiently in training and testing stage to handle the large amount of image samples and high dimensional descriptors.
In this paper, we propose to apply partial least squares method (partial least squares, PLS) on dimensions of Histogram of Oriented Gradients (HOG) descriptors, and to train support vector machines (Support Vector Machine, SVM) with RBF kernel function. At the training stage, we perform dimension reduction HOG descriptors and then to train SVM classifiers. In the testing phase, after obtaining of HOG feature descriptors on testing images, and then we use the weight matrix from PLS for dimension reduction. To obtain the final classification result, we use SVM classifiers, followed by OAA decision rule. With these helps , this system can not only reduce both the training and testing time, but also improve the recognition rate.
[1] X. Shen, G. Hua, L. Williams and Y. Wu, “Dynamic hand gesture recognition: An exemplar-based approach from motion divergence fields, ” Image and Vision Computing, vol. 30,no. 3, pp. 227-235, March 2012.
[2] A. Just and S. Marcel, “A comparative study of two state-of-the-art sequence processing techniques for hand gesture recognition,” Computer Vision and Image Understanding, vol. 113, no. 4, pp. 532-543, April 2009.
[3] S.S. Ge *, Y. Yang and T.H. Lee, “Hand gesture recognition and tracking based on distributed locally linear embedding,” Image and Vision Computing, vol. 26, no. 12, pp.1607–1620, December 2008.
[4] S. Mitra, and T. Acharya, “Gesture Recognition: A Survey,” in Proceedings of IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, vol. 37, no. 3, pp.311-324, May 2007.
[5] SIL.org. 2013. Documentation for ISO 639 identifier: ase. [ONLINE] Available at: http://www-01.sil.org/iso639-3/documentation.asp?id=ase. [Accessed 17 May 13].
[6] Lifeprint.com, (2004), The Fingerspelled Alphabet [ONLINE]. Available at: http://lifeprint.com/asl101/fingerspelling/images/abc1280x960.png [Accessed 18 May 13].
[7] H. Abdi,and L.J.Williams, “Principal component analysis,” Wiley Interdisciplinary Reviews: Computational Statistics, vol. 2, no. 4, pp. 433-459, 30 June 2010.
[8] M. Borga, “Canonical Correlation a Tutorial,” Department of Biomedical Engineering, Linköping University, 12 January 2001.
[9] I. Helland, “On the structure of partial least squares regression," Communications in Statistics-Simulation and Computation, vol. 17, no. 2, pp. 581-607, 1988.
[10] J. Henseler, C.M. Ringle, and Rudolf R. Sinkovics, “The use of partial least squares path modeling in international marketing, ” Advances in International Marketing, vol. 20, no.1, pp. 277-319, 2009.
[11] C. Nölker and H. Ritter, “Visual Recognition of Continuous Hand Postures,” in Proceedings of IEEE Transactions on Neural Networks, vol. 13, no. 4, pp. 983-994, July 2002.
[12] M. M. Hasan and P. K. Mishra, “Hand Gesture Modeling and Recognition using Geometric Features: A Review,” in Proceedings of Canadian Journal on Image Processing and Computer Vision, vol. 3 no. 1, pp. 12-26, March 2012.
[13] J. J. Kuch and T.S. Huang, “Vision Based Hand Modelin and Tracking for Virtual Teleconferencing and Telecollaboration,” in Proceedings of IEEE International Conference on Computer Vision, pp. 666-671, 20-23 June 1995.
[14] H. Ritter, “Parametrized self-organizing maps,” Artificial Neural Networks, vol. 3, pp. 568–577, 13–16 September 1993.
[15] T. Kohonen, “The self-organizing map,” Proceedings of the IEEE, vol. 78, no. 9, pp. 1464–1480, September 1990.
[16] Y. Chuang, L. Chen, G. Zhao, and G. Chen, “Hand Posture Recognition and Tracking Based on Bag-of-Words for Human Robot Interaction,” in Proceedings of IEEE International Conference on Robotics and Automation, pp. 1050-4729, 9-13 May 2011.
[17] C-C Wang,and K-C Wang, “Hand Posture recognition using Adaboost with SIFT for Human Robot Interaction”, in Proceedings of IEEE International Conference on Advanced Robotics, pp.317-329, August 2007.
[18] C-C Wang,and K-C Wang, “Hand Gesture Recognition Using Adaboost with SIFT, ” National Taiwan University Master Thesis, 2007.
[19] Linda Shapiro. 2007. MSER Operator. [ONLINE] Available at:http://www.cs.washington.edu/education/courses/cse576/07sp/notes/MSER_white.pdf. [Accessed 17 April 13].
[20] Y. Song, D. Demirdjian, and Randall Davis, “Tracking Body and Hands for Gesture Recognition: NATOPS Aircraft Handling Signals Database, ” in Proceedings of IEEE International Conference on Automatic Face &Gesture Recognition and Workshops, pp. 500-506, 21-25 March 2011.
[21] A. Misra, A. Takashi, T. Okatani, and K. Deguchi, “Hand Gesture Recognition using Histogram of Oriented Gradients and Partial Least Squares Regression, ” in Proceedings of IAPR Conference on Machine Vision Applications, pp.479-482, 13-15 June 2011.
[22] D. Knight, M.Tang, H. Dahlkamp, and C. Plagemann, “A Framework for Recognizing Hand Gestures, ” CS229 Final Project Paper, 2010.
[23] S. Liwicki and M. Everingham, “Automatic Recognition of Finger spelled Words in British Sign Language," in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 07 July 2009.
[24] A. Bosch, A. Zisserman, and X. Munoz, “Representing shape with a spatial pyramid kernel,” in Proceedings of the 6th ACM International Conference on Image and Video Retrieval, pp. 401-408, 09-11 July 2007.
[25] B.E. Boser, I. Guyon, and V. Vapnik, “A training algorithm for optimal margin classifiers," in Proceedings of ACM Conference on Learning Theory, pp. 144-152, July 1992.
[26] N.Cristianini and J. Shawe-Taylor, “An Introduction to Support Vector Machines and Other Kernel-based Learning Methods,” NY Cambridge University Press, 2000.
[27] Bishop, Christopher M. Pattern recognition and machine learning. vol. 1. pp. 326-328, Springer, New York, 2006.
[28] J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid, “Local Feature and Kernels for Classification of Texture and Object Categories: A Comprehensive Study,” Internal Journal of Computer Vision, vol. 73, no. 2, pp.213-238, June 2007.
[29] D.G. Lowe, “Distinctive image features from scale-invariant keypoints,” International Journal of Computer Vision, vol. 60, no. 2, pp.91-110, November 2004.
[30] H. Bay, A. Ess, T. Tuytelaars, and L. V. Gool, “SURF: speeded up robust features,” Journal of Computer Vision and Image Understanding, vol. 110, no. 3, pp. 346-359, May 2008.
[31] K. Mikolajczyk and C. Schmid, “A performance evaluation of local descriptors,” in Proceedings of IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 10, no. 27, pp.1615-1630, October 2005.
[32] N. Dalal and B.Triggs, “Histograms of Oriented Gradients for Human Detection,” in Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, vol. 1, pp.886-893, June 2005.
[33] H. Abdi, “Partial least squares regression and projection on latent structure regression (PLS-Regression),” Wiley Interdisciplinary Reviews: Computational Statistics, vol. 2, no. 1, pp.97–106, 06 January, 2010.
[34] M. Haenlein and A. M. Kaplan, “A Beginner’s Guide to Partial Least Squares Analysis,” Understanding Statistics, vol. 3, no. 4, pp.283-297, 2004.
[35] R. Rosipal and N. Krämer, “Overview and Recent Advances in Partial Least Squares,” Proceedings of the international conference on Subspace, Latent Structure and Feature Selection, vol. 3940, pp 34-51, 2006.
[36] A. Höskuldsson, “PLS regression methods,” Journal of Chemometrics, vol. 2, no.3 pp.211-228, June 1988.
[37] S. De Jong, “SIMPLS:an alternative approach to partial least squares regression,” Chemometrics and Intelligent Laboratory Systems, vol. 18, pp. 251–263, March 1993.
[38] D. Chai and K. N. Ngan, “Face segmentation using skin-color map in videophone applications,” in Proceedings of IEEE Trans. Circuits and Systems for Video Technology, vol. 9, no. 4, pp.551-564, June 1999.
[39] G. Hoffmann, “Interpolations for Image Warping,” University of Applied Sciences in Emden. 2013.
[40] Chih-Chung Chang and Chih-Jen Lin, LIBSVM : a library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1--27:27, 2011. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
[41] C.-W. Hsu, C.-C. Chang, C.-J. Lin. “A practical guide to support vector classification.” April 2010.
[42] Jochen Triesch and Christoph von der Malsburg, “Robust Classification of Hand Postures against Complex Backgrounds,” Proceedings of the Second International Conference on Automatic Face and Gesture Recognition, pp 170-175, IEEE Computer Society Press, Killington, Vermont, USA, 14-16 October 1996.
[43] Y. Tang, Y-Q. Zhang, N.V. Chawla, and S. Krasser, “SVMs modeling for highly imbalanced classification,” in Proceedings of IEEE Transactions on Systems, Man, and Cybernetics, vol. 39, no. 1, pp. 281-288, February 2009.