| 研究生: |
廖雅蓮 Ya-Lien Liao |
|---|---|
| 論文名稱: | A Meta-Feature Representation Approach toImage Annotation |
| 指導教授: |
蔡志豐
Chih-Fong Tsai |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
管理學院 - 資訊管理學系 Department of Information Management |
| 畢業學年度: | 98 |
| 語文別: | 英文 |
| 論文頁數: | 86 |
| 中文關鍵詞: | 圖片特徵表示法 、影像檢索 、圖片命名 、圖片分類 |
| 外文關鍵詞: | feature representation, image retrieval, image annotation, image classification |
| 相關次數: | 點閱:14 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
隨著電腦網路與資訊科技蓬勃發展,影像自動命名與檢索的議題應運而生,如何提供使用者準確及有效率的影像檢索成為探索的目標。近年來以影像內容特徵為基礎的影像檢索方法蓬勃發展。然而,直接使用影像原始低階特徵的分類準確率偏低,造成影像經常被命名了不恰當的關鍵字。本論文提出一種新的特徵表示法達成優於使用原始低階特徵的分類準確率,稱為Meta-Features 表示法。本論文使用Corel 資料集來驗證Meta-features 能夠達成較好的分類成效。為了充分驗證,實驗採用三種知名的分類器來進行比較,分別是Support Vector Machines (SVM)、k-Nearest Neighbor (k-NN)與naïve Bayes 分類器。
This thesis proposes a feature representation approach namely meta-feature representation for automatic image annotation. Automatic image annotation technology aims to provide an efficient and effective searching environment for users to query images by keywords, which solves the limitation of Content-Based Image Retrieval (CBIR) that low-level image features are only extracted and used for similarity search. It is the fact that low-level image features do not directly correspond to high-level concepts of users. This causes many incorrect keyword assignments to images since a certain number of semantically similarn images have dissimilar low-level features and semantically dissimilar images have similar low-level features. The
meta-features proposed in this thesis are extracted from the original low-level image features by nine transformation formulas to improve annotation accuracy. We use the Corel dataset for the experiments to show the performance improvement of image annotation based on the meta-features. In particular, for classifier design, this thesis considers three well-known and popular classifiers for image annotation. They are the k-Nearest Neighbor (k-NN) classifier, Support Vector Machines (SVM), and the naïve Bayes classifier.
[1]. Antani, S., Kasturi, R., and Jain, R. (2002) A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video. Pattern Recognition, vol. 35, pp. 945-965.
[2]. Asconcelos, N. (2007). From pixels to semantic spaces: Advances in content-based image retrieval. Computer, 40(7), pp.20-26, UC San Diego.
[3]. Aslandogan, Y.A. and Yu, C.T. (1999) Techniques and systems for image and video retrieval. IEEE Transactions on Knowledge and Data Engineering, vol. 11, no. 1, pp.56-63.
[4]. Awcock, G.W. and Thomas, R. (1996) Applied image processing. McGraw-Hill, New York.
[5]. Barnard, K., Duygulu, P., Forsyth, D., de Freitas, N., Blei, D., and Jordan, M.I. (2003a) Matching words and pictures. Journal of Machine Learning Research, vol. 3, pp. 1107-1135.
[6]. Barnard, K., Duygulu, P., Guru, R., Gabbur, P., and Forsyth, D. (2003b) The effects of segmentation and feature choice in a translation model of object recognition. Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, Madison, Wisconsin, June 16-22, pp. 675-682.
[7]. Blei, D.M. and Jordan, M.I. (2003) Modeling annotated data. Proceedings of the 26th International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada, July 28-Aug. 1, pp. 127-134.
[8]. Caelli, T. and Bischof, W.F. (1997) The role of machine learning in building image interpretation systems. International Journal of Pattern Recognition and Artificial Intelligence, vol. 11, no. 1, pp. 143-168.
[9]. Castleman, K.R. (1996) Digital Image Processing. Prentice-Hall, New Jersey.
[10]. Cawkell, A.E. (1992) Selected aspects of image processing and management: review and future prospects. Journal of Information Science, vol. 18, no. 3, pp. 179-192.
[11]. Chang, C.C. and Lin, C.J., LIBSVM: a library for support vector machines, vol. 80: 604–611, 2001.
[12]. Choi, Y. and Rasmussen, E.M. (2002) Users’ relevance criteria in image retrieval in American history. Information Processing & Management, vol. 38, no. 5, pp. 695-726.
[13]. Cover, T. and Hart, P. (1967) Nearest neighbor pattern classification. IEEE Transactions on Information Theory, vol.13, no.1, pp. 21- 27.
[14]. Das, G. and Ray, S. (2006) Effect of Finite Sample Size in Content-Based Image Retrieval. Video and Signal Based Surveillance. AVSS ''06. IEEE International Conference on, vol., no., pp.96-96, Nov.
[15]. Datta, R., Joshi, D., Li, J., and Wang, J.Z. (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Computing Surveys, vol. 40, no. 2, article 5.
[16]. Daubechies, I. (1992) Ten lectures on wavelets. Society for Industrial and Applied Mathematics, Philadelphia.
[17]. Daugman, J.G. (1990) An information-theoretic view of analog representation in striate cortex. In Computational Neuroscience (pp.403-423), Schwartz, E.L. (Ed.), MIT Press, Massachusetts.
[18]. Deerwester, S., Dumais, S.T.; Furnas, G.W., Landauer, T.K. and Harshman, R. (1990) Indexing by latent semantic indexing. Journal of the American Society for Information Science, vol.41, no. 6.
[19]. Deselaers, T., Deserno, T.M., and Muller, H. (2008) Automatic medical image annotation in ImageCLEF 2007: overview, results, and discussion. Pattern Recognition Letters, vol. 29, pp. 1988-1995.
[20]. Duda, R.O., Hart, P.E., and Stork, D.G. (2001) Pattern Classification (2nd Edition). John Wiley, New York.
[21]. Eakins, J.P., Briggs, P., and Burford, B. (2004) Image retrieval interfaces: a user perspective. Proceedings of the International Conference on Image and Video Retrieval, Dublin, Ireland, July 21-23, pp. 628-637.
[22]. Givers, T. and Smeulders, A. W. M. (1996) A comparative study of several color models for color image invariant retrieval, Proceedings of the First International Workshop on Image Databases and Multi Media search, Amsterdam, The Netherlands, August 22-23, pp. 17-26.
[23]. Goodrum, A. and Spink, A. (2001) Image searching on the Excite Web search engine. Information Processing & Management, vol. 37, no. 2, pp. 295-311.
[24]. Grigorescu, S.E., Petkov, N., and Kruizinga, P. (2002) Comparison of texture features based on Gabor filters. IEEE Transactions on Image Processing, vol. 11, no. 10, pp. 1160-1167.
[25]. Gupta, A., Santini, S. and Jain, R. (1997) In search of information in visual media. Commun ACM ,40(12):35–42.
[26]. Keefe, J.M. (1990) The image as document: descriptive programs at Rensselaer. Library Trends, vol. 38, no. 4, pp. 659-681.
[27]. Idris, F. and Panchanathan, S. (1997) Review of image and video indexing techniques. Journal of Visual Communication and Image Representation, vol. 8, no. 2, pp. 146-166.
[28]. Ito, H., Kawai ,Y. and Koshimizu, H. (2008) Face Image Annotation Based on Latent Semantic Space and Rules. 12th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems, pp.766-773.
[29]. Jahne, B. (1995) Digital Image Processing: Concepts, Algorithms, and Scientific Applications. Springer-Verlag, Berlin.
[30]. Jeong, J., Hwang, C. and Jeon, B. (2009, January) An efficient method of image identification by combining image features. International Conference on Ubiquitous Information Management and Communication, Suwon, Korea.
[31]. Jeon, J., Lavrenko, V., and Manmatha, R. (2003, July) Automatic image annotation and retrieval using cross-media relevance models. Proceedings of the 26th Annual International ACM SIGIR Conference on Research and development in information retrieval, Toronto, Canada, pp. 119-126.
[32]. Wang, J.Z., Li, J. and Wiederhold, G. (2001) SIMPLIcity: semantics-sensitive integrated matching for picture libraries. IEEE Trans. Pattern Analysis and Machine Intelligence, 23(9), 947–963.
[33]. Li, J. and Wang, J.Z. (2003) Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no. 9, pp. 1075-1088.
[34]. Livens, S., Scheunders, P., Van de Wouwer, G. and Van Dyck, D. (1997) Wavelet for texture analysis: an overview. Proceedings of the IEEE International Conference on Image Processing and its Applications, Dublin, Ireland, July 14-17, p. 581-585.
[35]. Long, F., Zhang, H., and Feng, D.D. (2003) Fundamentals of content-based image retrieval. In Multimedia Information Retrieval and Management – Technological Fundamentals and Applications. Feng, D.D., Siu, W.C., Zhang, H. (Eds.), Springer-Verlag, Germany.
[36]. Manning , C.D. , Raghavan, P. , Schütze, H. (2008) Introduction to Information Retrieval. Cambridge University Press. ISBN: 0521865719.
[37]. Markkular, M, Tico, M., Sepponen, B., Nirkkonen, K. and Sormunen, E. (2001) A test collection for the evaluation of content-based image retrieval algorithms – a user and task-based approach. Information Retrieval, vol. 4, no. 3-4, pp. 275-293.
[38]. Mathias, E. (1998) Comparing the influence of color spaces and metrics in content-based image retrieval. Proceedings of the IEEE International Symposium on Computer Graphics, Image Processing, and Vision, Rio de Janeiro, Brazil, Oct. 20-23, pp. 371-378.
[39]. Mitchell, T. (1997) Machine Learning. McGraw-Hill, New York.
[40]. Mylonas , P., Spyrou , E., Avrithis ,Y. and Kollias,S. (2009) Using visual context and region semantics for high-level concept detection. IEEE Transactions on Multimedia, vol. 11, NO. 2.
[41]. Papathomas, T.V., Conway, T.E., Cox, I.J., Ghosn, J., Miller, M.L., Minka, T.P., and Yianilos, P.N. (1998, January) Psychophysical studies of the performance of an image database retrieval system. Proceedings of the SPIE Conference on Human Vision and Electronic Imaging III, vol. 3299, San Jose, California, 24-30, pp. 591-602.
[42]. Paredes, R., Pérez, J. C., Juan, A., and Vidal, E. (2001, July) Local representations and a direct voting scheme for face recognition. In In Proc. of the Workshop on Pattern Recognition in Information Systems.
[43]. Rish, Irina. (2001). An empirical study of the naive bayes classifier. IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence.
[44]. Rui, Y., Huang, T. S., and Chang, S. F. (1999) Image retrieval: current techniques, promising directions and open issues. Journal of Visual Communication and Image Representation, vol. 10, no. 1, pp. 39-62.
[45]. Sebe, N. and Lew, M.S. (2001) Texture feature for content-based retrieval. In Principles of Visual Information Retrieval, Lew, M. S. (Ed.), Springer-Verlag, London.
[46]. Shi, J. and Malik, J. (2000) Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22 No. 8, pp. 888-905.
[47]. Smith, L. I. (2002, February) A tutorial on principal component analysis. http://www.cs.otago.ac.nz/cosc453/student_tutorials/principal_components.pdf
[48]. Tang, J., Lewis, P.H. (2007, March) A Study of Quality Issues for Image Auto-Annotation with the Corel Data-Set. IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no.3.
[49]. Tsai, C. F., McGarry, K., and Tait, J. (2006a) Qualitative evaluation of automatic assignment of keywords to images. Information Processing & Management, vol. 42, no. 1, pp. 136-154.
[50]. Tsai, C. F., McGarry, K., and Tait, J. (2006b) CLAIRE: a modular support vector image indexing and classification system. ACM Transactions on Information Systems, vol. 24, no. 3, pp. 353-379.
[51]. Tsai, C. F. and Hung, C. (2008) Automatically annotating images with keywords: a review of image annotation systems. Recent Patents on Computer Science. vol. 1, no. 1, pp. 55-68.
[52]. Tsang, I. W., Kwok , J.T., Cheung, P. M. (2005) Core vector machines: Fast SVM training on very large data sets. The Journal of Machine Learning Research, vol. 6, p.363-392.
[53]. Tuceryan, M. and Jain, A.K. (1998) Texture analysis. In The Handbook of Pattern Recognition and Computer Vision, 2nd Edition. Chen, C.H., Pau, L.F., and Wang, P. S. P. (Eds.), World Scientific, Singapore.
[54]. Venters, C. C. and Cooper, M. (2000, June) A Review of Content-Based Image Retrieval Systems. JISC Technology Application Program. Technical Report JTAP-054.
[55]. Wong, W.T. and Hsu, S.H. (2006) Application of SVM and ANN for image retrieval. European Journal of Operational Research, vol. 173, pp. 938-50.
[56]. Wang,Y., Mei,T., Gong,S. and Hua X.S. (2009) Combining global, regional and contextual features for automatic image annotation. Pattern Recognition,vol. 42, 259 – 266.
[57]. Wu, J.K., Kankanhalli, M.S., Lim, J.-H., and Hong, D. (2000) Perspectives on content-based multimedia systems. Kluwer Academic Publishers, Massachusetts.
[58]. Wu, L., Hu,Y., Li, M., Yu, N., Hua, X.S. (2009) Scale-invariant visual language modeling for object categorization. IEEE Transactions on Multimedia, vol. 11, no. 2.
[59]. Wu, X. and Kumar, V. (2009) Top ten algorithms in data mining. Knowledge and Information Systems, vol. 14 (1), pp. 1–37.