跳到主要內容

簡易檢索 / 詳目顯示

研究生: 林政廷
Cheng-Ting Lin
論文名稱: 適用於不同聽力理解程度學習者的英語影片學習系統
The Use of an English Video Learning System for the Oriented EFL Learner: Students’ Evaluation toward the Adaptive Levels of the Selected Video Clips
指導教授: 楊接期
Jie-chi Yang
口試委員:
學位類別: 碩士
Master
系所名稱: 資訊電機學院 - 網路學習科技研究所
Graduate Institute of Network Learning Technology
畢業學年度: 98
語文別: 中文
論文頁數: 71
中文關鍵詞: 影片分類支持向量機英文聽力理解場景分割
外文關鍵詞: support vector machines, video classification, scene segmentation, English listening comprehension
相關次數: 點閱:12下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本研究開發一套適用於不同聽力理解程度學習者的英語影片學習系統,希望能夠提供符合學習者聽力理解能力且充足完整的影片情境來輔助學習者英語聽力理解,並藉由SVM演算法以自動化的方式建立影片難度分級。此外,本研究設計一項實驗來評估使用字幕特徵、聲音特徵、字幕及聲音特徵,以哪種特徵方式所分類出來的影片最符合學習者聽力理解能力,以及對於輔助聽力理解成效之差異性。最後透過科技接受度和系統品質問卷調查學習者使用本系統後,對於系統的滿意度。實驗結果發現,第一,在聽力理解能力為高程度與中程度的學習者,使用字幕聲音特徵所分類出來的影片最符合其聽力理解能力。然而對聽力理解能力為低程度的學習者,使用字幕特徵所分類出來的影片,最符合其聽力理解能力。第二,不論使用字幕特徵或字幕聲音特徵所分類出來的影片,皆有助於學習者英語聽力理解。第三,學習者對於系統的滿意度,包含系統有用性、系統易用性以及系統品質上,皆給予正面的評價。


    The study proposed an English video learning system adapted to oriented EFL learners with various listening comprehension. The English video learning system not only suits to learners with various listening comprehension but also offers plenty of video context to improve learners’ listening comprehension. Moreover, the learning system would classify the videos into clips with different levels of difficulty automatically according to SVM algorithm. Besides, we conducted an experiment to evaluate the effects for using features based on various attributes, such as subtitle, audio, both subtitle and audio. The study validated that which feature suit and help learners most on listening comprehension, as well as the differences between them. We made use of the questionnaires of Technology Acceptance Model and System Quality to evaluate the learners’ satisfaction toward the learning system after using it. The results of the experiment are shown as follows. First, the high and medium level learners suit to the video clips classified according to features of both subtitle and audio most. However, the low level learners suit to the video clips classified according to features of subtitle most. Second, the video clips classified according to features of both subtitle and audio is contributive to facilitate learning comprehension. Third, the learners show positive on usability, easy-to-use, as well as quality of the system.

    摘要 I Abstract II 誌謝 III 目錄 IV 圖目錄 VI 第一章 緒論 1 1.1 研究背景 1 1.2 研究動機 2 1.3 研究目的 2 1.4 研究限制 3 1.5 論文架構 3 第二章 文獻探討 4 2.1 聽力理解 4 2.2 影響聽力理解的因素 5 2.3 多媒體學習對於聽力理解的影響 6 2.4 影片自動化分類 9 2.5 支持向量機 10 2.6 文獻探討總結 11 第三章 系統實作 12 3.1 系統架構 12 3.2 影片匯入處理 13 3.2.1 影片場景偵測模組 14 3.2.2 聲音分類SVM模組 16 3.2.3 發音縮減特徵SVM模組 18 3.2.4 說話清晰度計算模組 20 3.2.5 單字原型化處理模組 20 3.3 場景難易度SVM分群 20 3.3.1 聲音特徵 21 3.3.2 字幕特徵 23 3.3.3 特徵擷取小結 23 3.3.4 場景難易度SVM模組 24 3.3.5 影片分類處理程式介面 25 3.4 場景檢索 27 3.4.1 系統使用流程 27 3.4.2 系統操作介面 27 第四章 實驗設計 30 4.1 實驗背景及目的 30 4.2 實驗流程 31 4.3 研究工具 32 4.3.1 聽力測驗試題 32 4.3.2 混淆字理解試題 33 4.3.3 問卷調查 33 4.3.4 系統使用記錄 34 第五章 結果與討論 36 5.1 系統使用記錄分析 36 5.2 混淆字理解試題結果分析 39 5.2.1 組內前後測成績 39 5.2.2 組間前後測成績比較 41 5.3 問卷調查結果分析 43 5.3.1 系統評量問卷的效度分析 43 5.3.2 系統評量問卷的相關分析 43 5.3.3 開放式問卷 46 5.4 討論 47 5.4.1 英語影片難度自動化分類 47 5.4.2 英語影片學習系統 48 第六章 結論與未來工作 49 6.1 研究結論 49 6.2 研究貢獻 50 6.3 未來工作 51 參考文獻 52 中文部分 52 英文部分 53 附錄一 英語學習背景問卷調查 57 附錄二 整體系統評量問卷 59

    中文部分
    王小川(2004)。語音訊號處理。台北市:全華科技圖書股份有限公司。
    方秋雅、周若茵(2009)。英語混淆字簡單到不行。台灣:捷徑文化。
    許永昌(2005)。建置一個影片式語料庫檢索系統提升大一新生口語體的理解能力。國立中央大學網路學習科技研究所碩士論文。
    莊孟淳(2008)。以學生與教師觀點探討台灣國中生英語聽力困難之研究。國立政治大學英語教學研究所碩士論文
    郭協理(1995)。英文單字片語正確用法。台北市:滿庭芳。
    羅家珍(2004)。DVD電影教學對大一學生聽力學習的成效探討。國立清華大學外國語文學系碩士論文。
    趙御筌 (2005)。說出好聽力。台灣:笛藤出版圖書有限公司。
    蔡其澂(2008)。開發場景導向之影片式語料庫檢索系統輔助英語口語體理解。國立中央大學網路學習科技研究所碩士論文。
    鍾靜怡(2008)。建置影片式英語口語發音縮減特徵學習系統輔助英文聽力學習。國立中央大學網路學習科技研究所碩士論文。
    藍啟恆(2009)。棒球比賽精華片段擷取分類系統。國立中央大學資訊工程學系研究所碩士論文。

    英文部分
    Arbogast, B. (2001). TOEIC official test-preparation guide (2nd ed.) Peterson''s.
    Arcario, P. (1992). Video in Second Language Teaching: Using, Selecting and Producing Video for the Classroom. Alexandria, VA: TESOL.
    Braddock, B. (1996). Using films in the English class. Hemel Hempstead: Phoenix ELT.
    Bolle, R.M., Yeo, B.L., & Yeung, M.M. (1998). Video query: Beyond the keywords. IBM Journal of Research and Development, 42(2), 253-286.
    Brown, G., & Yule, G. (1983). Teaching the spoken language . Cambridge: Cambridge University Press.
    Brown, H. D. (2001). Teaching by principles: An interactive approach to language pedagogy (2nd ed.). New York: Pearson Education.
    C.-C. Chang and C.-J. Lin. LIBSVM: a library for support vector machines,
    2001a. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm. machines.
    Celce-Murcia, M., Brinton, D.M., & Goodwin, J.M. (1996). Teaching pronunciation: A reference for teachers of English to speakers of other languages. New York: Cambridge University Press.
    Chiang, H.L. (1996). Students introducing their favorite English movies. The 12th Conference on English Teaching and Learning in Republic of China, 154-176.
    Clark, H. H., & Clark, E. V. (1977). Psychology and language : An introduction to psycholinguistics. New York: Harcourt Brace Jovanovich.
    Cooper, M. (2004). Video segmentation combining similarity analysis and classification. Proceedings of the 12th annual ACM international conference on Multimedia.
    Ekin, A., Tekalp, A.M., & Mehrotra, R. (2003). Automatic Soccer Video Analysis and Summarization. IEEE Trans. on Image Processing, 12(7), 796-807.
    Fan, Y. ( 1993 ). Listening: Problems and solutions. English Teaching Forum, 31 ( 1 ), 16-19.
    Field, J. (2003). Promoting perception: lexical segmentation in L2 listening. ELT Jounral, 57(4), 325-334.
    Goh, C. (1999). How much do learners know about the factors that influence their listening comprehension? Hong Kong Journal of Applied Linguistics, 4(1), 17-40.
    Gee, J.P. (2004). Situated language and learning: A Critique of traditional schooling. London: Routledge.
    Gardner, R.C. (1985). Social psychology and second language learning: The role of attitudes and motivation. London, Ontario: Edward Arnold
    Henrichsen, L. E. (1984). Sandhi-variation: A filter of input for learners of ESL. Language Learning, 34(3), 103-126.
    Ito, Y. ( 2001 ). Effect of reduced forms on ESL learners’ input-intake process. Second Language Studies, 20 ( 1 ), 99-124.
    Kender, J.R., & Yeo, B.L. (1998). Video Scene Segmentation Via Continuous Video Coherence. IEEE Computer Society Conf. Computer Vision and Pattern Recognition, 367-373.
    King, J. (2002). Using DVD Feature Films in the EFL Classroom. Computer Assisted Language Learning, 15(5), 509-523.
    Kuo, C.H., Wible, D., Chen, M.C., Tsao, N.L., & Chou, T.C. (2004). On the Design of Web-based Interactive Multimedia Contents for English Learning. Proceedings of the IEEE International Conference on Advanced Learning Technologies, 420-424.
    L. Lu, S. Z. Li, and H. J. Zhang, Content-based audio segmentation using support vector machines. IEEE Int. Conf. Multimedia and Expo (ICME), Aug. 2001, pp. 956–959.
    Lehane, B., O’Connor, N., & Murphy, N. (2004). Dialogue scene detection in movies using low and mid-level visual features. International Workshop on Image, Video, and Audio Retrieval and Mining.
    Liu, Z., Wang, Y., & Chen, T. (1998). Audio feature extraction and analysis for scene segmentation and classification. The Journal of VLSI Signal Processing, 20(1), 61-79.
    McEnery, & G. Knowle (Eds.). Teaching and Language Corpora. London: Logman. 131-145.
    Mayer, R. E., & Moreno, R.(2003). Nine ways to reduce cognitive load in multimedia learning. Educational psychologist, 38(1), 43-52.
    Moreno,R.,&Mayer,R.E.(2000).A coherence effect in multimedia learning :the case for minimizing irrelevant sounds in the design of multimedia instructional messages. Journal of Educational Psychology,92(1),117-125.
    O’Hare, N., Smeaton, A.F., Czirjek, C., O’Connor, N., & Murphy, N. (2004). A generic news story segmentation system and its evaluation. IEEE International Conference on Acoustics, Speech, and Signal Processing.
    O’Malley, M., Chamot, A.U. & Kupper, L. (1989). Listening comprehension strategies in second language acquisition. Applied Linguistics, 10(1): 418-437.
    Ryan, S., & Francais, A. (1998). Using films to develop learner motivation. The Internet TESL Journal, 4(11).
    Rasheed, Z., & Shah, M. (2005). Detection and Representation of Scenes in Videos. IEEE Transactions on Multimedia, 7(6), 1097-1105.
    Rui, Y., Huang, T.S., & Mehrotra, S. (1998). Exploring video structure beyond the shots. Proc. IEEE Conf. Multimedia Computing and Syst.
    Sadlier, D.A., O’Connor, N., Marlow, S., & Murphy, N. (2003). A Combined Audio-Visual Contribution to Event Detection in Field Sports Broadcast Video. Case Study: Gaelic Football. in Proc. IEEE ISSPIT Darmstadt, 552-555.
    Satoh, S., Nakamura, Y., & Kanade, T. (1999). Name-it: Naming and detecting faces in news videos. IEEE Multimedia, 6(1), 22-35.
    Sebastian, S.F. (2003). Towards a Visual Lexicon: The Creation of a Corpus Linguistic Database Using Digital Movie Data. The journal of Nagasaki University of Foreign Studies, 5, 83-91.
    Shinoda, K., Ishihara, K., Furui, S., & Mochizuki, T. (2008). Automatic Score Scene Detection for Baseball Video. Lecture Notes in Computer Science, 4938, 226-240.
    Sundaram, H., & Chang, S.F. (2000). Determining computable scenes in films and their structures using audio-visual memory models. ACM Multimedia.
    Secules, T., Herron, C., & Tomasello, M. (1992). The Effect of Video Context on Foreign Language Learning. The Modern Language Journal, 76(4), 480-490.
    Tschirner, E. (2001). Language Acquisition in the Classroom: The Role of Digital Video. Computer Assisted Language Learning, 14(3-4), 305-319.
    The CMU phoneme set. Retrieved 6/25, 2008, from http://www.speech.cs.cmu.edu/cgi-bin/cmudict#phones
    The CMU pronouncing dictionary. Retrieved 6/25, 2008, from http://www.speech.cs.cmu.edu/cgi-bin/cmudict
    Wu, Y. C., Lee, Y. S., &Yang, J. C. (2008). Robust and efficient multiclass SVM models for phrase pattern recognition. Pattern Recognition .
    Weyers, J.R. (1999). The effect of authentic video on communicative competence. The Modern Language Journal, 83(3), 339-349.
    Xiong, Z., Zhou, X.S., Tian, Q, Rui, Y., & Huang, T.S. (2006). Semantic Retrieval of Video. IEEE Signal Processing Magazine, 23(2), 18-27.
    Yeung, M.M., Yeo, B.L., Wolf, W., & Liu, B. (1995). Video Browsing using Clustering and Scene Transitions on Compressed Sequences. Multimedia Computing and Networking.
    Yeung, M.M., Yeo, B.L., & Liu, B. (1998). Segmentation of Video by Clustering and Graph Analysis. Computer Vision and Image Understanding, 7(1), 94-109.
    Yamaguchi, C. (2002). Towards international English in EFL classrooms in Japan. The Internet TESL Journal, Vol. VIII, No. 1, January 2002. Retrieved November 15, 2005, from http://iteslj.org/Articles/Yamaguchi-Language.htm
    Zhu, W., Toklu, C., & Liou, S.P. (2001). Automatic News Video Segmentation and Categorization Based on Closed-Captioned Text. International Conference on Multimedia and Expo.

    QR CODE
    :::