語音辨識系統之研究及其對同步電腦教室學習｜國立中央大學博碩士論文系統

簡易檢索 / 詳目顯示

回結果列表

研究生：	羅斯坦 Rustam Shadiev
論文名稱：	語音辨識系統之研究及其對同步電腦教室學習 A study of Speech to Text Recognition and its Effects on Learning Performance in Synchronous Cyber Classrooms
指導教授：	黃武元 Wu-Yuin Hwang 陳年興 Nian-Shing Chen
口試委員:
學位類別：	博士 Doctor
系所名稱：	資訊電機學院 - 網路學習科技研究所 Graduate Institute of Network Learning Technology
畢業學年度：	100
語文別：	英文
論文頁數：	92
中文關鍵詞：	語音辨識、書寫報告、團體討論、個別口頭報告、單向授課、同步學習
外文關鍵詞：	Writing essay, Group discussion, Individual oral presentation, One-way lecture, Speech to text recognition, Synchronous learning
相關次數：	點閱：13 下載：0
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

語音辨識系統是一種將口說演講同步轉換為文字的科技，值得探討此系統對線上學習活動的影響。本篇論文的主要目標為探討語音辨識系統如何改善線上同步電腦教室的教學和學習，並設計兩個實驗探討此系統對學生學習的影響：第一個實驗探討語音辨識系統對單向授課的影響；第二個實驗探討此系統對獨立口頭簡報和群組討論的影響。學生使用語音辨識系統的感知和行為傾向，也是這兩個實驗研究的重心。
進行第一個實驗後，實驗組學生認為語音辨識系統容易使用，並有助於完成單向授課及其附帶的家庭作業。大多數學生也表達他們有高度意願在未來將此系統作為學習工具。統計結果顯示，實驗組與控制組相較，在完成作業上有中等程度的進步。然而，某次實驗組的學生對於語音辨識系統產出的內文十分熟悉，並將此系統作為學習工具，他們在前測的表現顯著地超越控制組。受訪學生並表示語音辨識系統產出的內容對單向授課的溝通十分有益，效果如同學習期間和單向授課之後的學習。
第二個實驗的結果顯示，若比較實驗組和控制組在寫作兩個主題短文的表現，實驗組在中測和後測的表現更為優異。大多數學生認為語音辨識系統對獨立簡報和短文寫作具有助益，同時表示未來願意使用此系統學習。然而在此系統發現文字辨識正確率低和未同步呈現口說內容的學生，不認為語音辨識系統對群組討論有所裨益。同時本篇論文的研究結果顯示，語音辨識系統對學生在同步電腦教室中口頭簡報和群組討論的學習表現，有全面的提升。
本篇論文的主要貢獻為透過同步電腦教室的教學活動，使學生的學習表現和溝通能力有所進步。本篇論文同時顯示語音辨識系統對於促進學習效果、訓練策略和發展在同步電腦教室進行課中和課後學習的潛力。除此之外，本篇論文並提供未來的深入研究一些新方向：其一為在其他學習環境(例如傳統課室)運用語音辨識系統學習的潛力；另一方向為深入發展此系統的辨識技術(例如讓電腦辨識文字)，以輔助學習和教學。

Speech to text recognition (STR) is a technology to translate natural language speech into text in real time. It is worth to apply STR for online learning activities and to investigate its influence on learning. The aim of this study was to apply STR in an effort to improve teaching and learning performances in an online synchronous cyber classroom environment. Two experiments were conducted to investigate the effectiveness of applying STR on learning performance of students; STR was applied for one-way lectures during the first experiment and for individual oral presentations and group discussions during the second experiment. Students’ perceptions and their behavioral intentions toward using STR were also investigated in both experiments.
Statistical results of the first experiment showed moderate improvement in the experimental groups’ performance over the control group on homework accomplishments. However, once the students in the experimental group became familiar with the STR-generated texts and used them as learning tools, they significantly outperformed the control group students in post-test results. Interviews with participating students revealed that STR-generated texts were beneficial to communication during one-way lectures as well as to learning during and after one-way lectures. After the experiment, students from the experimental group perceived that the STR mechanism was easy to use and useful during one-way lectures as well as for homework accomplishment. Most students also expressed that they were highly motivated to use STR as a learning tool in the future. Statistical results of the second experiment revealed students of the experimental group performed significantly better compared to the control group students in two sessions of writing essays, intermediate test and post-test. Most of students perceived that STR was useful for individual presentations and for writing essays. Students also expressed they are willing to use the STR for learning in the future. However, the students who obtained transcripts with low accuracy rate and experienced delay in STR text generation did not perceive the STR as easy to use and useful for group discussions. The results of this study showed that the STR is beneficial to one-way lectures, students’ oral presentations and group discussions in a synchronous cyber classroom so as to improve their overall learning performance.
The main contribution of this study is that students’ learning performance and the quality of communication during teaching and learning activities in synchronous cyber classrooms were improved. This study demonstrated the effectiveness of using STR-texts to facilitate learning. The strategies for STR training and its potential applications during and after teaching and learning activities in synchronous cyber classrooms were developed. In addition, this study contributed new research directions for further study: one direction is to study a potential of STR application for other learning environments (e.g., traditional classroom), and another direction is to research the extension of STR with similar technologies (e.g., Text to Speech Recognition) in order to support learning.

Abstract in Chinese i
Abstract in English ii
Acknowledgement iv
Table of Contents v
List of Tables viii
List of Figures ix
Chapter 1. Introduction 1
1.1 Background 1
1.2 Motivation 2
1.3 Objectives and research questions 3
1.4 Structure of the Dissertation 4
Chapter 2. Literature Review 6
2.1 Related theory 6
2.2 Online synchronous teaching and learning 10
2.3 Teaching and learning activities in synchronous cyber classrooms 12
2.4 Technology of STR 14
2.5 STR for education 16
Chapter 3. Methodology 19
3.1 First experiment: the STR application for one-way lectures 21
3.1.1 Participants and experimental procedures 21
3.1.2 Experimental Design 23
3.1.3 Teaching and Learning Activities Design 23
3.1.4 Research tools 25
3.1.5 Statistical Analysis Methods 31
3.2 Second experiment: the STR application for individual oral presentations and group discussions 32
3.2.1 Participants and experimental procedures 32
3.2.2 Experimental Design 34
3.2.3 Teaching and Learning Activities Design 34
3.2.4 Research tools 36
3.2.5 Statistical Analysis Methods 39
Chapter 4. Results and Discussions 40
4.1 First experiment: the STR for one-way lectures 40
4.1.1 Do the students who use STR-generated texts perform significantly different in accomplishing homework tasks and in post-test evaluations than the students who do not use STR technology? 40
4.1.2 Are STR-generated texts beneficial to students’ learning during one-way lectures and accomplishing homework? 43
4.1.3 What are the students’ perceptions and behavioral intentions regarding the use of the STR technology during one-way lectures in a synchronous cyber classroom and accomplishing homework? 45
4.2 Second experiment: the STR for individual oral presentations and group discussions 49
4.2.1 Do the students who use STR-generated texts perform significantly different in writing essays, intermediate test and post-test than the students who do not use STR technology? 49
4.2.2 Are STR-generated texts beneficial to students’ learning during group learning activities and accomplishing homework? 51
4.2.3 What are students’ perceptions and behavioral intentions regarding the use of the STR technology during group learning activities in a synchronous cyber classroom and accomplishing homework? 54
4.3 Technical and Pedagogical Implications using STR 59
Chapter 5. Conclusions and further study 64
5.1 Conclusions 64
5.2 Future Work 66
References 68
Appendixes 75
Appendix 1. Timeline of the course and topics of the lectures for the first experiment 75
Appendix 2. Pretest items 76
Appendix 3.Intermediate test items 80
Appendix 4. Posttest items 84
Appendix 5. Timeline of the course and topics of the lectures for the second experiment 87
Appendix 6. List of topics for individual presentations 88
Appendix 7. List of topics for group discussion 89
Curriculum Vitae 90
List of publications 91
Journal papers 91
Conference papers 92

                                

Anderson, T. (2008). Teaching in an Online Learning Context . In T. Anderson (Eds), Theory and Practice of Online Learning (pp. 343-366). Edmonton: Athabasca University.
Anderson, R., Beavers, J., VanDeGrift, T., & Videon, F. (2003). Videoconferencing and presentation support for synchronous distance learning. In Proceeding of the ASEE/IEEE Frontiers in Education Conference, 13-18.
Atkinson, R.L. & Shiffrin, R.M. (1968). Human memory: a proposed system and its control processes. In K.W. Spence and J.T. Spence (Eds). The Psychology of Learning and Motivation: Advances in Research and Theory. New York: Academic Press.
Aylesworth, S. (2005). A Guide to Speech-to-Text Services in the Postsecondary Environment. St.Paul, MN: Postsecondary Education Programs Network.
Boulos, M.N.K., Taylor, A.D., & Breton, A. (2005). A Synchronous Communication Experiment within an Online Distance Learning Program: A Case Study. Telemedicine and e-Health, 11(5), 583-593.
Burkes, K.M.E. (2007). Applying cognitive load theory to the design of online learning. Dissertation Prepared for the Degree of Doctor of Philosophy, University of North Texas.
Chen, N.S., & Ko, L. (2010). An online synchronous test for professional interpreters. Educational Technology & Society, 13(2), 153-165.
Chen, N.S., & Wang, Y. (2008). Testing Principles of Language Learning in a Cyber Face-to-Face Environment. Educational Technology & Society, 11 (3), 97-113.
Chen, N.S., Ko, H.C., Kinshuk, & Lin, T. (2005). A model for synchronous learning using the Internet. Innovations in Education and Teaching International, 42(2), 181–194.
Cohen, J. (1992). A power primer. Psychological Bulletin, 112(1), 155–159.
Collaborative Cyber Community (3C). Retrieved March 7, 2011, from http://ccc.k12.edu.tw/
Cooper, G. (1998). Research into Cognitive Load Theory and Instructional Design at UNSW, University of New South Wales, Australia.
Cooper, H. (2007). The battle over homework: Common ground for administrators, teachers, and parents (3rd ed.). Thousand Oaks, CA: Corwin Press.
Creswell, J.W. (2008). Educational research: planning, conducting, and evaluating quantitative and qualitative research (2nd ed.) Upper Saddle River, N.J.: Merrill.
Daft, R.L. & Lengel, R.H. (1986). Organizational information requirements, media richness and structural design. Management Science 32(5), 554-571.
Davis, F. D. (1989). Perceived usefulness, perceived ease of use, and user acceptance of information technology. MIS Quarterly, 13, 319–340.
Donegan, M. (2000) Voice Recognition Technology in Education: Factors for Success Ace Centre Oxford.
Fiscus, J. G., Ajot, J. and Garofolo, J. S. (2007). The Rich Transcription 2007 meeting recognition evaluation. Lecture Notes in Computer Science, 4625, 373–389
Galbraith, J. (1977). Organization Design. Reading, MA: Addison-Wesley.
Garrison, D. R., & Shale, D. (1987). Mapping the boundaries of distance education: Problems in defining the field. The American Journal of Distance Education, 1(1), 7-13.
Hartley, J. (1998). Learning and Studying: A Research Perspective. London: Routledge.
Hartman, J.D. (1989). Writing To Learn And Communicate In A Data Structures Course. ACM, 2, 32-36.
Hastie, M., Chen, N.S., & Kuo, Y.H. (2007). Instructional design for best practice in the synchronous cyber classroom. Educational Technology & Society, 10 (4), 281-294.
Hastie, M., Hung, I-Ch., Chen, N.S. & Kinshuk (2010). A blended synchronous learning model for educational international collaboration. Innovations in Education and Teaching International, 47(1), 9–24.
Huang, H.M. (2002). Toward constructivism for adult learners in online learning environments. British Journal of Educational Technology, 33 (1), 27-37.
Hyder, K., Kwinn, A., Miazga, R., & Murray, M. (2007). The eLearning Guild’s Handbook on Synchronous e-Learning. Santa Rosa, CA: The eLearning Guild.
IMS Global Learning Consortium. Guidelines for Developing Accessible Synchronous Communication and Collaboration Tools. Retrieved March 4, 2011, from http://www.imsglobal.org/accessibility/accessiblevers/sec7.html
JoinNet. Retrieved March 7, 2011, from http://www.homemeeting.com/en_US/products_joinnet.html
Kanevsky, D., Basson, S., Chen, S., Faisman, A., Zlatsin, A., Conrod, S., & McCormick, A. (2006). Speech Transcription Services. SPECOM''2006, St. Petersburg, 37-43.
Karat, C.M., Halverson, C., Horn, D., & Karat, J. (1999). Patterns of Entry and Correction in Large Vocabulary Continuous Speech Recognition Systems. In Proceeding of CHI conference, 568-575.
Keegan, D. J. (1980). On defining distance education. Distance Education, 1, 13-36.
Kheir, R., & Way, T. (2006). Improving Speech Recognition to Assist Real-time Classroom Note Taking. In Proceeding of the Rehabilitation Engineering and Assistive Technology Society of North America Conference, 1-4.
Kiewra, K.A. (1985). Investigating notetaking and review: a depth of processing alternative. Educational Psychologist, 20, 23–32.
Kruger, H.A., & Kearney, W.D. (2006). A prototype for assessing information security awareness. Computers & Security, 25, 289-296.
Lohr, L.L., & Gall, J.E. (2008). Representation Strategies. In J.M. Spector, M.D. Merrill, J.van Merrienboer, and M.P. Driscoll. Handbook of Research on Educational Communications and Technology, 3rd ed. pp. 85-96. New York: Lawrence Erlbaum Associates.
Mayer, R. E., & Moreno, R. (2003). Nine ways to reduce cognitive load in multimedia learning. Educational Psychologist, 38, 43-52.
McIsaac, M.S. & Gunawardena, C.N. (1996). Distance Education. In D.H. Jonassen (Eds), Handbook of research for educational communications and technology: a project of the Association for Educational Communications and Technology (pp.403-437). New York: Simon & Schuster Macmillan.
Miller, G.A. (1956). The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychological Review, 63, 81-97.
Net4Voice. Retrieved March 10, 2011 from https://www.net4voice.eu/net4voice/Deliverables/Forms/Deliverables.aspx
Nilson, L.B. (2010). Teaching at Its Best: A Research-Based Resource for College Instructors. 3rd Ed., San-Francisco, CA: John Wiley and Sons Inc.
Nisbet, P.D. & Wilson, A. (2002). Introducing Speech Recognition in Schools: using IBM ViaVoice. Edinburgh: CALL Centre.
O’Harea, E.A., & McTear, M.F. (1999). Speech recognition in the secondary school classroom: an exploratory study. Computers & Education, 33(1), 27-45.
O''Shaughnessy, D. (2008). Automatic speech recognition: History, methods and challenges. Pattern Recognition, 41, 2965-2979.
Park, Y.J., & Bonk, C.J. (2007). Synchronous Learning Experiences: Distance and Residential Learners’ Perspectives in a Blended Graduate Course. Journal of Interactive Online Learning, 6 (3), 245-264.
Petta, T.D. & Woloshyn, V.E. (2001) Voice Recognition for On-line Literacy: Continuous Voice Recognition Technology in Adult Literacy Training. Education and Information Technologies, 6(4), 225–240.
Power, T.J., Dombrowski, S.C., Mautone, J. & Watkins, M. (2007). Assessing Children''s Homework Performance: Development of a Multi-Dimensional, Multi-Informant Rating System. Journal of School Psychology, 45(3), 333-348.
Pridmore, J.L., Bradley, R.V., & Mehta, N. (2010). Methods of Instruction and Learning Outcomes: A Theoretical Analysis of Two Approaches in an Introductory Information Technology Course. Decision Sciences Journal of Innovative Education, 8(2), 289-311.
Pullen, J. M., & McAndrews, P.M. (2005). Low-Cost Internet Synchronous Distance Education Using Open-Source Software. Computers in Education, 15(4), 64-71.
Punch, K.F. (2009). Introduction to research methods in education. London: SAGE.
Rowe, N.C. (2004). Cheating in Online Student Assessment: Beyond Plagiarism. Online Journal of Distance Learning Administration, 7(2). Retrieved May 15, 2011 from: http://www.westga.edu/~distance/ojdla/summer72/rowe72.html
Ryba, K., McIvor, T., Shakir, M. & Paez, D. (2006). Liberated Learning: Analysis of University Students’ Perceptions and Experiences with Continuous Automated Speech Recognition. Journal of Instructional Science and Technology, 9(1), 1-19.
Shadiev, R. (2000). Establishing of Distance Education in Uzbekistan. Journal of Pedagogical Education, 3, 47-52.
Shadiev, R. (2002). Developing of the distance form of education in higher institutions of the Republic of Uzbekistan. Ziyokor, 4, 73-79.
Sim, G., Holifield, P., & Brown, M. (2004). Implementation of computer assisted assessment: lessons from the literature. ALT-J, Research in Learning Technology, 12(3), 215-229.
Smith, P.L. & Ragan, T.J. (2004). Instructional design (3rd ed.). New Jersey: John Wiley & Sons, Inc.
Speech Recognition in Schools. Retrieved March 7, 2011 from http://callcentre.education.ed.ac.uk/Research/Speech_Recog_PRA/speech_recog_pra.html
Swanson, D.B., Norman, G.R. & Linn, R.L. (1995). Performance-Based Assessment: Lessons from the Health Professions. Educational Researcher 24(5), 5-11, 35.
Sweller, J., Van Merrienboer, J. J. G., & Paas, F. (1998). Cognitive architecture and instructional design. Educational Psychology Review, 10(3), 251-296.
Thompson, D. J. (1996). Audioteleconferencing: Myths and realities. Open Learning, 11(2), 20-27.
Van Merriënboer, J.J.G. & Sweller, J. (2005). Cognitive load theory and complex learning: Recent developments and future directions. Educational Psychology Review, 17(2), 147-177
Venkatesh, V., & Bala, H. (2008). Technology Acceptance Model 3 and a Research Agenda on Interventions. Decision Sciences, 39(2), 273-315.
Wald, M. (2010). Synote: Accessible and Assistive Technology Enhancing Learning for All Students. In K. Miesenberger et al. (Eds.), ICCHP 2010, LNCS 6180 (pp. 177–184). Berlin: Springer-Verlag.
Wald, M. & Bain, K. (2008). Universal access to communication and learning: the role of automatic speech recognition. International Journal Universal Access in the Information Society, 6(4), 435-447.
Wang, Y., Chen, N. S., & Levy, M. (2010a). The design and implementation of a holistic training model for language teacher education in a cyber face-to-face learning environment. Computers and Education, 55(2), 777-788.
Wang, Y., Chen, N. S., & Levy, M. (2010b). Teacher training in a synchronous cyber face-to-face classroom: Characterizing and supporting the online teachers’ learning process. Computer Assisted Language Learning, 23(4), 277-293.
Way, T., Kheir, R., & Bevilacqua, L. (2008). Achieving Acceptable Accuracy in a Low-Cost, Assistive Note-Taking, Speech Transcription System. In Proceedings of the IASTED International Conference on Telehealth and Assistive Technologies, ACTA Press, 72-77.
Weinberger, A. & Fischer, F. (2005). A framework to analyze argumentative knowledge construction in computer-supported collaborative learning. Computers & Education, 46(1), 71–95
Zdeslav, H., Dean, Z., & Anjay, R. N. (2007). Comparing students’ and experts’ understanding of the content of a lecture. Journal of Science Education & Technology, 16(3), 213–224.
Zschorn, A., Littlefield, J.S., Broughton, M., Dwyer, B., & Hashemi-Sakhtsari, A. (2003). Transcription of Multiple Speakers Using Speaker Dependent Speech Recognition. DSTO Technical Report, DSTO_TR_1498.

簡易檢索 / 詳目顯示

相關論文