| 研究生: |
張右新 Yu-Hsin Chang |
|---|---|
| 論文名稱: |
使用卷積神經網路之眼寫符號圖像辨識法 Pictorial Method with Convolutional Neural Network for Eye-writing Symbol Recognition |
| 指導教授: |
蔡章仁
Jang-Zern Tsai |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 電機工程學系 Department of Electrical Engineering |
| 論文出版年: | 2019 |
| 畢業學年度: | 107 |
| 語文別: | 中文 |
| 論文頁數: | 83 |
| 中文關鍵詞: | 眼電圖 、卷積神經網路 、眼動訊號眨眼偵測 、眼寫系統 |
| 外文關鍵詞: | Electro-oculography(EOG), Convolutional Neural Network (CNN), EOG blinking detection, Eye-writing |
| 相關次數: | 點閱:7 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
Amyotrophic lateral sclerosis (ALS)是一種神經病變疾病,也被稱作Lou Gehrig‘s disease這種疾病特徵是會使腦中的運動神經不斷退化,也就是我們俗稱的漸凍人。患有ALS的病人四肢及軀幹的肌肉會逐漸地癱瘓麻痺甚至無力,同時也會慢慢喪失講話的功能,因此嚴重者會無法使用他們的肢體與口語溝通的能力。
本研究的宗旨就是希望可以利用眼電圖法(ElectrooculoGraphy , EOG)做為眼動訊號的偵測來建構一套”眼寫系統”,並利用CNN網路來提高其辨識率。系統主要可以分成硬體和軟體兩大部分,而軟體的部分主要又可以分成:校準、雙眨眼偵測、符號分類三大功能,首先利用EOG訊號擬合出一個用來重建眼睛移動軌跡的轉移函數,有了這個轉移函數我們可以解決在時域上眼寫時符號的歪寫和比例大小問題,再者透過以訓練好的雙眨眼神經網路在時域上來進行雙眨眼偵測做為眼寫的開始,把截取之眼寫訊號轉換成29 29 pixel的二維圖像並強化圖像特徵,最後以卷積神經網路(CNN)來分類26個英文大寫字母、10個數字和4個命令符號(刪除、空白、換行、結束)。本論文利用CNN之圖像法來分類雙眨眼和字符大大的提高了其個人書寫的辨識率平均達到95%以上。
Amyotrophic lateral sclerosis (ALS) is a neuropathic disease. This disease is characterized by gradual degeneration of motor nerves in the brain, thus patients with ALS will experience progressive numbness and even weakness in the muscles of the extremities and trunks, and will slowly lose their speaking ability. Therefore, severe patients will not be able to move their body and communicate with others.
The purpose of this study is to use the electro-oculogram method in an "eye-writing system" and use the CNN (convolutional neural network) network to improve its recognition rate. The main algorithm can be divided into three parts: calibration, double-blink detection, and symbol classification.
First, the EOG signal at nine reference points is fitted to a transfer function to reconstruct the eye movement plane. With this transfer function, we can solve the problem of skewing and unequal scaling of the symbols in the time domain. Secondly, the trained CNN will detect double blinking from the EOG signal in the time domain to mark the beginning of the eye-writing of a symbol. Thirdly, the intercepted eye writing signal is converted into a 29 × 29 pixel two-dimensional image and the image features are enhanced. Finally, the CNN is used to classify 26 English letters,10 numbers and 4 command symbols (delete, blank, line feed, end).
The CNN used in this study to classify double blinking and characters greatly improves the recognition of eye-writing symbols, with an average recognition rate over 95%..
[1] D. Hirtz, D. Thurman, K. Gwinn-Hardy, M. Mohamed, A. Chaudhuri, and R. J. N. Zalutsky, "How common are the “common” neurologic disorders?," vol. 68, no. 5, pp. 326-337, 2007.
[2] T. Yagi, Y. Kuno, K. Koga, and T. Mukai, "Drifting and blinking compensation in electro-oculography (EOG) eye-gaze interface," in Systems, Man and Cybernetics, 2006. SMC'06. IEEE International Conference on, 2006, vol. 4, pp. 3222-3226: IEEE.
[3] M. Y. Hesna Özbek Ülkütaş, "Computer based eye-writing system by using EOG," Bodrum, Turkey, 15-18 Oct. 2015.
[4] A. Lopez, F. Ferrero, D. Yanguela, C. Alvarez, and O. Postolache, "Development of a Computer Writing System Based on EOG," Sensors (Basel), vol. 17, no. 7, Jun 26 2017.
[5] K.-R. Lee, W.-D. Chang, S. Kim, and C.-H. Im, "Real-time “eye-writing” recognition using electrooculogram," IEEE Transactions on Neural Systems Rehabilitation Engineering, vol. 25, no. 1, pp. 37-48, 2017.
[6] M. Merino, O. Rivera, I. Gómez, A. Molina, and E. Dorronzoro, "A method of EOG signal processing to detect the direction of eye movements," in Sensor Device Technologies and Applications (SENSORDEVICES), 2010 First International Conference on, 2010, pp. 100-105: IEEE.
[7] 儀表放大器. https://zh.wikipedia.org/wiki/%E5%84%80%E8%A1%A8%E6%94%BE%E5%A4%A7%E5%99%A8.
[8] T. INSTRUMENTS. (2019). INA12x Precision, Low-Power Instrumentation Amplifiers,http://www.ti.com/lit/ds/symlink/ina128.pdf.
[9] L. H. Goldberg. (2012-02-01). Arduino 的類比功能:如何運用在新設計中,https://www.digikey.tw/zh/articles/techzone/2012/feb/arduinos-analog-functions-how-to-use-them-in-your-next-design.
[10] MathWorks. https://www.mathworks.com/products/matlab.html.
[11] 張嘉安. (2006,01,01). 眼寫鍵盤和眼寫滑鼠 Eye-writing Keyboard and Eye-writing Mouse.
[12] T. Yagi, Y. Kuno, K. Koga, and T. Mukai, "Drifting and blinking compensation in electro-oculography (EOG) eye-gaze interface," in 2006 IEEE International Conference on Systems, Man and Cybernetics, 2006, vol. 4, pp. 3222-3226: IEEE.
[13] L. J. E. Lindstrom, "Muscular fatigue and action potential conduction velocity changes studied with frequency analysis of EMG signals," vol. 10, no. 4, pp. 341-356, 1970.
[14] F. Simini, A. Touya, A. Senatore, and J. Pereira, "Gaze tracker by electrooculography (EOG) on a head-band," in 2011 10th International Workshop on Biomedical Engineering, 2011, pp. 1-4: IEEE.
[15] X. Kong and G. F. Wilson, "A new EOG-based eyeblink detection algorithm," Behavior Research Methods, Instruments, & Computers, vol. 30, no. 4, pp. 713-719, 1998.
[16] M. S. Reddy, B. Narasimha, E. Suresh, and K. S. Rao, "Analysis of EOG signals using wavelet transform for detecting eye blinks," in 2010 International Conference on Wireless Communications & Signal Processing (WCSP), 2010, pp. 1-4: IEEE.
[17] MathWorks, "Deep Learning,https://www.mathworks.com/solutions/deep-learning/convolutional-neural-network.html."
[18] 台大電機系李宏毅老師. (2016). ML Lecture 6: Introduction of Deep Learning,http://violin-tao.blogspot.com/2017/07/ml-introduction-of-deep-learning.html.
[19] S. Ioffe and C. Szegedy, "Batch normalization: Accelerating deep network training by reducing internal covariate shift," arXiv preprint arXiv:1502.03167, 2015.
[20] 台大電機系李宏毅老師. (2017). Batch Normalization,https://www.youtube.com/watch?v=BZh1ltr5Rkg.