| 研究生: |
謝秉寰 Ping-Huan Hsieh |
|---|---|
| 論文名稱: |
可見式語音診斷與復健系統 Visible Speech Diagnostic and Rehabilitation System |
| 指導教授: |
吳炤民
Chao-Min Wu |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 電機工程學系 Department of Electrical Engineering |
| 論文出版年: | 2014 |
| 畢業學年度: | 102 |
| 語文別: | 中文 |
| 論文頁數: | 141 |
| 中文關鍵詞: | 復健 、構音障礙 、語音分析 、聲譜 、頻譜 、基頻 |
| 外文關鍵詞: | rehabilitation, articulatory disorder, speech analysis, spectrogram, spectrum, fundamental frequency |
| 相關次數: | 點閱:13 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
隨著近年來身心障礙的族群越來越受到重視,現今的社會有越來越多的構音障礙者會尋求語音上的治療以及復健。因此,發展一套語言治療師能用來輔助及復健治療的工具就顯的愈來愈重要。本研究發展一套以軟體為主的可見式語音診斷與復健系統,使用者可以透過使用者介面分別對構音正常與構音障礙個案錄下語音訊號,並比對兩組語音信號的波形、頻譜、聲譜及基頻等資訊,提供量化的客觀分析。為了評估此系統的實用性、功能性以及正確性,本研究透過與台北榮民總醫院以及衛生福利部桃園醫院新屋分院復健科語言治療團隊的合作,經由人體測試委員會的核准之後,利用電腦收錄了共十三組 (其中包含八位成人,五位小孩;九位男性,四位女性)音訊來做後續的結果分析以及比較。根據實測結果顯示,本研究實現了一個可以呈現出正常語者與具有構音障礙之個案的構音差異的可見式語音診斷與復健系統。此外,透過與Praat軟體比較的結果顯示,本研究的系統較適合於使用者在比對、分析、訓練以及復健上使用,且兩者在功能性 (頻譜,聲譜,基頻等)的使用上會得到近似的結果。總而言之,在臨床上,語言治療師可運用此系統作為評估、診斷以及復健構音障礙者發音狀況的工具。
As more attention paid to the handicapped people recently, more patients with articulatory disorder look for speech therapy. Therefore, to develop a tool which could assist the speech therapist to provide rehabilitation service to more patients is getting more important. The purpose of this study was to develop a software-based visible speech diagnostic and rehabilitation system (VSDRS) which could be used to record and compare the speech signals from the normal speaker and the patient with articulation disorder via the user interface. In this user interface, the clinical users could compare these two signals in the forms of the speech signal, spectrum, spectrogram and fundamental frequency to provide a quantitative analysis. In order to evaluate the usefulness, functionality and accuracy of the system, this study analyzed and compared thirteen (i.e., eight adults and five children, or nine male and four female subjects) speech recordings provided from the cooperation of Taipei Veterans General Hospital and Tao Yuan General Hospital, Ministry of Health and Welfare-Sinwu Branch approved by the Internal Review Board (IRB). The results showed that the VSDRS was realized and able to indicate the difference between the normal speaker and patient with articulatory disorder. Furthermore, when compared with the free software, the Praat system, our VSDRS would be more suitable for clinical users to train their patients for rehabilitation purpose in addition to their functional similarity. In summary, the speech therapists could apply this VSDRS for diagnosis, assessment, and rehabilitation of patients with articulatory disorders.
英文參考資料:
Blumstein, S. E. & Stevens, K. N. (1979). “Acoustic invariance in speech production: Evidence from measurements of the spectral characteristics of stop consonants”, J. Acoust. Soc. Am., Vol. 66, No. 4, 1001-1017.
Hillenbrand, J. & Getty, L. A. & Clark, M. J. & Wheeler K. (1995). “Acoustic characteristics of American English vowels”, J. Acoust. Soc. Am., Vol. 97, 3009-3111.
Kent, R. D. & Read, C. (2002). The acoustic analysis of speech, Thomson Learning: Albany, NY, USA.
Kondoz, A. M. (1994). Digital Speech Coding for Low Bit Rate Communications Systems, Wiley: Hoboken, New Jersey, USA.
Ladefoged, P. (2001). A course in Phonetics, Thomson Learning : Albany, NY, USA.
Liss J. M.& Spitzer S.& Caviness J. N.& Adler C.& Edwards B. (1998). “Syllabic strength and lexical boundary decisions in the perception of hypokinetic dysarthric speech”, J. Acoust. Soc. Am., Vol. 104, No. 4, 2457-2466.
Makhoul, J. (1975). “Linear Prediction: A Tutorial Review”, Proceedings of the IEEE, Vol. 63, No.4, 561-580.
Park, S. H. & Kim, D. J. & Lee, J. H. & Yoon,T. S. (1994). “Integrated Speech Training System for Hearing Impaired”, IEEE transitions on rehabilitation engineering, Vol. 2, No. 4, 189-196.
Peterson,G. E. & Barney,H. L. (1952). “Control methods used in a study of the vowels”, J. Acoust. Soc. Am., Vol. 24, 175-184.
Qiu, L. & Yang, H. & Koh, S. N. (1995). “Fundamental frequency determination based on instantaneous frequency estimation”, Signal Processing, Vol. 44, 233-241.
Rabiner, L. R. (1977). “On the Use of Autocorrelation Analysis for Pitch Detection”, IEEE Transactions on Acoustics, Speech and Signal Processing, Vol. 25, No. 1, 24-33.
Riddel, J. & McCauley, R. J. & Mulligan, M. & Tandan, R.(1995). “Intelligibility and Phonetic Contrast Errors in Highly Intelligible Speakers With Amyotrophic Lateral Sclerosis”, Journal of Speech and Hearing Research, Vol. 38, 304-314.
Robert E. O. Jr.& Metz, D. E.& Haas, A. (2000). Introduction to communication disorders, Pearson Education: Needham Heights, MA.
Rusz J.& Cmejla R.& Ruzickova H.& Ruzicka E. (2011). “Quantitative acoustic measurements for characterization of speech and voice disorders in early untreated Parkinson’s disease”, J. Acoust. Soc. Am., Vol. 129, No. 1, 350-367.
Santen, M. & Mobius, B. & Olive, J. (2005). “Formant Tracking Using Context-Dependent Phonemic Information”, IEEE Transactions on Speech and Audio Processing, Vol. 13, No. 5, 741-750.
Stepp, C. E. (2013). “ Relative fundamental frequency during vocal onset and offset in older speakers with and without Parkinson’s disease”, J. Acoust. Soc. Am., Vol. 133, No. 3, 1637-1643.
Stevens, K. N. & House, A. S. (1955). “Development of a Quantitative Description of Vowel Articulation”, J. Acoust. Soc. Am., Vol. 27,484-493.
Tjaden, K. & Turner, G. S. (1997). “Spectral Properties of Fricatives in Amyotrophic Lateral Sclerosis”, Journal of Speech, Language, and Hearing Research, Vol. 40, 1358-1372.
Turner, G.S. & Weismer, G. (1993). “Characteristics of speaking rate in the dysarthria associated with myotrophic lateral sclerosis”, Journal of Speech and Hearing Research, Vol. 36, 1134-1144.
Watanabe, A. (2001). “Formant Estimation Method Using Inverse-Filter Control”, IEEE Transactions on Speech and Audio Processing, Vol. 9, No. 4, 317-326.
Weismer G. & Martin R. & Kent R. D. & Kent J. F. (1992). “Formant trajectory characteristics of males with amyotrophic lateral sclerosis”, J. Acoust. Soc. Am., Vol. 91, No. 2, 1085-1098.
中文參考資料:
行政院內政部統計處網站,民國100年資料:
http://www.moi.gov.tw/stat/index.aspx
蘇宗柏、陳思遠、王亭貴、王顏和、連倚南,復健醫療服務之疾病分類研究:國內某醫學中心近期經驗,民國99年:
http://www.ntuh.gov.tw/PMR/Lists/List14/Attachments/188/10253009-201012-201101150006-201101150006-229-236.pdf
王淑娟 、高韋樺,國內特定型語言障礙相關研究之初探,民國101年:http://www.ntcu.edu.tw/spc/aspc/6_ebook/pdf/10101/1.pdf
王秋鈴、林素貞,台灣地區兒童語言障礙評量現況調查之研究,民國97年:
http://nutnr.lib.nutn.edu.tw/bitstream/987654321/7457/1/%E6%95%99%E8%82%B2%E5%AD%B8%E5%A0%B1-19%E6%9C%9F-%E7%AC%AC%E4%B8%80%E7%AF%87.pdf
林坤燦,學前障礙幼兒語言評量與需求調查之研究,民國88年:
http://www.cse.ndhu.edu.tw/ezfiles/75/1075/img/237/east2-1.pdf
林寶貴、錡寶香,語言障礙學生輔導手冊,民國95年:
http://163.32.59.168/2/org/Spedu/03.pdf
林寶貴,語言障礙與矯治,民國83年,五南圖書出版有限公司,台灣台北市。
王小川,語音訊號處理,修定二版,民國98年,全華圖書股份有限公司,台灣新北市。
謝國平,語言學概論,第五章,民國95年,三民書局股份有限公司,台灣台北市。
鍾玉梅,舌根音化異常兒童之音韻處理能力探討,民國91年,聽語障礙科學研究所,國立台北護理學院,碩士論文。
鄭靜宜,語音聲學-說話聲音的科學,民國100年,心理出版社股份有限公司,台灣台北市。
網頁參考資料:
Computerized Speech Lab(KayPENTAX, Montvale, NJ.,USA):http://www.kayelemetrics.com/
GNU通用公共授權條款,2007:http://zh.wikipedia.org/wiki/GNU%E9%80%9A%E7%94%A8%E5%85%AC%E5%85%B1%E8%AE%B8%E5%8F%AF%E8%AF%81
MATLAB(The MathWorks, Natick, Massachusetts, USA):
http://www.mathworks.com/products/matlab/
Praat: doing phonetics by computer,2013:http://www.fon.hum.uva.nl/praat/
Sensimetrics, SpeechStation2(Malden, MA, USA):http://www.sens.com/
Spectrogram(app store, 2013):https://itunes.apple.com/tw/app/spectrogram/id293980373?mt=8
Spectrograph(app store, 2013):https://itunes.apple.com/tw/app/spectrograph/id496219322?l=zh&mt=8
Spectrum Analyzer(app store, 2012):
https://itunes.apple.com/tw/app/spectrum-analyzer/id490078884?l=zh&mt=8
SpectrumView Plus(app store, 2013)
https://itunes.apple.com/tw/app/spectrumview-plus/id571455198?l=zh&mt=8
Vocal-2 Visible Speech Training System(Madsen, Austin, TX, USA):
http://www.audioelectronicsinc.com/audiometers/otometricsmadsen.html
ACER(宏碁股份有限公司,新北市,台灣):
http://www.acer-group.com/public/chinese/index/services.htm