| 研究生: |
陳又彰 You-Zhang Chen |
|---|---|
| 論文名稱: |
應用深度學習於盲源分離之實現 |
| 指導教授: | 董必正 |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
工學院 - 機械工程學系 Department of Mechanical Engineering |
| 論文出版年: | 2020 |
| 畢業學年度: | 108 |
| 語文別: | 中文 |
| 論文頁數: | 61 |
| 中文關鍵詞: | 深度學習 、盲源分離 |
| 外文關鍵詞: | deep learning, blind source separation |
| 相關次數: | 點閱:14 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文旨在建構一套盲源分離軟體,並導入深度學習演算法,以此實現音頻擷取時的降噪功能,並於未來應用於智慧製造的領域。
實驗流程方面,本論文先以乾淨的音源訊號進行模型訓練,以此驗證本論文建構之軟體的正確性,再將各式噪音與原始訊號合成,研究應用深
度學習於音頻降噪實現時最佳的模型訓練方式。
The propose of the thesis is to build a blind source separation program which based on deep learning algorithm. The goal is to achieve noise reduction while capturing sound signals, and to applicate in intelligent manufacturing in the future.
The process of experimentation is training the model with clean audio signal at first. It would verify if the program be written properly. After the verification, the original signal combined with noise would be used to train the model for researching the best training method on noise reduction.
〔1〕J. F.Cardoso, A.Souloumiac, “Blind beamforming for non-gaussian signals”, IEE Proceedings F Radar and Signal Processing, Vol 140, pp. 362, 1993.
〔2〕P. Comon, “Independent component analysis, A new concept?”, Signal Processing, Vol 36, pp. 287-314, April 1994.
〔3〕C. Servière, P. Fabry, “Principal component analysis and blind source separation of modulated sources for electro-mechanical systems diagnostic”, Mechanical Systems and Signal Processing, Vol 19, pp. 1293-1311, 2005.
〔4〕DeLiang. Wang, “Time-frequency masking for speech separation and its potential for hearing aid design”, Trends in amplification, Vol 12, pp. 332-353, 2008
〔5〕C. Jutten, J. H´erault, “Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture”, Signal Processing, Vol 24, pp.1-10, 1991.
〔6〕M.B. Anke, P. Gruber, F. Theis, S. Foo,” Blind source separation based on self-organizing neural network”, Engineering Applications of Artificial Intelligence, Vol 19, pp. 305-311, 2006.
〔7〕I. Valova, N. Gueorguieva, G. Georgiev, “Blind Source Separation with Neural Networks: Demixing Sources From Mixtures with Different Parameters”, 2006 ieee/aiaa 25TH Digital Avionics Systems Conference, pp. 1-11, Portland, USA.
〔8〕F. Acernese, A. Ciaramella, S. De Martino, R. De Rosa, M. Falanga, R. Tagliaferri,” Neural networks for blind-source separation of Stromboli explosion quakes” IEEE Transactions on Neural Networks, Vol 14, pp.167-175, 2003.
〔9〕J. R. Hershey, Z. Chen, J. Le Roux, S. Watanabe, “Deep clustering: Discriminative embeddings for segmentation and separation”, 2016 ICASSP, pp. 31-35, Shanghai, China.
〔10〕V. Nasir, J. Cool, F. Sassani, “Intelligent Machining Monitoring Using Sound Signal Processed With the Wavelet Method and a Self-Organizing Neural Network”, IEEE Robotics and Automation Letters, Vol 4, pp.3449-3456, 2019.
〔11〕Y. Luo, N. Mesgarani, “Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation”, Ieee-Acm Transactions on Audio Speech and Language Processing, Vol 27, pp.1256-1266, 2019.
〔12〕A.G. Howard, Z.Menglong, B. Chen, etc., “MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications”, April 2017.
〔13〕V. Panayotov, G. Chen, D. Povey, S. Khudanpur, ” Librispeech: An ASR corpus based on public domain audio books ,2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5206-5210, Brisbane, Australia.
〔14〕W. Gordon, A. Joe, F. Michael, etc., “WHAM!: Extending Speech Separation to Noisy Environments”, 2019.
〔15〕C. Joris, P. Manuel, C, Samuele, etc., “LibriMix: An Open-Source Dataset for Generalizable Speech Separation”, 2020.
〔16〕S.J Choi, A. Cichocki, S.Y Lee, S.Y Lee, “Blind Source Separation and Independent Component Analysis: A Review” Neural Information Processing – Letters and Reviews, Vol 6, November 2004.