| 研究生: |
鄧鈺翰 Yu-Han Teng |
|---|---|
| 論文名稱: |
使用多模態架構進行深度學習模型分析之研究 Using a multimodal architecture Research on Deep Learning Model Analysis |
| 指導教授: | 薛義誠 |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
管理學院 - 資訊管理學系 Department of Information Management |
| 論文出版年: | 2019 |
| 畢業學年度: | 107 |
| 語文別: | 中文 |
| 論文頁數: | 80 |
| 中文關鍵詞: | 多模態深度學習、GRU、CNN、Word2Vec、Glove、注意力機制 |
| 外文關鍵詞: | Multimodal deep learning, GRU, CNN, Word2Vec, Glove, Attention mechanism |
| 相關次數: | 點閱:14 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
隨著社交網路與電子商務網站的普及,使用者從被動的接收訊息轉變為主動傳播訊息,評論以及網路訊息所呈現的價值也越來越重要,過去幾年的分析研究,試圖去分析了解有關具體的輿論產品、主題、評論與推文的趨勢,在各個方面發揮著重要作用。本研究利用不同的向量化處理,對多模態分析模型進行驗證比對,確認模型可有效提升準確度。本研究提出一種由兩種模型組成之結合特徵,並將此特徵結合深度學習神經網路建構建立多模態分析模型。模型一是基於Glove向量、注意力機制與GRU神經網路架構之深度學習模型,模型二是基於Word2Vec向量、注意力機制與CNN神經網路架構之深度學習模型,多模態分析模型經由K折交叉驗證、F1測量方法進行模型驗證。實驗結果證明本研究提出之多模態分析模型,準確率高於相關研究,利用高層級多模態結合法,將多個模型的特徵取出並加以結合形成結合特徵,並將此特徵進行神經網路訓練,可使特徵集有互相輔助之效果,透過兩種向量與最佳神經網路架構並搭配多模態方法可以得到91.56%的準確率,並在模型驗證得到了93%的驗證值,證明本研究提出之多模態分析模型用於評論文本領域,可有效提升模型預測準確率,使其準確率有顯著的提升。
With the popularity of social networks and e-commerce sites, users have switched from passively receiving messages to actively disseminating messages. The value of comments and online messages is also becoming more and more important. Analysis and research over the past few years. Trying to analyze trends about specific product products, topics, reviews, and tweets. Play an important role in all aspects. This study uses different vectorization processes to verify the multimodal analysis model and confirm that the model can effectively improve the accuracy. This study proposes a combination of two models. This feature is combined with deep learning neural network construction to build a multimodal analysis model. Model 1 is a deep learning model based on Glove vector, attention mechanism and GRU neural network architecture. Model 2 is a deep learning model based on Word2Vec vector, attention mechanism and CNN neural network architecture. Multimodal analysis model is validated by K-fold cross validation and F1 measurement method. The experimental results prove that the multimodal analysis model proposed in this study has higher accuracy than related research. Using the high-level multi-modal combination method, the features of multiple models are extracted and combined to form a combined feature, and this feature is trained in neural network. The feature set can be mutually assisted, and the accuracy can be 91.56% through the two vectors and the optimal neural network architecture combined with the multi-modal method. And the model verification shows 93% verification value, which proves that the multimodal analysis model proposed in this study is used in the field of comment texts, which can effectively improve the accuracy of model prediction and improve its accuracy.
Andrew. (2011). Learning word vectors for sentiment analysis.
Arras & Montavon. (2016). Explaining predictions of non-linear classifiers in NLP.
Azimi & Abdolrashidi. (2019). Deep-Sentiment: Sentiment Analysis Using Ensemble of CNN and Bi-LSTM Models.
Bagnall, A., Lines, J., Hills, J., & Bostrom, A. (2015). Time-series classification with COTE: the collective of transformation-based ensembles.
Bargal&Sclaroff. (2018). Top-down neural attention by excitation backprop.
Bengio & Grandvalet. (2014). No unbiased estimator of the variance of k-fold cross-validation.
Cerisara & Lenc. (2018). On the effects of using word2vec representations in neural networks for dialogue act recognition.
Chua&Sun. (2015). Topical word embeddings.
Dayan & Abbott. (2001). Theoretical neuroscience: computational and mathematical modeling of neural systems.
Dhariyal & Ravi. (2018). Sentiment analysis via Doc2Vec and Convolutional Neural Network hybrids.
Hansen & Simonsen. (2019). Neural Speed Reading with Structural-Jump-LSTM.
Hinton&Salakhutdinov. (2006). Reducing the dimensionality of data with neural networks.
Hochreiter & Schmidhuber. (1997). Long short-term memory. Neural computation.
Ji, L., Gong, P., & Yao, Z. (2019). A text sentiment analysis model based on self-attention mechanism.
Kai Sheng Tai & Richard Socher. (2015). Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks.
Khosla & Ng. (2011). Multimodal deep learning.
KimY. (2014). Convolutional neural networks for sentence classification.
Kingma & Ba. (2014). Adam: A method for stochastic optimization.
Koley & Dey. (2012). An ensemble system for automatic sleep stage classification using single channel EEG signal.
Lines & Bostrom. (2017). Time series classification from scratch with deep neural networks: A strong baseline.
Liu & Xiong. (2018). Attention Aware Bidirectional Gated Recurrent Unit Based Framework for Sentiment Analysis.
Mikolov & Dean. (2013). Efficient estimation of word representations in vector space.
Nanopoulos & Manolopoulos. (2001). Feature-based classification of time-series data.
Peng & Zhao. (2017). Object-part attention model for fine-grained image classification.
Pennington & Manning. (2014). Glove: Global vectors for word representation.
Ravi&Dhariyal. (2018). Sentiment analysis via Doc2Vec and Convolutional Neural Network hybrids.
Ren&Bao. (2018). Investigating Lstm with k-Max Pooling for Text Classification.
RosenblattF. (1958). The perceptron: a probabilistic model for information storage and organization in the brain.
Shazeer & Jones. (2017). Attention is all you need.
Simard & Frasconi. (1994). Learning long-term dependencies with gradient descent is difficult.
Subarno & Ghosh. (2018). Sentiment Analysis in the Light of LSTM Recurrent Neural Networks.
Sutskever & Hinton. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems.
Torralba&Fidler. (2015). Skip-Thought Vectors.
Tsuruoka. (2016). A joint many-task model: Growing a neural network for multiple nlp tasks.
Wei & Keogh. (2006). Semi-supervised time series classification.
Williams & Zipser. (1989). A learning algorithm for continually running fully recurrent neural networks.
Xianghua. (2018). Lexicon-enhanced LSTM with attention for general sentiment analysis.
Xiao & Zhao. (2018). A deep learning-based multi-model ensemble method for cancer prediction.
Xinpeng & Jingyuan. (2018). Fine-grained Video Attractiveness Prediction Using Multimodal.
Zhicheng Cui. (2016). Multi-scale convolutional neural networks for time series classification.