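Graduate student: 劉冠慶
Kuan-Ching Liou
Thesis title: 深度學習應用於YouTube影片情緒分類 (Deep Learning Applied to Emotion Classification of YouTube Videos)
Advisor: 陳彥良
Y. L. Chen
Oral examination committee:
Degree: Master
Department: 管理學院 - 資訊管理學系 (College of Management - Department of Information Management)
Year of publication: 2020
Academic year of graduation: 108 (ROC calendar; 2019-2020)
Language: Chinese
Number of pages: 36
Keywords (Chinese): 文字探勘, 情緒分析, 深度學習, YouTube
Keywords (English): Text mining, Sentiment analysis, Deep learning, YouTube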
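  • As YouTube has become the second most popular website in the world, its traffic, user base, and revenue keep growing, and new occupations and business models have emerged with substantial commercial potential. In an era of traffic monetization, uploaders want precise marketing, and users want to find videos that suit them in a vast sea of videos. Managing and classifying this huge volume of audiovisual content, so that users can quickly find the videos they want, has therefore become a central task.
    Besides the uploaded videos themselves, YouTube contains other user-generated content, such as video titles, tags, descriptions, and comments. Most of this content is entered by the creator when uploading a video, whereas comments are created collectively by a large number of users. We therefore hope that combining the information produced by creators and users can yield a more objective classification. Previous YouTube video classification methods mostly analyzed text with machine learning methods; this thesis uses deep learning to classify online videos into designated emotion categories.
    This thesis uses text-CNN to extract local features from four groups of text (video title, tags, description, and comments), then uses Bi-LSTM to analyze the features of the longer comments, classifying YouTube videos into suitable emotions more effectively, with a best result of 92.19%.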


    As YouTube has become the world's second most popular website, its traffic, user volume, and revenue have kept increasing. New careers and business models have emerged, and the business opportunities behind them are considerable. Content providers want to market precisely, and users want to find videos that suit them among a vast number of videos. To let users quickly find the videos they want, managing and categorizing this huge amount of video content has become the main task.
    In addition to the uploaded videos, YouTube also contains other user-generated content, such as video descriptions, keywords, titles, and comments. Most of this content is supplied by the content provider, whereas the comments are created by a large number of users. We hope to provide a more objective classification by combining the information generated by content providers and users. Previous YouTube video classification methods mostly used machine learning methods to analyze text. This paper uses deep learning methods to classify Internet videos into designated emotion categories.
    This paper uses text-CNN to extract the local features of four groups of text (title, keywords, description, and comments), and then uses Bi-LSTM to analyze the longer comment features, classifying YouTube videos into suitable emotions more effectively, with a best result of 92.19%.
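    The local-feature step described above can be illustrated with a minimal sketch. It is not the thesis implementation: it uses only the Python standard library, the word vectors and filters are random stand-ins rather than trained parameters, and the names `embed`, `conv_max_pool`, `make_kernels`, `video_features`, and the sample `video` record are all hypothetical. It shows only how a text-CNN style filter slides over the tokens of each text field and keeps the strongest response; the Bi-LSTM stage for long comments and the final classifier are omitted.

```python
import random

FIELDS = ("title", "tags", "description", "comments")


def embed(tokens, dim=4, seed=0):
    """Map each token to a fixed pseudo-random vector.

    Assumption: a stand-in for trained word vectors, used only for illustration.
    """
    vectors = []
    for tok in tokens:
        rng = random.Random(f"{seed}:{tok}")  # same token -> same vector
        vectors.append([rng.uniform(-1.0, 1.0) for _ in range(dim)])
    return vectors


def conv_max_pool(vectors, kernel):
    """Slide one filter over the token vectors, apply ReLU, keep the maximum."""
    width, dim = len(kernel), len(kernel[0])
    # Zero-pad texts shorter than the filter width.
    padded = vectors + [[0.0] * dim] * max(0, width - len(vectors))
    best = 0.0  # ReLU floor: negative responses are clipped to zero
    for start in range(len(padded) - width + 1):
        window = padded[start:start + width]
        response = sum(
            w * x
            for k_row, v_row in zip(kernel, window)
            for w, x in zip(k_row, v_row)
        )
        best = max(best, response)
    return best


def make_kernels(widths=(2, 3), per_width=2, dim=4, seed=1):
    """Create random filters (untrained; a real model would learn these)."""
    rng = random.Random(seed)
    return [
        [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(w)]
        for w in widths
        for _ in range(per_width)
    ]


def video_features(fields, kernels):
    """Concatenate the pooled filter responses of the four text fields."""
    features = []
    for name in FIELDS:
        vectors = embed(fields.get(name, "").lower().split())
        features.extend(conv_max_pool(vectors, k) for k in kernels)
    return features


# Hypothetical video record, invented for this example.
video = {
    "title": "relaxing piano music for study",
    "tags": "piano relax study",
    "description": "one hour of calm piano pieces",
    "comments": "this is so peaceful i love it thank you for uploading",
}
kernels = make_kernels()
features = video_features(video, kernels)
print(len(features))  # 4 fields x 4 filters = 16
```

    In a full model along the lines the abstract describes, a feature vector of this kind would be combined with a Bi-LSTM encoding of the comments before the emotion category is predicted.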

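    Abstract (Chinese)
    Abstract (English)
    Acknowledgements
    Table of Contents
    List of Figures
    List of Tables
    Chapter 1 Introduction
    Chapter 2 Literature Review
      2.1 Sentiment Analysis
      2.2 Image Sentiment Analysis
      2.3 Video Sentiment Analysis
    Chapter 3 Research Method
      3.1 Research Framework
      3.2 Data Preprocessing
      3.3 Word Vectors
      3.4 Deep Learning Methods
        3.4.1 Convolutional Neural Networks
        3.4.2 Recurrent Neural Networks
        3.4.3 Bidirectional Long Short-Term Memory
        3.4.4 Ensemble Model
    Chapter 4 Experimental Results
      4.1 Experimental Design
      4.2 Experimental Environment
      4.3 Model Parameter Settings
      4.4 Experimental Results
    Chapter 5 Conclusion
    References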

