跳到主要內容

簡易檢索 / 詳目顯示

研究生: 戴安傑
AN-jie Dai
論文名稱: OR ensemble 應用於財務危機預測
Financial crisis prediction based on OR ensemble
指導教授: 梁德容
De-ron Liang
口試委員:
學位類別: 碩士
Master
系所名稱: 資訊電機學院 - 資訊工程學系
Department of Computer Science & Information Engineering
論文出版年: 2013
畢業學年度: 102
語文別: 中文
論文頁數: 90
中文關鍵詞: 集成學習分類器財務危機預測
外文關鍵詞: ensemble learning, classifier, financial crisis prediction
相關次數: 點閱:12下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 財務危機預測問題長久以來都是一個重要且常被廣泛討論的主題,吸引了世界各地的投資者和研究學者的關注。若想要能夠準確地進行預測,選個好的預測模型就顯得十分重要,我們從文獻中可發現,在常見的資料集下,沒有一致推薦的預測模型可供我們解決財務危機預測問題。又最近研究顯示,集成學習(ensemble learning)比單一分類器來的好且穩定,因此本研究以集成學習為基底,設計出一個新的預測模型,期望此預測模型不僅能夠對付現有的財務危機資料集,對於一個未來沒看過的資料集,也能夠準確預測。


    Financial crisis problem has been important and widely studied topic. Financial crisis prediction is receiving increasing attention of stakeholders and researchers in the worldwide. If you want to predict accurately, choose a good prediction model is very important, from the literature can be found , in the common data set, there is no consistant conclusion of the prediction model for us to solve the financial crisis prediction problem, and recent studies have shown that ensemble learning is good and stable than a single classifier, thus, this study base on ensemble learning, to design a new prediction model, expect this prediction model not only deal with existing financial crisis data sets, but also predict accurately about a future data sets that we have never seen.

    目錄 中文摘要 i Abstract ii 致謝 iii 圖目錄 vi 表目錄 viii 一、緒論 1 1-1.研究背景 1 1-2.研究動機 3 1-3.論文架構 8 二、文獻探討 9 三、Ensemble learning與classifiers 11 3-1. 分類器 11 3-1-1. SVM 11 3-1-2. KNN 15 3-1-3. Neural Network 16 3-1-4. CART 17 3-1-5. Naïve bayes 18 3-2. Ensemble method 19 3-2-1. Bagging 19 3-2-2. Boosting 20 3-2-3. stacking 22 3-2-4. Majority voting 24 四、Proposed an ensemble 26 4-1.組合方式 26 4-2. 選取base learner的方針 27 4-3. OR ensemble架構 27 五、實驗設計 29 5-1.資料來源 29 5-2.資料前處理 32 5-3.Misclassification cost和cost ratios 33 5-4.實驗參數 34 5-5.研究假設 36 5-6.實驗架構 36 六、實驗結果與分析 38 6-1.base learner的挑選 38 6-2.四個資料集的結果 43 6-2-1.台灣資料集 43 6-2-2.中國資料集 48 6-2-3.澳洲資料集 51 6-2-4.德國資料集 55 6-3.結果分析 59 6-4.再次文獻探討 61 七、結論與未來展望 64 7-1.總結 64 7-2.研究貢獻 64 7-3.未來展望 65 參考文獻 67 附錄一 70 附錄二 73 附錄三 79 附錄四 89

    [1] W.-Y. Lin, Y.-H. Hu, and C.-F. Tsai, "Machine learning in financial crisis prediction: a survey," Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on, vol. 42, pp. 421-436, 2012.
    [2] P. J. Fitzpartrick, "A comparison of ratios of successful industrial enterprises with those of failed companies," Journal of Accounting Research, pp. 598-605, 1932.
    [3] W. H. Beaver, "Financial ratios as predictors of failure," Journal of accounting research, pp. 71-111, 1966.
    [4] E. I. Altman, "Financial ratios, discriminant analysis and the prediction of corporate bankruptcy," The journal of finance, vol. 23, pp. 589-609, 1968.
    [5] J. A. Ohlson, "Financial ratios and the probabilistic prediction of bankruptcy," Journal of accounting research, vol. 18, pp. 109-131, 1980.
    [6] L.-H. Chen and H.-D. Hsiao, "Feature selection to diagnose a business crisis by using a real GA-based support vector machine: An empirical study," Expert Systems with Applications, vol. 35, pp. 1145-1155, 2008.
    [7] Z. Hua, Y. Wang, X. Xu, B. Zhang, and L. Liang, "Predicting corporate financial distress based on integration of support vector machine and logistic regression," Expert Systems with Applications, vol. 33, pp. 434-440, 2007.
    [8] K.-S. Shin, T. S. Lee, and H.-j. Kim, "An application of support vector machines in bankruptcy prediction model," Expert Systems with Applications, vol. 28, pp. 127-135, 2005.
    [9] C.-H. Wu, G.-H. Tzeng, Y.-J. Goo, and W.-C. Fang, "A real-valued genetic algorithm to optimize the parameters of support vector machine for predicting bankruptcy," Expert systems with applications, vol. 32, pp. 397-408, 2007.
    [10] D. West, "Neural network credit scoring models," Computers & Operations Research, vol. 27, pp. 1131-1152, 2000.
    [11] L. Sun and P. P. Shenoy, "Using Bayesian networks for bankruptcy prediction: Some methodological issues," European Journal of Operational Research, vol. 180, pp. 738-753, 2007.
    [12] H. Li, J. Sun, and J. Wu, "Predicting business failure using classification and regression tree: An empirical comparison with popular classical statistical methods and top classification mining methods," Expert Systems with Applications, vol. 37, pp. 5895-5904, 2010.
    [13] K. Y. Tam and M. Y. Kiang, "Managerial applications of neural networks: the case of bank failure predictions," Management science, vol. 38, pp. 926-947, 1992.
    [14] E. N. Ozkan-Gunay and M. Ozkan, "Prediction of bank failures in emerging financial markets: an ANN approach," Journal of Risk Finance, The, vol. 8, pp. 465-480, 2007.
    [15] T. G. Dietterich, "Machine-learning research," AI magazine, vol. 18, p. 97, 1997.
    [16] G. Wang, J. Hao, J. Ma, and H. Jiang, "A comparative assessment of ensemble learning for credit scoring," Expert systems with applications, vol. 38, pp. 223-230, 2011.
    [17] S.-j. Wang, A. Mathew, Y. Chen, L.-f. Xi, L. Ma, and J. Lee, "Empirical analysis of support vector machine ensemble classifiers," Expert Systems with Applications, vol. 36, pp. 6466-6476, 2009.
    [18] D. West, S. Dellana, and J. Qian, "Neural network ensemble strategies for financial decision applications," Computers & operations research, vol. 32, pp. 2543-2559, 2005.
    [19] R. S. Sexton, R. S. Sriram, and H. Etheridge, "Improving decision effectiveness of artificial neural networks: a modified genetic algorithm approach," Decision Sciences, vol. 34, pp. 421-442, 2003.
    [20] Receiver operating characteristic. Available: http://en.wikipedia.org/wiki/Receiver_operating_characteristic
    [21] L. Shi, L. Xi, X. Ma, and X. Hu, "Bagging of artificial neural networks for bankruptcy prediction," in Proceedings of Information and Financial Engineering, 2009, pp. 154-156.
    [22] P. Yao, "Credit scoring using ensemble machine learning," in Proceedings of Hybrid Intelligent Systems, 2009, pp. 244-246.
    [23] S. Lei, M. Xinming, X. Lei, and H. Xiaohong, "Financial data mining based on support vector machines and ensemble learning," in Proceedings of Intelligent Computation Technology and Automation (ICICTA), 2010, pp. 313-314.
    [24] H. Kim, H. Kim, H. Moon, and H. Ahn, "A weight-adjusted voting algorithm for ensembles of classifiers," Journal of the Korean Statistical Society, vol. 40, pp. 437-449, 2011.
    [25] X. Wu, V. Kumar, J. R. Quinlan, J. Ghosh, Q. Yang, H. Motoda, et al., "Top 10 algorithms in data mining," Knowledge and Information Systems, vol. 14, pp. 1-37, 2008.
    [26] F. Lin, D. Liang, and E. Chen, "Financial ratio selection for business crisis prediction," Expert Systems with Applications, vol. 38, pp. 15094-15102, 2011.
    [27] L. Z. Xun. Support Vector Machines introduction. Available: http://www.cmlab.csie.ntu.edu.tw/~cyy/learning/tutorials/SVM2.pdf
    [28] C.-F. Tsai and M.-L. Chen, "Credit rating by hybrid machine learning techniques," Applied Soft Computing, vol. 10, pp. 374-380, 2010.
    [29] L. B. J. F. R. Olshen and C. J. Stone, "Classification and regression trees," Wadsworth International Group, 1984.
    [30] L. Breiman, "Bagging predictors," Machine learning, vol. 24, pp. 123-140, 1996.
    [31] Y. Freund and R. E. Schapire, "Experiments with a new boosting algorithm," in Proceedings of ICML, 1996, pp. 148-156.
    [32] A. Vezhnevets and O. Barinova, "Avoiding boosting overfitting by removing confusing samples," in Machine Learning: ECML 2007, ed: Springer, 2007, pp. 430-441.
    [33] D. H. Wolpert, "Stacked generalization," Neural networks, vol. 5, pp. 241-259, 1992.
    [34] R. Polikar, "Ensemble based systems in decision making," Circuits and Systems Magazine, IEEE, vol. 6, pp. 21-45, 2006.
    [35] T. G. Dietterich, "Ensemble methods in machine learning," in Multiple classifier systems, ed: Springer, 2000, pp. 1-15.
    [36] UCI Machine Learning Repository. Available: http://archive.ics.uci.edu/ml/
    [37] Taiwan Economic Journal. Available: http://www.tej.com.tw/twsite/
    [38] Taiwan Stock Exchange Corporation. Available: http://mops.twse.com.tw/mops/web/index.
    [39] J. Efrim Boritz and D. B. Kennedy, "Effectiveness of neural network types for prediction of business failure," Expert Systems with Applications, vol. 9, pp. 503-512, 1995.
    [40] C.-C. C. a. C.-J. Lin. LIBSVM -- A Library for Support Vector Machines. Available: http://www.csie.ntu.edu.tw/~cjlin/libsvm/
    [41] C.-F. Tsai, "Feature selection in bankruptcy prediction," Knowledge-Based Systems, vol. 22, pp. 120-127, 2009.
    [42] G. H. John and P. Langley, "Estimating continuous distributions in Bayesian classifiers," in Proceedings of the Eleventh conference on Uncertainty in artificial intelligence, 1995, pp. 338-345.
    [43] M. A. Clyde and H. K. Lee, "Bagging and the Bayesian bootstrap," in Proceedings of Artificial Intelligence and Statistics, 2001, pp. 169-174.
    [44] X. Xu and E. Frank, "Logistic regression and boosting for labeled bags of instances," in Advances in Knowledge Discovery and Data Mining, ed: Springer, 2004, pp. 272-281.
    [45] N. Chen, A. S. Vieira, J. Duarte, B. Ribeiro, and J. C. Neves, "Cost-sensitive learning vector quantization for financial distress prediction," in Progress in Artificial Intelligence, ed: Springer, 2009, pp. 374-385.
    [46] S. W. Pottier, "Life insurer financial distress, Best's ratings and financial ratios," Journal of Risk and Insurance, pp. 275-288, 1998.
    [47] P. L. Brockett, L. L. Golden, J. Jang, and C. Yang, "A comparison of neural network, statistical methods, and variable choice for life insurers' financial distress prediction," Journal of Risk and Insurance, vol. 73, pp. 397-419, 2006.
    [48] O. S. Persons, "Using financial statement data to identify factors associated with fraudulent financial reporting," Journal of Applied Business Research (JABR), vol. 11, pp. 38-46, 2011.

    QR CODE
    :::