| 研究生: |
陳少宇 Shao-Yu Chen |
|---|---|
| 論文名稱: |
基於學生學習的多維度與多階層因果規則探勘 Multi-dimensional and Multi-level Causal Rule Mining for Academic Datasets of Students' Portfolio |
| 指導教授: |
蔡孟峰
Meng-Feng Tsai |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 資訊工程學系 Department of Computer Science & Information Engineering |
| 論文出版年: | 2021 |
| 畢業學年度: | 109 |
| 語文別: | 中文 |
| 論文頁數: | 45 |
| 中文關鍵詞: | 資料探勘 、多階層關聯規則探勘 、因果規則探勘 |
| 外文關鍵詞: | Data Mining, Multi-level Association Rule Mining, Causal Rule Mining |
| 相關次數: | 點閱:15 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
企業與組織之間為了提升營運能力,因此對決策支援系統的需求量大增,目前的系統皆針對特定議題進行,並無一套較廣泛性、一般化應用的系統,然而由於大數據成為目前主流的趨勢,資料搜集變得非常容易,也因此可分析議題也隨之暴增,幅度甚至超過所蒐集資料量的增加,因為可能關注的分析議題,是與資料間的各種排列組合成正比(或更快速成長);而資料的排列組合相對於資料量的大小,是成指數程度的成長。決策支援系統需要科學化論證來說明此決策預期達到的成效,因此本研究以學生為主題切入,不分議題的進行廣泛探索性分析,找尋具有因果關係的資料組合,並將因果關係強度加以量化,以證明此決策的成效,此外更納入多維度與多階層資料架構,挖掘出多樣化、多觀點的因果規則,並從因果規則中推論間接因果規則,釐清整件事情的前因後果。本研究的目標為找出因果規則提供研究人員分析議題發想,亦可提供多元化的決策支援。
In order to improve the operating abilities, enterprises and organizations have greatly increased their demand for decision support systems. The current systems are all focused on specific issues, and there is not exist a more extensive and general system. However, big data has become the current trends. Data collection has become very easy. Therefore, the number of analyzable topics has also increased sharply. Even exceeding the increase of the amount of collected data. Because the analysis topics may be proportional to the various permutations and combinations of the data (or more rapid growth). And the permutation and combination of data grow exponentially with dataset size. The decision support system requires scientific argumentation to illustrate the expected results Therefore, this research takes students as the theme and conducts extensive exploratory analysis. Find out the combinations of data which has causal relationship and quantifies the strength of the relationship to prove the effectiveness of decision-support. Our research also constructs a multi-dimensional and multi-level data structure to mining diversified and multi-view causal rules. Also infer indirect causal rules from the causal rules to clarify the cause and effect. Our goal is to find out causal rule for decision-support and also provide new research topics for researchers.
[ 1 ] https://www.gvm.com.tw/article/42808
[ 2 ] http://www.tair.tw/zh-TW/about/mission
[ 3 ] Efraim Turban; Jay E. Aronson; Ting-Peng Liang (2008). Decision Support Systems and Intelligent Systems. p. 574.
[ 4 ] https://sprout.moe.edu.tw/SproutWeb/Project/Origin
[ 5 ] Rosenbaum, Paul R. 2009. Design of Observational Studies. New York: Springer
[ 6 ] Tukey, John Wilder (1977). Exploratory Data Analysis. Addison-Wesley. ISBN 978-0-201-07616-5.
[ 7 ] P. Spirtes. 2010. Introduction to causal inference. Journal of Machine Learning Research 11, 1643–1662.
[ 8 ] R. Agrawal, T. Imielin ́ski, and A. Swami. 1993. Mining association rules between sets of items in large databases. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD’93). 207–216.
[ 9 ] J. Han and M. Kamber. 2005. Data Mining: Concepts and Techniques (2nd ed.). Morgan Kaufmann, San Francisco, CA.
[ 10 ] J. Han and Y. Fu. 1995. Discovery of Multiple-Level Association Rules from Large Databases. In Proceedings of the 21th International Conference on Very Large Data Bases (VLDB '95). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 420–431.
[ 11 ] "OLTP vs. OLAP". Datawarehouse4u.Info. 2009
[ 12 ] Gray, Jim; Chaudhuri, Surajit; Bosworth, Adam; Layman, Andrew; Reichart, Don; Venkatrao, Murali; Pellow, Frank; Pirahesh, Hamid (January 1997). "Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals". Data Mining and Knowledge Discovery. 1 (1): 29–53.
[ 13 ] P. Spirtes, C. C. Glymour, and R. Scheines. 2001. Causation, Predication, and Search (2nd. ed.). MIT Press, Cambridge, MA.
[ 14 ] Jiuyong Li, Thuc Duy Le, Lin Liu, Jixue Liu, Zhou Jin, Bingyu Sun, and Saisai Ma. 2015. From observational studies to causal rule mining. ACM Trans. Intell. Syst. Technol. 7, 2, Article 14 (November 2015), 27 pages.
[ 15 ] J. W. Song and K. C. Chung. 2010. Observational studies: Cohort and case-control studies. Plastic and Reconstructive Surgery 126, 6, 2234–2242.
[ 16 ] J. L. Fleiss, B. Levin, and M. C. Paik. 2003. Statistical Methods for Rates and Proportions (3rd ed.). Wiley.
[ 17 ] Rakesh Agrawal and Ramakrishnan Srikant Fast algorithms for mining association rules. Proceedings of the 20th International Conference on Very Large Data Bases, VLDB, pages 487-499, Santiago, Chile, September 1994.
[ 18 ] Han (2000). "Mining Frequent Patterns Without Candidate Generation". Proceedings of the 2000 ACM SIGMOD international conference on Management of data - SIGMOD '00. Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data. SIGMOD '00. pp. 1–12.
[ 19 ] Burns, William C., "Spurious Correlations", 1997.
[ 20 ] Floyd, Robert W. (June 1962). "Algorithm 97: Shortest Path". Communications of the ACM. 5 (6): 345