跳到主要內容

簡易檢索 / 詳目顯示

研究生: 何聰鑫
Tsung-Hsin Ho
論文名稱: 時序性資料庫中未知週期之非同步週期性樣板的探勘
Mining Asynchronous Partial Periodic Pattern from Multi-event Time Series Database with Unknown Periods
指導教授: 張嘉惠
Chia-Hui Chang
口試委員:
學位類別: 碩士
Master
系所名稱: 資訊電機學院 - 資訊工程學系
Department of Computer Science & Information Engineering
畢業學年度: 91
語文別: 中文
論文頁數: 46
中文關鍵詞: 資料探勘非同步週期性樣板的探勘時間序列分析
外文關鍵詞: asynchronous partial periodic pattern mining, data mining
相關次數: 點閱:11下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 目前對於週期性樣板探勘的研究均把焦點放在探勘非同步卻是單一事件樣板。然而在實際生活中,每一個時間點發生的事件數絕非單一事件樣板所能夠描述,所以,在這篇論文中,我們提出了一個嶄新的演算法,能夠廣泛地使用在實際應用上。系統有三個參數在找尋顯著樣板時被提了出來,分別是:min_rep限制每個樣板符合的顯著分割中,樣板必須不間斷地重複出現次數;max_dis規範兩兩顯著分割之間,系統所能夠忍受的雜訊間隔最大長度;total_rep指定一個樣板形成的顯著時間子序列中,樣板必須重複出現的最少次數。
    系統演算法由兩大部分所構成。第一部分稱為單一時間樣板探勘,在這一個部分,使用了Sliding Window方法尋找所有單一時間樣板符合的所有顯著分割。第二部分稱為樣板成長,這一部分使用BFS概念結合所有的顯著單一時間樣板,直到得到時序性資料集中所有的顯著子序列。在最後一章,實驗部分顯示我們的演算法是有效率並且穩定的。


    Current research on periodic pattern mining focuses on mining asynchronous but simple single-evnet patterns. However, in real-life situation, there are more than one events happening at one time. In this paper, we propose a thoroughly-new algorithm to really solve the problem we would experience in livelihood. Three parameters min_rep, max_dis and total_rep are employed to specify the constraints a significant pattern must satisfy. Min_rep specify the minimum number of repetitions that is required within each segment of non-disrupted pattern occurrences, max_dis specify the maximum allowed disturbance between any two successive valid segments, and total_rep claims the minimum overall repetitions that is needed within a valid subsequence.
    Our algorithm is composed of two individual parts. One is called 1-pattern mining, and the other is called pattern growth. In the first part, a sliding window method is devised to find the entire potential valid segment matched by 1-patterns. The second part, we make use of the concept of BFS to gain valid subsequences in the overall time series dataset. Finally in experiments, our algorithm is shown efficient and stable with scale-up dataset size.

    1. 緒論 1 1.1. 論文貢獻 3 1.2. 論文架構 4 2. 相關研究 5 2.1. 同步的週期性樣板探勘 5 2.1.1. 循環關聯式規則 5 2.1.2. 分段週期性樣板 7 2.2. 非同步的週期性樣板探勘 12 2.2.1. 分段演算法 12 2.2.2. P-pattern演算法 14 2.3. 結論 16 3. 非同步週期性樣板探勘 18 3.1. 問題描述 20 3.2. 單一時間樣板 22 3.3. 樣板成長 29 3.4. 結論 32 4. 系統使用與說明 33 4.1. 單一週期性樣板 33 4.2. 樣板成長 36 5. 實驗結果 38 5.1. 實驗測試資料說明 38 5.2. 各參數對實驗結果的影響 40 5.2.1. 單一樣板 40 5.2.2. 樣板成長 42 5.3. 實驗結果討論 43 6. 結論與未來展望 44 7. 參考文獻 45

    [1] Jiawei Han, Wan Gong, Yiwen Yin, "Mining Segment-Wise Periodic Patterns in Time-Related Databases", International Conference on Knowledge Discovery and Data Mining, pp. 214-218, 1998.
    [2] Jiawei Han, Guozhu Dong, Yiwen Yin, "Efficient Mining of Partial Periodic Patterns in Time Series Database", International Conference on Data Engineering, pp. 106-115, 1999.
    [3] Herman J. Loether and Donald G. McTavish, "Descriptive and Inferential Statistics: An Introduction", Allyn and Bacon, 1993.
    [4] Banu Ozden, Sridhar Ramaswamy, and Abraham Silberschatz, "Cyclic Association Rules", International Conference on Data Engineering, pp. 412-421, 1998.
    [5] Sridhar Ramaswamy, Sameer Mahajan, Avi Silberschatz, "On the Discovery of Interesting Patterns in Association Rules", International Conference on Very Large Data Bases, pp. 368-379, 1998.
    [6] Jiong Yang, Wei Wang, Philip S. Yu, "Mining Asynchronous Periodic Patterns in Time Series Data", Internationl Conference on Knowledge Discovery and Data Mining, pp. 275-279, 2000.
    [7] Wei Wang, Jiong Yang, "Mining Patterns in Long Sequential Data with Noise", International Conference on Knowledge Discovery and Data Mining (SIGKDD Explorations), pp. 28-33, 2000.
    [8] Wei Wang, Jiong Yang, Philip S. Yu, "Meta-Patterns: Revealing Hidden Periodic Patterns", International Conference on Knowledge Discovering and Data Mining, pp. 550-557, 2001.
    [9] Rakesh Agrawal, Ramakrishnan Srikant, "Fast Algorithms for Mining Association Rules in Large Databases", International Conference on Very Large Data Bases, pp. 487-499, 1994.
    [10] Rakesh Agrawal, Tomasz Imielinski, Arun N. Swami, "Mining Association Rules between Sets of Items in Large Databases", International Conference on Management of Data, pp. 207-216, 1993.
    [11] Osmar R. Zaiane, Man Xin, Jiawei Han, "Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology on Web Logs", Advances in Digital Libraries, pp. 19-29, 1998.
    [12] Sheng Ma, Joseph L. Hellerstein, "Mining Partially Periodic Event Patterns With Unknown Periods", International Conference on Data Engineering, pp. 205-214, 2000.

    QR CODE
    :::