| 研究生: |
張祜嘉 Hu-Chia Chang |
|---|---|
| 論文名稱: |
分散式重複序列資料庫之效能評估 Performance Evaluation of A Distributed Database of Repetitive Elements in Complete Genomes |
| 指導教授: |
洪炯宗
Jorng-Tzong Horng |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 資訊工程學系 Department of Computer Science & Information Engineering |
| 畢業學年度: | 91 |
| 語文別: | 英文 |
| 論文頁數: | 52 |
| 中文關鍵詞: | 資料庫 、重複序列 、分散式 |
| 外文關鍵詞: | Database, Complete Genomes, Repetitive Elements, Performance Evaluation, Distributed |
| 相關次數: | 點閱:12 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
前一版的重複序列資料庫是建立在集中式資料庫系統上的,此資料庫目前包含了大量的資料,而生物資訊的資料也日漸擴增,重複序列資料庫的效能成為很重要的問題。為了得到更好的效能,我們建立了分散式重複序列資料庫。在分散式資料庫上,資料分散的方法是平衡負載的重要機制。我們設計了許多資料分散的方法來做實驗以得到最適合重複序列資料的方法,也發展了智慧型元件來輔助我們的系統以得到更好的效能。
The original version of Repeat Sequence Database (RSDB) was created based on centralized database systems (CDBSs). It contains large number of data currently, and the size of biological data is increasing rapidly. The performance of RSDB becomes an important issue. Distributed RSDB (DRSDB) is created based on distributed database systems (DDBSs) in order to obtain better performance. Data distribution serves as an important load-balancing mechanism. We design lots of data distribution approaches and try to find the proper approaches to our particular system with experiments. The results show that query processor does not always choose the right data access paths for queries, and we develop an intelligent component to assist our system executing queries wisely in order to obtain much better performance.
[1] Elmasri,R. and Navathe,S.B. (1994) Fundamentals of Database Systems Second Edition. Addison-Wesley Publishing Company, Menlo Park, CA.
[2] Horng,J.T., Lin,J.H. and Kao,C.Y. (2001) RSDB – A Database of Repetitive Elements in Complete Genomes. Proceedings of the Atlantic Symposium on Computational Biology and Genome Information Systems & Technology, Burham, NC, USA, 220-223.
[3] Horowitz,E., Sahni,S. and Mehta,D. (1995) Fundamentals of data structures in C++. W. H. Freeman and Company.
[4] Mehta,M. and DeWitt,D.J. (1997) Data placement in shared-nothing parallel database systems. The VLDB journal, 6. 53-72.
[5] Mukkamala,R. (1989) Measuring the Effect of Data Distribution Models on Performance Evaluation of Distributed Database Systems. IEEE transactios on Knowledge and Data Engineering, 1. 494-507.
[6] Nicola,M. and Jarke,M. (2000) Performance Modeling of Distributed and Replicated Databases. IEEE transactions on Knowledge and Data Engineering, 12. 645-672.
[7] Özsu,M.T. and Valduriez,P. (1996) Distributed and Parallel Database Systems. ACM Computing Surveys, 28. 125-128.
[8] Özsu,M.T. and Valduriez,P. (1999) Principles of Distributed Database Systems Second Edition. Prentice-Hall.
[9] Tamhankar,A.M. and Ram,S. (1998) Database Fragmentation and Allocation: An Integrated Methodology and Case Study. IEEE transactions on System, Man, and Cybernetics – Part A: Systems
39
and Humans, 28. 288-305.