| 研究生: |
詹皇傑 CHAN HUANG CHIEH |
|---|---|
| 論文名稱: |
基於Hadoop之分散式多層次三角網格索引系統 Distributed Hierarchical Triangular Mesh Index Base on Hadoop |
| 指導教授: | 蔡孟峰 |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 資訊工程學系 Department of Computer Science & Information Engineering |
| 論文出版年: | 2016 |
| 畢業學年度: | 104 |
| 語文別: | 中文 |
| 論文頁數: | 58 |
| 中文關鍵詞: | 分散式系統 、空間索引 、多層次三角網格 |
| 外文關鍵詞: | distributed system, spatial index, hierarchical triangular mesh |
| 相關次數: | 點閱:16 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
目前,在地球科學、地理、天文等領域的研究或應用中,需要分析處理空間中的物件,而物件的位置通常是在球面上以赤經及赤緯來記錄。為了使儲存及管理這些物件更有效率,需設計出一套良好的球面空間索引系統。隨著科技的進步以及資料長久的累積,各個領域需要分析的資料量越來越多,導致在傳統單機應用的環境下,硬體無法負擔。
因此,本論文建構了分散式球面空間索引系統,以多層次三角網格(HTM)為模型,運用Hadoop的MapReduce運算框架,設計出相應的資料結構及分散式演算法。不但能儲存及管理大量球面空間物件,還能提供分散式應用程式快速索引與查詢的功能。
本論文之分散式球面空間索引系統,是使用泛星巡天計劃Desktop Virtual Observatory (DVO) 資料集,而經由實驗將系統中各項參數對效率的響影做比較,供使用者參考,可以針對不同的系統環境及不同的應用需求進行調整,達到系統的最佳效率。
Recently, research or applications in earth science, geography, astronomy, those need analyzing and processing spatial objects which position is usually recorded by right ascension (ra) and declination (dec) on sphere. In order to store and manage those objects efficiently, we need to design a well sphere spatial index system. With the progress of technology and long-term accumulation of data, there are more and more data need to analyze in each field, this result cause that hardware can’t afford data in traditional single machine application environment.
Therefore, this paper construct distributed sphere spatial index system which is modeling by Hierarchical Triangular Mesh (HTM) using Hadoop MapReduce computing framework to design reference data structure and algorithm. Not only can it store and manage lots of sphere spatial objects, but offer distributed applications indexing efficiently and searching function.
Distributed sphere spatial index system in this paper, using Desktop Virtual Observatory (DVO) data in Pan-STARRS Project. Through experiments, we compare influence of efficiency by each parameters in this system, and that results can be referenced by users. They can modify parameters according to different system environments and different applications, and distributed sphere spatial index system can achieve its’ best efficiency.
[1] SkyServer – HTM, http://www.skyserver.org/
[2] Pan-STARRS, http://pan-starrs.ifa.hawaii.edu/public/
[3] TAOS, http://taos.asiaa.sinica.edu.tw/
[4] SUBARU, http://www.subarutelescope.org/index.html
[5] Apache Hadoop, http://hadoop.apache.org/
[6] OpenStack, https://www.openstack.org/
[7] Sachin Puttur: Big Data: Overview of apache Hadoop, http://www.sachinpbuzz.com/2014/01/big-data-overview-of-apache-hadoop.html
[8] Hadoop 101: Programming MapReduce with Native Libraries, Hive, Pig, and Cascading, http://blog.pivotal.io/pivotal/products/hadoop-101-programming-mapreduce-with-native-libraries-hive-pig-and-cascading
[9] A. Szalay, J. Gray, G. Fekete, P. Kunszt, P. Kukol, and A. Thakar, "Indexing the sphere with the hierarchical triangular mesh", in MSR-TR- 2005-123, September 2005.
[10] Feng, Jun, et al. "HQ-Tree: A distributed spatial index based on Hadoop." China Communications 11.7 (2014): 128-141.
[11] Lv, Zhenhua, et al. "Spatial indexing of global geographical data with HTM." 2010 18th International Conference on Geoinformatics. IEEE, 2010.
[12] 鄭凱豪, "Distributed Large-Scale Astronomical Data Management System Based on HTM Index", 國立中央大學, 碩士論文, 2012.
[13] 黃郁哲, "Hierarchical Distributed Index and File System for Astronomical Observation Data", 國立中央大學, 碩士論文, 2014.