| 研究生: |
張迪安 Ti-an Chang |
|---|---|
| 論文名稱: |
快速XML文件查詢法之研究 An Effective XML Document Search Method |
| 指導教授: |
許秉瑜
Ping-yu Hsu |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
管理學院 - 企業管理學系 Department of Business Administration |
| 畢業學年度: | 96 |
| 語文別: | 中文 |
| 論文頁數: | 39 |
| 中文關鍵詞: | 關鍵字查詢 、XML 、XML 查詢語言 |
| 外文關鍵詞: | XML, Keyword Search, XML Query Language |
| 相關次數: | 點閱:16 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
隨著網際網路的發展,以XML作為資料儲存文件越來越多,特別是辦公室軟體OpenOffice.org與Microsoft Office皆陸續開放其原始碼,以XML 作為資料儲存的格式。面對大量XML文件時,如何獲取完整且相關的資訊,成為一個重要的議題。
所以本研究提供了一個較快速的XML 文件查詢方法,能夠針對於XML 文件的文字內容(Text Content)與結構(Structure)部分進行查詢,跟以往XML 查詢語言須透過XML 文件之樹狀結構進行查詢相比,本研究是針對XML 文件的文字內容與結構的部分,建立關鍵字矩陣與結構矩陣,透過矩陣來進行查詢能夠改善查詢時間的效率,在面對大量XML 文件查詢時更能發揮它的效果。
另外,由於本研究進行查詢時須建立矩陣,因而造成儲存空間的增加,所以要如何在有限的儲存空間做最有效的查詢,本研究提出一個結構矩陣決策演算法,來決定增加結構矩陣後的儲存空間能否符合使用者在時間效率改善上的期望。
Because of the rapid development of internet, there are more and more documents stored by the XML–based format. Especially some office software opens his source code gradually like Microsoft Office and OpenOffice.org. When there is a great deal of XML documents, how to get the valuable information is an important subject.
This paper proposes an effective XML document search method to search text contents and structures of XML documents. We build the keyword matrix of text contexts and structure matrixes of structures in XML documents to improve the efficiency of query time. When there is a great deal of XML documents, the search method we propose can improve much efficiency of query time.
Because this search method should build matrixes, these would make the addition of storage space. So how to have better query efficiency in the limited storage space is an important subject. This paper proposes the decision-making of structure matrixes algorithm to decide addition of structure matrixes by the efficiency of improved query time which accomplishes the expectation the user wants to improve.
[1] OpenOffice.org XML File Format. available at http://xml.openoffice.org/general.html
[2] Introducing the Office (2007) Open XML File Formats. 2006, available at http://msdn2.microsoft.com/en-us/library/aa338205.aspx
[3] Z. Liu and Y. Chen,“Identifying Meaningful Return Information for XML Keyword Search”, Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp.329-340, 2007
[4] J. Clark and S. DeRose: XML Path Language(XPath)Version 1.0. 1999, available at http://www.w3.org/TR/xpath
[5] S. Boag, et al: XQuery 1.0: An XML Query Language. 2007, available at http://www.w3.org/TR/xquery
[6] S. Cohen, et al,“XSEarch:A Semantic Search Engine for XML”, Proceedings of the 29th VLDB Conference, pp. 45–56, 2003
[7] L.Guo, et al,“XRANK:Ranked Keyword Search over XML Documents”, Proceedings of the 2003 ACM SIGMOD international conference on Management of data, pp. 16-27, 2003
[8] Extensible Markup Language (XML). available at http://www.w3.org/XML/
[9] W3C-World Wide Web Consortium, http://www.w3.org/
[10] ISO-International Organization for Standardization, http://www.iso.org/
[11] Standard Generalized Markup Language (SGML), Standard by the International Organization for Standardization, ISO-8879, 1986.
[12] Comparison of SGML and XML. 1997, available at http://www.w3.org/TR/NOTE-sgml-xml-971215
[13] OpenOffice.org 中文計劃. available at http://zh.openoffice.org/new/index.html
[14] P. Durusau, et al: Open Document Format for Office Applications (OpenDocument) v1.1. 2007, available at http://docs.oasis-open.org/office/v1.1/OS/OpenDocument-v1.1-html/OpenDocument-v1.1.html
[15] S. DeRose, E. Maler, R.Daniel: XML Pointer Language(XPointer).2001, available at http://www.w3.org/TR/WD-xptr
[16] J. Clark: XSL Transformations(XSLT).1999, available at http://www.w3.org/TR/xslt
[17] S. Abiteboul, et al, "The Lorel query language for semistructured data", International Journal on Digital Libraries, pp. 88-89, 1997
[18] A. Deutsch, et al, XML-QL: A Query Language for XML. 1998, available at http://www.w3.org/TR/1998/NOTE-xml-ql-19980819.
[19] S. Ceri, et al, XML-GL:A graphical language for querying and restructuring XML documents, In proceedings of the 8th International World Wide Web Conference, pp. 93-109, 1999
[20] J. Robie, et al: XML Query Language (XQL). 1998, available at http://www.w3.org/TandS/QL/QL98/pp/xql.html
[21] A. Bonifati and S. Ceri,“Comparative analysis of five XML query languages”, ACM SIGMOD Record, 2000
[22] Y.Xu and Y. Papakonstantinou,“Efficient Keyword Search for Smallest LCAs in XML Databases”, Proceedings of the 2005 ACM SIGMOD international conference on Management of data, pp.527-538, 2005
[23] 林昌正,「多XML文件整合萃取工具之研究」,國立中央大學,碩士論文,民國97年
[24] V. Harinarayan, A. Rajaraman, and J. D. Ullman, “Implementing Data Cubes Efficiently”, Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data , pp. 205-216, June 1996.
[25] CeBIT, http://www.cebit.de/homepage_e
[26] http://de.openoffice.org/files/documents/66/3274/CeBIT_OOo20.odp
[27] http://www.ba.ncu.edu.tw/dmerplab/CeBIT_OOo_En.odp