| 研究生: |
顏逸品 Yi-Pin Yian |
|---|---|
| 論文名稱: |
網際網路半結構化資料之蒐集與整合研究 |
| 指導教授: |
陳奕明
Yi-Ming Chen |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
管理學院 - 資訊管理學系 Department of Information Management |
| 畢業學年度: | 88 |
| 語文別: | 中文 |
| 論文頁數: | 108 |
| 中文關鍵詞: | 網際網路 、全球資訊網 、半結構化資料 、領域架構清單 、資料搜尋 、資料蒐集 、資料萃取 、資料整合 |
| 相關次數: | 點閱:14 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本研究提出領域架構清單的背景知識結構,以個別應用領域的背景關鍵辭彙為基礎,系統會自動地完成資料來源的搜尋、半結構化資料分析、資料再結構等相關處理程序。本研究架構有效地改善了傳統半結構化資料處理的可行性程度,並簡化了目前半結構化資料處理相關研究的複雜程度。經過實驗分析與系統評估,發現本研究所使用的領域架構清單背景知識的方式可以有效地完成目前大部分網際網路半結構化資料的處理,與其他同質性與異質性的系統比較起來,本系統的效能也普遍較佳。
[AltaVista] Alta Vista Search Engine, http://www.altavista.com/.
[AltaVista] Alta Vista Search Engine, http://www.altavista.com/.
[CHB] 彰化銀行網站, http://www.chb.com.tw/index1.html.
[CHB] 彰化銀行網站, http://www.chb.com.tw/index1.html.
[DH 1999] Dean Jeffery, Henzinger Monika R., “Finding related pages in the World Wide Web,” Computer Network, Vol 31. 1999. pp. 1467-1479.
[DH 1999] Dean Jeffery, Henzinger Monika R., “Finding related pages in the World Wide Web,” Computer Network, Vol 31. 1999. pp. 1467-1479.
[FP 1998] Filman Robert E., Pant Sangam, “Searching The Internet,” IEEE Internet Computing, July/Auguest,1998, pp 21-23.
[Fund] 基金特蒐員網站, http://www.hello.com.tw/~fund/company/c1.htm.
[GA 1998] Gustavo O. Arocena, Alberto O. Mendelzon, “Viewing WISs as Database Applications,” Communication of ACM, Vol. 41, No. 7, July 1998, pp. 101-102.
[GAIS] GAIS Search Engine, http://gais.cs.ccu.edu.tw/.
[GHR 1998] Gupta Ashish, Harinarayan Venky, Rajaraman Anand, “Virtual Database Technology,” Data Engineering Proceedings., 14th International Conference, 1998 ,pp. 297 —301.
[GHR 1998] Gupta Ashish, Harinarayan Venky, Rajaraman Anand, “Virtual Database Technology,” Data Engineering Proceedings., 14th International Conference, 1998 ,pp. 297 —301.
[HMC+] Hammer J., H. Molina Garcia, Cho J., R. Aranha, and A. Crespo, “Extracting semistructured informationfrom the web,” ftp://db.stanford.edu/pub/papers/extract.ps.
[HMC+] Hammer J., H. Molina Garcia, Cho J., R. Aranha, and A. Crespo, “Extracting semistructured informationfrom the web,” ftp://db.stanford.edu/pub/papers/extract.ps.
[Kleinberg 1998] Kleinberg J., “Authoritative sources in hyperlinked environment,” Proc. of the 9th Annual ACM-SIAM Symposium on Discrete Alogrithms, January 1998, pp.668-677.
[KMS+ 1998] Kogan Yakov, Michaeli David, Sagiv Yehoshua, Shmueli Oded, “Utilizing the multiple facets of WWW contects,” Data Knowledge Engineering, Vol. 28, 1998, pp. 255-275.
[KMS+ 1998] Kogan Yakov, Michaeli David, Sagiv Yehoshua, Shmueli Oded, “Utilizing the multiple facets of WWW contects,” Data Knowledge Engineering, Vol. 28, 1998, pp. 255-275.
[KS 1996] Konopniki, D. and O. Shmuli, “Early experiences with W3QS - A WWW
Information Gathering System,” The 19th IEEE Convention of Electrical and Electronics Engineers, http://www.cs.technion.ac.il/~konop/ieee-1996.ps.gz
Information Gathering System,” The 19th IEEE Convention of Electrical and Electronics Engineers, http://www.cs.technion.ac.il/~konop/ieee-1996.ps.gz
[KWD 1997] Kushmerick, Weld, Doorenbos: “Wrapper induction for information extraction,”IJCAI-97, http://www.compapp.dcu.ie/~nick/research/-download/kushmerick-ijcai97.ps.Z
[KWD 1997] Kushmerick, Weld, Doorenbos: “Wrapper induction for information extraction,”IJCAI-97, http://www.compapp.dcu.ie/~nick/research/-download/kushmerick-ijcai97.ps.Z
[Lore] Lore Project, http://www-db.stanford.edu/lore/.
[MMM 1996] Mendelzohn A., Mihaila G. A., and Milo T., “Querying the world wide web,” 1996, Draft.URL, ftp://ftp.db.toronto.edu/pub/papers/pdis96.ps.gz
[Openfind] Openfind, http://www.openfind.com.tw/.
[RN 1998] Rajaraman Anand, Norvig Peter, “Virtual Database Technology : Transforming the Internet into a Database,” IEEE Internet Computing, July/August, 1998, pp.55-58.
[Teleport] Teleport Web Spider, http://www.teleport.com/.
[TSIMMIS] TSIMMIS Project, http://www-db.stanford.edu/tsimmis/tsimmis.html.
[W3C] World Wide Web Consortium, http://www.w3.org/.
[WebSQL] WebSQL Project, http://www.cs.toronto.edu/~websql/toc.html.
[WebOQL] WebOQL Project, http://www.cs.toronto.edu/~gus/weboql/index.html.
[Wolfgang 1999] Wolfgang May, “Modeling and Querying Structure and Contents of the Web,” IEEE Internet Computing, 1999, pp. 721-725.
[Yahoo] Yahoo, http://www.yahoo.com/.
[李明德 1998] 李明德,『網際網路半結構化資料的擷取、管理與呈現系統』,國立中央大學資訊管理學研究所碩士論文,民國 87 年 6 月。
[許盛貴 1999] 許盛貴,『網際網路資料搜尋之研究』,國立中央大學資訊管理學系碩士論文,民國 88 年 12 月。
[楊振偉 1998] 楊振偉,『利用書籤功能達到網際網路資訊分享與過濾的技術探討』,國立中央大學資訊管理研究所碩士論文,民國 87 年 7 月。