Graduate student: Chuan-Fu Chuang (莊泉福)
Thesis title: Developing a Customized W3 Mining System with PHP Language: a Case of KLA Website Data (應用PHP語言開發用戶化全球資訊網探勘系統-以高雄市地政局網站數據為例)
Advisor: Shiuann-Shuoh Chen (陳炫碩)
Oral examination committee:
Degree: Doctor
Department: College of Management - Department of Business Administration
Year of publication: 2019
Academic year of graduation: 107 (ROC calendar)
Language: Chinese
Number of pages: 39
Keywords: Web mining system framework, Economical web mining architecture, Customized W3 mining system, PHP, Web crawler
  • This study defines a new web mining system framework (WMSF), designs an economical web mining architecture, and develops a Customized W3 Mining System (CWMS) in order to reduce the cost of building a web mining platform. It then verifies and evaluates the web crawler efficiency of the CWMS and its flexibility (adjustability and portability). The research method integrates the theoretical models obtained from the literature review into a research framework organized around four topics: the definition of the WMSF, the CWMS itself, the design and implementation of the CWMS, and an experimental study of the CWMS. The new WMSF is assembled from two parts of the functional architecture of the integrated framework for keyword-based text data collection and analysis studied by Cha et al. [9]: the web crawler component and the result view panel (RVP) of the user interface component. Following the script execution and address extraction process (virtual algorithm) studied by Oh et al. [2], the CWMS was then developed in PHP; its components comprise algorithm programs, mass storage space, and a web server. Finally, the experimental results verify that the system can improve web crawler efficiency as well as flexibility (adjustability and portability).

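    Chinese Abstract
    Abstract
    Acknowledgments
    Table of Contents
    List of Tables
    List of Figures
    1. Introduction
        1-1 Research Background and Motivation
        1-2 Research Objectives
    2. Literature Review
        2-1 Web Crawlers
        2-2 Integrated Framework for Text Data Collection and Analysis
        2-3 Script Execution and Address Extraction Process
    3. Definition of the Web Mining System Framework
    4. Customized W3 Mining System
        4-1 CWMS Outline
        4-2 SCVS Data Format
    5. Design and Implementation of the CWMS
    6. Experimental Study of the CWMS
    7. Conclusion
        7-1 Research Conclusions
        7-2 Research Limitations and Future Research Directions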

    [1] Blazquez, D., & Domenech, J., Web Data Mining For Monitoring Business Export Orientation, Technological and Economic Development of Economy, Vol. 24, No. 2, pp. 406-428, March, 2018.
    [2] Oh, H. J., Won, D. H., Kim, C., Park, S. H., & Kim, Y., Design and implementation of crawling algorithm to collect deep web information for web archiving, Data Technologies and Applications, Vol. 52, No. 2, pp. 266-277, 2018.
    [3] Amanatidis, T., & Chatzigeorgiou, A., Studying the evolution of PHP web applications, Information and Software Technology, Vol. 72, pp. 48-67, 2016.
    [4] Lu, C. T., Yeh, C. S. E., Wang, Y. C., & Yang, C. S., Hybrid Clouds for Web Systems: Usability and Performance, Journal of Internet Technology, Vol. 19, No. 1, pp. 187-195, January, 2018.
    [5] Zhao, F., Zhou, J., Nie, C., Huang, H., & Jin, H., SmartCrawler: A Two-stage Crawler for Efficiently Harvesting Deep-Web Interfaces, IEEE Transactions on Services Computing, Vol. 99, pp. 1-14, 2015.
    [6] Mason, T., Trochez, C., Thomas, R., Babar, M., Hesso, I., & Kayyali, R., Knowledge and awareness of the general public and perception of pharmacists about antibiotic resistance, BMC Public Health, Vol. 18, No. 711, pp. 1-10, June, 2018.
    [7] Sohal, I. S., O’Fallon, K. S., Gaines, P., Demokritou, P., & Bello, D., Ingested engineered nanomaterials: state of science in nanotoxicity testing and future research needs, Particle and Fibre Toxicology, Vol. 15, No. 1, pp. 1-31, July, 2018.
    [8] Savini, L., Tora, S., Lorenzo, A. D., Cioci, D., Monaco, F., Polci, A., Orsini, M., Calistri, P., & Conte, A., A Web Geographic Information System to share data and explorative analysis tools: The application to West Nile disease in the Mediterranean basin, PLOS ONE, Vol. 13, No. 6, pp. 1-14, June, 2018.
    [9] Cha, M., Kwon, J. H., Lee, S. B., Park, J., Youm, S., & Kim, E. J., Integrated Framework for Keyword-based Text Data Collection and Analysis, Sensors and Materials, Vol. 30, No. 3, pp. 439-445, January, 2018.
    [10] Hu, H., Ge, Y., & Hou, D., Using Web Crawler Technology for Geo-Events Analysis: A Case Study of the Huangyan Island Incident, Sustainability, Vol. 6, No. 4, pp. 1896-1912, April, 2014.
    [11] Thenmalar, S., & Geetha, T. V., The Modified Concept Based Focused Crawling Using Ontology, Journal of Web Engineering, Vol. 13, No. 5, pp. 525-538, November, 2014.
    [12] Hsu, P. Y., Hsieh, S. T., & Chuang, Y. C., Effective memory reusability based on user distributions in a cloud architecture to support manufacturing ubiquitous computing, International Journal of Computer Integrated Manufacturing, Vol. 30, No. 4, pp. 459-471, 2017.
    [13] Tsai, W. H., Chou, W. C., & Leu, J. D., An Effectiveness Evaluation Model for the Web-based Marketing of the Airline Industry, Expert Systems with Applications, Vol. 38, No. 12, pp. 15499-15516, November-December, 2011.
    [14] Chen, S. S., Chuang, Y. W., & Chen, P. Y., Behavioral Intention Formation in Knowledge Sharing: Examining the Roles of KMS Quality, KMS Self-Efficacy, and Organizational Climate, Knowledge-based Systems, Vol. 31, pp. 106-118, July, 2012.
    [15] Thapar, V., & Gupta, O. P., PI3 Performance Model of Software as a Service (SaaS) Cloud Environment, International Journal of Advanced Research in Computer Science, Vol. 8, No. 3, pp. 926-937, March/April, 2017.
    [16] Shen, C. W., Hsu, P. Y., & Peng, Y. T., The Impact of data environment and profitability on business intelligence adoption, Lecture Notes in Artificial Intelligence, Vol. 7197, pp. 185-193, 2012.
