| 研究生: |
楊舒晶 Shu-Ching Yang |
|---|---|
| 論文名稱: |
於共引用分析中使用完整作者集之研究 Clustering authors with complete sets in co-citation analysis |
| 指導教授: |
許秉瑜
Ping-Yu Hsu |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
管理學院 - 工業管理研究所 Graduate Institute of Industrial Management |
| 畢業學年度: | 93 |
| 語文別: | 英文 |
| 論文頁數: | 43 |
| 中文關鍵詞: | 作者共引用分析 、引文分析 、叢集分析 、分群 、資料探勘 |
| 外文關鍵詞: | data mining, ACA, clustering, Author co-citation analysis |
| 相關次數: | 點閱:10 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
中文摘要
作者共引用分析 (Author Co-citation Analysis,ACA) 常被使用在將作者分至不同領域中,然而傳統的ACA方法卻有如下幾項缺點:
1. 分析時,只選取參考文獻中的第一位作者進入分析中,往往忽略了同一篇中其它優秀的作者。
2. 分群後的結果顯示,一位作者只能歸屬至同一群中,當作者為跨領域或多專長的作者時,則無法以分群結果自然呈現出作者特性。
針對上述兩項缺點,本研究中所提出的方法,主要將參考文獻中的所有作者納入分析,使用完整的作者群 (Complete set and compound set) 取代傳統上僅使用第一位作者為分析單位,經由實驗後所得的分群結果顯示,本研究所提出的方法,可以成功地將作者分至不同群中,亦即一位作者可被同時歸屬至不同的分群中。
此外,本研究尚加入門檻值的設定,以過濾被引用次數較少的作者,即進入分析的作者其被引用的頻率必須大於門檻值,否則將不予納入分析,此舉是為了增加演算法的效率。在門檻值的應用下,將本研究所提出的方法分成兩種方法,一為 Complete author pair,僅在共引用作者群成對時設置門檻值,二為Compound author pair,先針對個別作者被引用的頻率設置一門檻值,當共引用作者群成對時,再設置第二次門檻值。經由實驗結果顯示,當門檻值設定較低時,我們可以納入更多的作者進入分析,而演算法的執行時間也受到門檻值所影響,執行時間長短可經由門檻值的設定而調整,當使用較高門檻值時,則所需的執行時間較短。
最後,本研究所提出的方法具備有如下的優點:
1. 可成功地將一作者分至不同的群中。
2. 使用完整的作者群,分析時可納入較多的作者,同時可避免一些優秀的作者被忽略。
3. 藉由門檻值的設定,使得分析上較為彈性,當需分析較多作者時,可設定較低的門檻值,當考慮演算法的執行時間時,可設定較高的門檻值。
Abstract
Author co-citation analysis (ACA) is a method for identifying relationships between co-cited authors. According to the cited frequency of authors by source papers, ACA method can group authors into different research fields. However, the traditional ACA method may have two drawbacks. First, most of ACA researches include only the first author. Second, an author can be grouped into only one cluster in traditional ACA process.
In this research, we use complete and compound sets instead of only first author to compute author co-relations. As a result two algorithms, namely complete author pair and compound author pair, are proposed. The result shows that our proposed method can group author into multiple clusters successfully and include authors who rarely addressed as the first author. Thresholds in the proposed algorithms can be used to tune performances while reducing authors whose works are rarely collected in the databases.
Reference
[1] A.E. Bayer, J.C. Smart, and G.W. McLaughlin, “Mapping Intellectual Structure of a Scientific Subfield through Author Co-citations,” Journal of the American Society for Information Science, vol. 41, no. 6, pp. 444-452, 1990.
[2] Y. Ding, G. Chowdhury, and S. Foo, “Mapping the Intellectual Structure of Information Retrieval Studies: An Author Co-citation Analysis 1987-1997,” Journal of Information Science, vol. 25, no. 1, pp. 67-78 , 1999.
[3] H.D. White and K.W. McCain, “Visualizing a discipline: An author co-citation analysis of information science, 1972-1995,” Journal of the American Society for Information Science, vol. 49, no. 4, pp. 327-356, 1998.
[4] H.D. White and B.C. Griffith, “Authors as Markers of Intellectual Space: Co-citation Studies of Science, Technology and Society,” Journal of Documentation, vol. 38, no. 4, pp. 255-272. , Dec. 1982.
[5] H.D. White and B.C. Griffith, “Author Co-citation: A Literature Measure of Intellectual Structure,” Journal of the American Society for Information Science, vol. 32, pp. 163-171, May 1981.
[6] H.D. White, “Author Co-citation Analysis: Overview and Defense,” In Scholarly Communication and Bibliometrics, ed. Christine L. Borgman, Newbury Park: Sage Publications, pp.84-106, 1990.
[7] R. Karki, “Search for Bridges between Disciplines: An Author Co-citation Analysis on the Research into Scholarly Communication,” Journal of Information Science, vol. 22, no. 5, pp. 323-334, 1996.
[8] C. Chen and L. Carr, “Trailblazing the literature of hypertext: author co-citation analysis (1989–1998),” In Proceedings of the 10th ACM Conference on Hypertext and hypermedia: returning to our diverse roots, pp. 51-60, 1999.
[9] H.D. White, J. Buzydlowski, and X. Lin, “Co-Cited Author Maps as Interfaces to Digital Libraries: Designing Pathfinder Networks in the Humanities,” Proceedings, IEEE International Conference on Information Visualization, pp. 25-30, 2000.
[10] C. Chen, “Visualizing Semantic Spaces and Author Co-Citation Networks in Digital Libraries,” Information Processing and Management, vol. 35, pp. 401-420, 1999.
[11] K.W. McCain, “Mapping Authors in Intellectual Space: A Technical Overview,” Journal of the American Society for Information Science, vol. 41, pp. 433-443, 1990.
[12] H. Small, “Co-citation in the scientific literature: A new measure of the relationship between two documents,” Journal of American Society for Information Science, vol. 24, no. 4, pp. 265-269, 1973.
[13] A.K. Jain, M.N. Murty, and P.J. Flynn, “Data clustering: a review,” ACM Computing Surveys (CSUR), vol. 31, no. 3, 1999.
[14] Y. He and S.C. Hui, “Mining a web citation database for author co-citation analysis,” Information Processing and Management, vol. 38, pp. 491-508, 2002.
[15] B. Everitt, “Cluster analysis,” Hampshire, England: Gower Press, 1986.
[16] H.G. Small, “Co-citation in scientific literature. A new measure for the relationship between publications,” JASIS, vol. 24, pp. 265-269, 1973.
[17] D. Hicks, “Limitations of co-citation analysis as a tool for science policy,” Social Studies of Science, vol. 17, pp. 295-316, 1987.
[18] L. Egghe and R. Rousseau, “Introduction to Informetrics Quantitative Method in Library, Documentation and Information Science,” Elsevier, pp.211-227, 1990.
[19] S.B. Eom, 2003. Author co-citation analysis using custom bibliographic databases an introduction to the SAS approach Edwin Mellen Press.
[20] H.G. Small and B.C. Griffith, “The structure of scientific literatures I: Identifying and graphing specialties,” Science Studies, vol. 4, pp. 17-40, 1974.
[21] H.D. White and B.C. Griffith, “Author co-citation: A literature measure of intellectual structure,” Journal of the American Society for Information Science, vol. 32, pp. 307-312, 1981.
[22] M.S. Aldenderfer and R.K. Blashfield, “Clusster analysis,” Sage Publications, Newbury Park, Calif., pp. 33-45, 1986.
[23] P. Willet, “Recent trends in hierarchical document clustering: a critical review,” Information Processing and Management, vol. 24, pp. 577-597, 1988.
[24] X. Lin, H.D. White and J. Buzydlowski, “Real-time author co-citation mapping for online searching,” Information Processing and Management, vol. 39, pp. 689-706, 2003.