MetaTOC stay on top of your field, easily

Recovering uncaptured citations in a scholarly network: A two‐step citation analysis to estimate publication importance

, ,

Journal of the American Society for Information Science and Technology

Published online on

Abstract

The citation relationships between publications, which are significant for assessing the importance of scholarly components within a network, have been used for various scientific applications. Missing citation metadata in scholarly databases, however, create problems for classical citation‐based ranking algorithms and challenge the performance of citation‐based retrieval systems. In this research, we utilize a two‐step citation analysis method to investigate the importance of publications for which citation information is partially missing. First, we calculate the importance of the author and then use his importance to estimate the publication importance for some selected articles. To evaluate this method, we designed a simulation experiment—“random citation‐missing”—to test the two‐step citation analysis that we carried out with the Association for Computing Machinery (ACM) Digital Library (DL). In this experiment, we simulated different scenarios in a large‐scale scientific digital library, from high‐quality citation data, to very poor quality data, The results show that a two‐step citation analysis can effectively uncover the importance of publications in different situations. More importantly, we found that the optimized impact from the importance of an author (first step) is exponentially increased when the quality of citation decreases. The findings from this study can further enhance citation‐based publication‐ranking algorithms for real‐world applications.