Evaluation of Features for Author Name Disambiguation Using Linear Support Vector Machines

Abstract
Author name disambiguation allows to distinguish between two or more authors sharing the same name. In a previous paper, we have proposed a name disambiguation framework in which for each author name in each article we build a context consisting of classification codes, bibliographic references, co-authors, etc. Then, by pairwise comparison of contexts, we have been grouping contributions likely referring to the same people. In this paper we examine which elements of the context are most effective in author name disambiguation. We employ linear Support Vector Machines (SVM) to find the most influential features.
Description
Keywords
Citation
P. J. Dendek, Ł. Bolikowski, M. Łukasik. Evaluation of Features for Author Name Disambiguation Using Linear Support Vector Machines. In:. Proceedings of the 10th IAPR International Workshop on Document Analysis Systems. 2012; p. 440-444.