This study furthers metadata quality research by providing complementary network-based metrics and insights to analyze metadata records and identify areas for improvement.
Metadata record graphs apply network analysis to metadata field values; this study evaluates the interconnectedness of subjects within each Hub aggregated into the Digital Public Library of America. It also reviews the effects of NACO normalization – simulating revision of values for consistency – and breaking up pre-coordinated subject headings – to simulate applying the Faceted Application of Subject Terminology to Library of Congress Subject Headings.
Network statistics complement count- or value-based metrics by providing context related to the number of records a user might actually find starting from one item and moving to others via shared subject values. Additionally, connectivity increases through the normalization of values to correct or adjust for formatting differences or by breaking pre-coordinated subject strings into separate topics.
This analysis focuses on exact-string matches, which is the lowest-common denominator for searching, although many search engines and digital library indexes may use less stringent matching methods. In terms of practical implications for evaluating or improving subjects in metadata, the normalization components demonstrate where resources may be most effectively allocated for these activities (depending on a collection).
Although the individual components of this research are not particularly novel, network analysis has not generally been applied to metadata analysis. This research furthers previous studies related to metadata quality analysis of aggregations and digital collections in general.
Phillips, M.E. and Tarver, H. (2021), "Investigating the use of metadata record graphs to analyze subject headings in the digital public library of America", The Electronic Library, Vol. 39 No. 3, pp. 450-468. https://doi.org/10.1108/EL-11-2020-0317
Emerald Publishing Limited
Copyright © 2021, Emerald Publishing Limited