To read this content please select one of the options below:

The stability of symbol sets produced by variety generation from bibliographic data

E. Verity Brack (Postgraduate School of Librarianship and Information Science, University of Sheffield, Western Bank, Sheffield, S10 2TN)
David Cooper (Postgraduate School of Librarianship and Information Science, University of Sheffield, Western Bank, Sheffield, S10 2TN)
Michael F. Lynch (Postgraduate School of Librarianship and Information Science, University of Sheffield, Western Bank, Sheffield, S10 2TN)

Program: electronic library and information systems

ISSN: 0033-0337

Article publication date: 1 February 1978

35

Abstract

Variety Generation involves the selection of sets of character strings, or symbols, which are intended to occur with equal probabilities in bodies of text or sets of text units from a particular source. It is important that the sample used to generate the symbol set should be representative of the data with which the set will be used. An assessment is given here of the amount of variation in symbol sets generated from files of titles and author names from BNB MARC data over a five year period, and a comparison is made with LC MARC. Some of the BNB symbol sets are compared directly, and equifrequency statistics are obtained for the assignment of each symbol set to each file. The differences between the equifrequency statistics are examined by means of an analysis of variance technique.

Citation

Verity Brack, E., Cooper, D. and Lynch, M.F. (1978), "The stability of symbol sets produced by variety generation from bibliographic data", Program: electronic library and information systems, Vol. 12 No. 2, pp. 64-77. https://doi.org/10.1108/eb046772

Publisher

:

MCB UP Ltd

Copyright © 1978, MCB UP Limited

Related articles