Search results
1 – 10 of 15
Kenning Arlitsch, Jonathan Wheeler, Minh Thi Ngoc Pham and Nikolaus Nova Parulian
Abstract
Purpose
This study demonstrates that aggregated data from the Repository Analytics and Metrics Portal (RAMP) have significant potential to analyze visibility and use of institutional repositories (IR) as well as potential factors affecting their use, including repository size, platform, content, device and global location. The RAMP dataset is unique and public.
Design/methodology/approach
The webometrics methodology was followed to aggregate and analyze use and performance data from 35 institutional repositories in seven countries that were registered with the RAMP for a five-month period in 2019. The RAMP aggregates Google Search Console (GSC) data to show IR items that surfaced in search results from all Google properties.
Findings
The analyses demonstrate large performance variances across IR as well as low overall use. The findings also show that device use affects search behavior, that different content types such as electronic theses and dissertations (ETD) may affect use and that searches originating in the Global South show much higher use of mobile devices than those originating in the Global North.
Research limitations/implications
The RAMP relies on GSC as its sole data source, resulting in somewhat conservative overall numbers. However, the data are also expected to be as robot-free as can be hoped.
Originality/value
This may be the first analysis of aggregate use and performance data derived from a global set of IR, using an openly published dataset. RAMP data offer significant research potential with regard to quantifying and characterizing variances in the discoverability and use of IR content.
Peer review
The peer review history for this article is available at: https://publons.com/publon/10.1108/OIR-08-2020-0328
Patrick O’Brien, Scott W.H. Young, Kenning Arlitsch and Karl Benedict
Abstract
Purpose
The purpose of this paper is to examine the extent to which HTTPS encryption and Google Analytics services have been implemented on academic library websites, and discuss the privacy implications of free services that introduce web tracking of users.
Design/methodology/approach
The home pages of 279 academic libraries were analyzed for the presence of HTTPS, Google Analytics services and privacy-protection features.
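A check of this kind can be sketched in a few lines of Python. This is only an illustrative sketch, not the instrument used in the study: the function name `audit_home_page` is hypothetical, and the patterns are commonly seen Google Analytics and Tag Manager snippet strings chosen here as assumptions. The caller is assumed to have already fetched the page and to pass in the final URL after any redirects.

```python
import re

# Assumed detection patterns (common tracking-snippet strings),
# not the rules applied in the study itself.
GA_PATTERN = re.compile(
    r"google-analytics\.com/(analytics|ga)\.js|googletagmanager\.com/gtag/js", re.I
)
GTM_PATTERN = re.compile(r"googletagmanager\.com/(gtm\.js|ns\.html)", re.I)
ANON_PATTERN = re.compile(r"anonymizeIp|anonymize_ip", re.I)
INSECURE_GOOGLE = re.compile(
    r"http://(www\.)?(google-analytics|googletagmanager)\.com", re.I
)


def audit_home_page(final_url: str, html: str) -> dict:
    """Report HTTPS use and Google tracking features for one home page.

    final_url is the URL the browser ends up on after redirects;
    html is the page source as a string.
    """
    return {
        "https": final_url.lower().startswith("https://"),
        "google_analytics": bool(GA_PATTERN.search(html)),
        "tag_manager": bool(GTM_PATTERN.search(html)),
        "ip_anonymization": bool(ANON_PATTERN.search(html)),
        "insecure_google_request": bool(INSECURE_GOOGLE.search(html)),
    }


if __name__ == "__main__":
    sample = '<script src="http://www.google-analytics.com/analytics.js"></script>'
    print(audit_home_page("http://library.example.edu/", sample))
```

String matching on page source will miss trackers that are loaded dynamically, so results from a sketch like this should be read as a lower bound.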
Findings
Results indicate that HTTPS implementation on library websites is not widespread, and many libraries continue to offer non-secured connections without an automatically enforced redirect to a secure connection. Furthermore, a large majority of library websites included in the study have implemented Google Analytics and/or Google Tag Manager, yet only very few connect securely to Google via HTTPS or have implemented Google Analytics IP anonymization.
Practical implications
Librarians are encouraged to increase awareness of this issue and take concerted and coherent action across five interrelated areas: implementing secure web protocols (HTTPS), user education, privacy policies, informed consent and risk/benefit analyses.
Originality/value
Third-party tracking of users is prevalent across the web, and yet few studies demonstrate its extent and consequences for academic library websites.
Patrick O’Brien, Kenning Arlitsch, Jeff Mixter, Jonathan Wheeler and Leila Belle Sterman
Abstract
Purpose
The purpose of this paper is to present data that begin to detail the deficiencies of log file analytics reporting methods that are commonly built into institutional repository (IR) platforms. The authors propose a new method for collecting and reporting IR item download metrics. This paper introduces a web service prototype that captures activity that current analytics methods are likely to either miss or over-report.
Design/methodology/approach
Data were extracted from DSpace Solr logs of an IR and were cross-referenced with Google Analytics and Google Search Console data to directly compare Citable Content Downloads recorded by each method.
Findings
This study provides evidence that log file analytics data appear to grossly over-report due to traffic from robots that are difficult to identify and screen. The study also introduces a proof-of-concept prototype that makes the research method easily accessible to IR managers who seek accurate counts of Citable Content Downloads.
Research limitations/implications
The method described in this paper does not account for direct access to Citable Content Downloads that originate outside Google Search properties.
Originality/value
This paper proposes that IR managers adopt a new reporting framework that classifies IR page views and download activity into three categories that communicate metrics about user activity related to the research process. It also proposes that IR managers rely on a hybrid of existing Google Services to improve reporting of Citable Content Downloads and offers a prototype web service where IR managers can test results for their repositories.
Alexandra Dimitroff and Kenning Arlitsch
Abstract
The purpose of this study was to determine the rate of self‐citation in the library and information science literature. A sample of 1,058 articles was examined. Fifty percent of the articles examined contained at least one self‐citation. Articles that were reports of research, that were written by a faculty member, that addressed a theoretical topic, or that had multiple authors were all more likely to have higher self‐citation rates. The self‐citation rate of 50% was higher than that reported in studies of self‐citation rates in the sciences and social sciences. However, self‐citations accounted for 6.6% of total citations, a share that falls between the percentage reported in the sciences and that reported in other social sciences.
Abstract
Purpose
The purpose of this paper is to examine the disruptive implications of the Espresso Book Machine on library user services, collection development, and special collections.
Design/methodology/approach
The approach takes into account various published articles and online statements to help articulate and support the arguments.
Findings
Library users can derive immediate benefit from a machine that prints books for them in only a few minutes. The EBM's impact on collection development in libraries may change a decades‐old model of speculative buying to one of buying on demand. The EBM can also help libraries bring high‐quality facsimiles of their unique special collections books to the public, and perhaps even generate a revenue stream that might offset costs.
Originality/value
This is an original article that draws on other published and unpublished research and opinion articles.
Kenning Arlitsch and Jeff Jonsson
Abstract
Purpose
The purpose of this paper is to describe the creation of the Mountain West Digital Library (MWDL), a cooperative regional program distributed throughout Utah and Nevada. Its metadata are aggregated at a single web site. Six digitization centers at the largest universities in both states digitize their own collections and support partner institutions in their geographic regions. Each center runs a CONTENTdm server, and an aggregating server at the University of Utah harvests metadata into a single searchable index. Local control and identity of collections are hallmarks of the MWDL.
Design/methodology/approach
This paper describes the technical structure of the MWDL, focusing on the multi‐site server aggregating software from DiMeMa Inc.
Findings
The MWDL was the first cooperative digital project to implement this software, and inspired the same infrastructure for a nine‐state project known as the Western Waters Digital Library. In 2005 the MWDL will also become the first in the nation to aggregate distributed digital newspaper collections.
Practical implications
Describes the creation and aggregation of a distributed regional digital library with digital asset management software that is already in use at over 200 institutions in the US. The solutions implemented in the MWDL offer a mechanism for large and small institutions to work together in a cost‐effective manner.
Originality/value
Examines the benefits and problems associated with creating a regional digital library.
Kenning Arlitsch and Patrick S. O'Brien
Abstract
Purpose
Google Scholar has difficulty indexing the contents of institutional repositories, and the authors hypothesize the reason is that most repositories use Dublin Core, which cannot express bibliographic citation information adequately for academic papers. Google Scholar makes specific recommendations for repositories, including the use of publishing industry metadata schemas over Dublin Core. This paper aims to test a theory that transforming metadata schemas in institutional repositories will lead to increased indexing by Google Scholar.
Design/methodology/approach
The authors conducted two surveys of institutional and disciplinary repositories across the USA, using different methodologies. They also conducted three pilot projects that transformed the metadata of a subset of papers from USpace, the University of Utah's institutional repository, and examined the results of Google Scholar's explicit harvests.
Findings
Repositories that use the metadata schemas recommended by Google Scholar and express them in HTML meta tags experienced significantly higher indexing ratios. The ease with which search engine crawlers can navigate a repository also seems to affect indexing ratio. The second and third metadata transformation pilot projects at Utah were successful, ultimately achieving an indexing ratio of greater than 90 percent.
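To illustrate what expressing metadata in HTML meta tags can look like, here is a minimal Python sketch. It is not the transformation used in the pilot projects. The `citation_*` tag names follow Google Scholar's published inclusion guidelines, while the record layout, the function names and the sample counts are hypothetical. The indexing ratio is assumed here to mean indexed titles divided by sampled titles.

```python
from html import escape


def scholar_meta_tags(record: dict) -> list[str]:
    """Render a simple record as Highwire Press-style <meta> tags.

    The record keys (title, authors, date, pdf_url) are an assumed layout
    chosen for this example, not a repository schema.
    """
    tags = [("citation_title", record["title"])]
    # One citation_author tag per author, in the order given.
    tags += [("citation_author", name) for name in record.get("authors", [])]
    if "date" in record:
        tags.append(("citation_publication_date", record["date"]))
    if "pdf_url" in record:
        tags.append(("citation_pdf_url", record["pdf_url"]))
    return [
        f'<meta name="{name}" content="{escape(value, quote=True)}">'
        for name, value in tags
    ]


def indexing_ratio(indexed: int, sampled: int) -> float:
    """Share of sampled titles found in the index (assumed definition)."""
    if sampled <= 0:
        raise ValueError("sampled must be positive")
    return indexed / sampled


if __name__ == "__main__":
    record = {
        "title": "Sample Paper",
        "authors": ["A. Author", "B. Author"],
        "date": "2012/01/15",
        "pdf_url": "https://example.org/paper.pdf",
    }
    print("\n".join(scholar_meta_tags(record)))
    # Hypothetical counts: 36 of 40 sampled titles found.
    print(indexing_ratio(36, 40))
```

Emitting one `citation_author` tag per author, rather than a single delimited field, keeps each name unambiguous for a crawler.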
Research limitations/implications
The second survey is limited to 40 titles from each of seven repositories, for a total of 280 titles. A larger survey that covers more repositories may be useful.
Practical implications
Institutional repositories are achieving significant mass, and the rate of author citations from those repositories may affect university rankings. Lack of visibility in Google Scholar, however, will limit the ability of IRs to play a more significant role in those citation rates.
Social implications
Transforming metadata can be a difficult and tedious process. The Institute of Museum and Library Services has recently awarded a National Leadership Grant to the University of Utah to continue SEO research with its partner, OCLC Inc., and to develop a toolkit that will include automated transformation mechanisms.
Originality/value
Little or no research has been published about improving the indexing ratio of institutional repositories in Google Scholar. The authors believe that they are the first to address the possibility of transforming IR metadata to improve indexing ratios in Google Scholar.