Search results

1 – 10 of over 6000
Article
Publication date: 24 January 2023

Li Si, Li Liu and Yi He

This paper aims to understand the current development situation of scientific data management policy in China, analyze the content structure of the policy and provide a…

Abstract

Purpose

This paper aims to understand the current development situation of scientific data management policy in China, analyze the content structure of the policy and provide a theoretical basis for the improvement and optimization of the policy system.

Design/methodology/approach

China's scientific data management policies were obtained through various channels such as searching government websites and policy and legal database, and 209 policies were finally identified as the sample for analysis after being screened and integrated. A three-dimensional framework was constructed based on the perspective of policy tools, combining stakeholder and lifecycle theories. And the content of policy texts was coded and quantitatively analyzed according to this framework.

Findings

China's scientific data management policies can be divided into four stages according to the time sequence: infancy, preliminary exploration, comprehensive promotion and key implementation. The policies use a combination of three types of policy tools: supply-side, environmental-side and demand-side, involving multiple stakeholders and covering all stages of the lifecycle. But policy tools and their application to stakeholders and lifecycle stages are imbalanced. The development of future scientific data management policy should strengthen the balance of policy tools, promote the participation of multiple subjects and focus on the supervision of the whole lifecycle.

Originality/value

This paper constructs a three-dimensional analytical framework and uses content analysis to quantitatively analyze scientific data management policy texts, extending the research perspective and research content in the field of scientific data management. The study identifies policy focuses and proposes several strategies that will help optimize the scientific data management policy.

Details

Aslib Journal of Information Management, vol. 76 no. 2
Type: Research Article
ISSN: 2050-3806

Keywords

Open Access
Article
Publication date: 14 July 2022

Chunlai Yan, Hongxia Li, Ruihui Pu, Jirawan Deeprasert and Nuttapong Jotikasthira

This study aims to provide a systematic and complete knowledge map for use by researchers working in the field of research data. Additionally, the aim is to help them quickly…

1703

Abstract

Purpose

This study aims to provide a systematic and complete knowledge map for use by researchers working in the field of research data. Additionally, the aim is to help them quickly understand the authors' collaboration characteristics, institutional collaboration characteristics, trending research topics, evolutionary trends and research frontiers of scholars from the perspective of library informatics.

Design/methodology/approach

The authors adopt the bibliometric method, and with the help of bibliometric analysis software CiteSpace and VOSviewer, quantitatively analyze the retrieved literature data. The analysis results are presented in the form of tables and visualization maps in this paper.

Findings

The research results from this study show that collaboration between scholars and institutions is weak. It also identified the current hotspots in the field of research data, these being: data literacy education, research data sharing, data integration management and joint library cataloguing and data research support services, among others. The important dimensions to consider for future research are the library's participation in a trans-organizational and trans-stage integration of research data, functional improvement of a research data sharing platform, practice of data literacy education methods and models, and improvement of research data service quality.

Originality/value

Previous literature reviews on research data are qualitative studies, while few are quantitative studies. Therefore, this paper uses quantitative research methods, such as bibliometrics, data mining and knowledge map, to reveal the research progress and trend systematically and intuitively on the research data topic based on published literature, and to provide a reference for the further study of this topic in the future.

Details

Library Hi Tech, vol. 42 no. 1
Type: Research Article
ISSN: 0737-8831

Keywords

Open Access
Article
Publication date: 8 February 2023

Edoardo Ramalli and Barbara Pernici

Experiments are the backbone of the development process of data-driven predictive models for scientific applications. The quality of the experiments directly impacts the model…

Abstract

Purpose

Experiments are the backbone of the development process of data-driven predictive models for scientific applications. The quality of the experiments directly impacts the model performance. Uncertainty inherently affects experiment measurements and is often missing in the available data sets due to its estimation cost. For similar reasons, experiments are very few compared to other data sources. Discarding experiments based on the missing uncertainty values would preclude the development of predictive models. Data profiling techniques are fundamental to assess data quality, but some data quality dimensions are challenging to evaluate without knowing the uncertainty. In this context, this paper aims to predict the missing uncertainty of the experiments.

Design/methodology/approach

This work presents a methodology to forecast the experiments’ missing uncertainty, given a data set and its ontological description. The approach is based on knowledge graph embeddings and leverages the task of link prediction over a knowledge graph representation of the experiments database. The validity of the methodology is first tested in multiple conditions using synthetic data and then applied to a large data set of experiments in the chemical kinetic domain as a case study.

Findings

The analysis results of different test case scenarios suggest that knowledge graph embedding can be used to predict the missing uncertainty of the experiments when there is a hidden relationship between the experiment metadata and the uncertainty values. The link prediction task is also resilient to random noise in the relationship. The knowledge graph embedding outperforms the baseline results if the uncertainty depends upon multiple metadata.

Originality/value

The employment of knowledge graph embedding to predict the missing experimental uncertainty is a novel alternative to the current and more costly techniques in the literature. Such contribution permits a better data quality profiling of scientific repositories and improves the development process of data-driven models based on scientific experiments.

Article
Publication date: 20 November 2023

Laksmi Laksmi, Muhammad Fadly Suhendra, Shamila Mohamed Shuhidan and Umanto Umanto

This study aims to identify the readiness of institutional repositories in Indonesia to implement digital humanities (DH) data curation. Data curation is a method of managing…

Abstract

Purpose

This study aims to identify the readiness of institutional repositories in Indonesia to implement digital humanities (DH) data curation. Data curation is a method of managing research data that maintains the data’s accuracy and makes it available for reuse. It requires controlled data management.

Design/methodology/approach

The study uses a qualitative approach. Data collection was carried out through a focus group discussion in September–October 2022, interviews and document analysis. The informants came from four institutions in Indonesia.

Findings

The findings reveal that the national research repository has implemented data curation, albeit not optimally. Within the case study, one of the university repositories diligently curates its humanities data and has established networks extending to various ASEAN countries. Both the national archive repository and the other university repository have implemented rudimentary data curation practices but have not prioritized them. In conclusion, the readiness of the national research repository and the university repository stand at the high-capacity stage, while the national archive repository and the other university repository are at the established and early stages of data curation, respectively.

Research limitations/implications

This study examined only four repositories due to time constraints. Nonetheless, the four institutions were able to provide a comprehensive picture of their readiness for DH data curation management.

Practical implications

This study provides insight into strategies for developing DH data curation activities in institutional repositories. It also highlights the need for professional development for curators so they can devise and implement stronger ownership policies and data privacy to support a data-driven research agenda.

Originality/value

This study describes the preparations that must be considered by institutional repositories in the development of DH data curation activities.

Article
Publication date: 19 April 2023

Aasif Mohammad Khan, Fayaz Ahmad Loan, Umer Yousuf Parray and Sozia Rashid

Data sharing is increasingly being recognized as an essential component of scholarly research and publishing. Sharing data improves results and propels research and discovery…

Abstract

Purpose

Data sharing is increasingly being recognized as an essential component of scholarly research and publishing. Sharing data improves results and propels research and discovery forward. Given the importance of data sharing, the purpose of the study is to unveil the present scenario of research data repositories (RDR) and sheds light on strategies and tactics followed by different countries for efficient organization and optimal use of scientific literature.

Design/methodology/approach

The data for the study is collected from registry of RDR (re3data registry) (re3data.org), which covers RDR from different academic disciplines and provides filtration options “Search” and “Browse” to access the repositories. Using these filtration options, the researchers collected metadata of repositories i.e. country wise contribution, content-type data, repository language interface, software usage, metadata standards and data access type. Furthermore, the data was exported to Google Sheets for analysis and visualization.

Findings

The re3data registry holds a rich and diverse collection of data repositories from the majority of countries all over the world. It is revealed that English is the dominant language, and the most widely used software for the creation of data repositories are “DataVerse”, followed by “Dspace” and “MySQL”. The most frequently used metadata standards are “Dublin Core” and “Datacite metadata schema”. The majority of repositories are open, with more than half of the repositories being “disciplinary” in nature, and the most significant data sources include “scientific and statistical data” followed by “standard office documents”.

Research limitations/implications

The main limitation of the study is that the findings are based on the data collected through a single registry of repositories, and only a few characteristic features were investigated.

Originality/value

The study will benefit all countries with a small number of data repositories or no repositories at all, with tools and techniques used by the top repositories to ensure long-term storage and accessibility to research data. In addition to this, the study provides a global overview of RDR and its characteristic features.

Details

Information Discovery and Delivery, vol. 52 no. 1
Type: Research Article
ISSN: 2398-6247

Keywords

Article
Publication date: 10 April 2024

Zhongyi Wang, Xueyao Qiao, Jing Chen, Lina Li, Haoxuan Zhang, Junhua Ding and Haihua Chen

This study aims to establish a reliable index to identify interdisciplinary breakthrough innovation effectively. We constructed a new index, the DDiv index, for this purpose.

Abstract

Purpose

This study aims to establish a reliable index to identify interdisciplinary breakthrough innovation effectively. We constructed a new index, the DDiv index, for this purpose.

Design/methodology/approach

The DDiv index incorporates the degree of interdisciplinarity in the breakthrough index. To validate the index, a data set combining the publication records and citations of Nobel Prize laureates was divided into experimental and control groups. The validation methods included sensitivity analysis, correlation analysis and effectiveness analysis.

Findings

The sensitivity analysis demonstrated the DDiv index’s ability to differentiate interdisciplinary breakthrough papers from various categories of papers. This index not only retains the strengths of the existing index in identifying breakthrough innovation but also captures interdisciplinary characteristics. The correlation analysis revealed a significant correlation (correlation coefficient = 0.555) between the interdisciplinary attributes of scientific research and the occurrence of breakthrough innovation. The effectiveness analysis showed that the DDiv index reached the highest prediction accuracy of 0.8. Furthermore, the DDiv index outperforms the traditional DI index in terms of accuracy when it comes to identifying interdisciplinary breakthrough innovation.

Originality/value

This study proposed a practical and effective index that combines interdisciplinary and disruptive dimensions for detecting interdisciplinary breakthrough innovation. The identification and measurement of interdisciplinary breakthrough innovation play a crucial role in facilitating the integration of multidisciplinary knowledge, thereby accelerating the scientific breakthrough process.

Details

The Electronic Library , vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0264-0473

Keywords

Article
Publication date: 15 March 2024

Beatrice Arthur and Thomas van der Walt

The purpose of this study is to investigate the current research data management practices among researchers in Ghana and their impact on data reuse and collaborative research…

Abstract

Purpose

The purpose of this study is to investigate the current research data management practices among researchers in Ghana and their impact on data reuse and collaborative research. The study aims to identify the methods used by researchers to store and preserve their research data, as well as to determine the extent to which researchers share their data with others.

Design/methodology/approach

The study uses a mixed-method research strategy to blend qualitative and quantitative data and is conducted at two public and two private universities in Ghana.

Findings

The study revealed that researchers in Ghana currently store and preserve their research data using personal devices, such as laptops, CDs and external flash drives, rather than keeping the data in university data repositories. They also do not share their research data with others, which negatively affects collaborative research. The current practice of storing data on personal devices and not sharing data with others hinders collaborative research. The study recommends that universities in Ghana revise their research policy documents to address RDM-related issues such as data storage, data preservation, data sharing and data reuse.

Research limitations/implications

The study was conducted at two public and two private universities in Ghana, but the findings were placed in a wider context through appropriate references.

Practical implications

This study emphasises the need for sound research data management procedures to support research collaboration and data reuse in Ghana. Universities should provide incentives to academics to disclose their data to encourage data sharing and collaboration.

Social implications

The government and management of universities should consciously invest in the needed technologies and equipment to implement research data management in their universities.

Originality/value

This study looks at how researchers in Ghana manage their research data and how it affects data reuse and collaborative research.

Details

Library Management, vol. 45 no. 3/4
Type: Research Article
ISSN: 0143-5124

Keywords

Open Access
Article
Publication date: 31 October 2023

Neema Florence Mosha and Patrick Ngulube

The study aims to investigate the utilisation of open research data repositories (RDRs) for storing and sharing research data in higher learning institutions (HLIs) in Tanzania.

Abstract

Purpose

The study aims to investigate the utilisation of open research data repositories (RDRs) for storing and sharing research data in higher learning institutions (HLIs) in Tanzania.

Design/methodology/approach

A survey research design was employed to collect data from postgraduate students at the Nelson Mandela African Institution of Science and Technology (NM-AIST) in Arusha, Tanzania. The data were collected and analysed quantitatively and qualitatively. A census sampling technique was employed to select the sample size for this study. The quantitative data were analysed using the Statistical Package for the Social Sciences (SPSS), whilst the qualitative data were analysed thematically.

Findings

Less than half of the respondents were aware of and were using open RDRs, including Zenodo, DataVerse, Dryad, OMERO, GitHub and Mendeley data repositories. More than half of the respondents were not willing to share research data and cited a lack of ownership after storing their research data in most of the open RDRs and data security. HILs need to conduct training on using trusted repositories and motivate postgraduate students to utilise open repositories (ORs). The challenges for underutilisation of open RDRs were a lack of policies governing the storage and sharing of research data and grant constraints.

Originality/value

Research data storage and sharing are of great interest to researchers in HILs to inform them to implement open RDRs to support these researchers. Open RDRs increase visibility within HILs and reduce research data loss, and research works will be cited and used publicly. This paper identifies the potential for additional studies focussed on this area.

Article
Publication date: 4 May 2022

Muhammad Inaam ul haq, Qianmu Li and Jun Hou

Special education is the education segment that deals with the students facing hurdles in the traditional education system. Research data have evolved in the domain of special…

Abstract

Purpose

Special education is the education segment that deals with the students facing hurdles in the traditional education system. Research data have evolved in the domain of special education due to scientific advances. The present study aims to employ text mining to extract the latent patterns from the scientific data.

Design/methodology/approach

This study examined the 12,781 Scopus-indexed titles, abstracts and keywords published from 1987 to 2021 through an integrated text-mining and topic modeling approach. It combines dynamic topic models with highly cited reviews of this domain. It facilitates the extraction of topic clusters and communities in the topic network.

Findings

This methodology discovered children’s communication and speech using gaming techniques, mental retardation, cost effect on infant birth, involvement of special education children and their families, assistive technology information for special education, syndrome epilepsy and the impact of group study on skill development peers or self as the hottest topic of research in this domain. In addition to finding research hotspots, it further explores annual topic proportion trends, topic correlations and intertopic research areas.

Originality/value

The results provide a comprehensive summary of the popularity of research topics in special education in the past 34 years, and the results can provide useful insights and implications, and it could be used as a guide for contributors in special education form a structured view of past research and plan future research directions.

Details

Library Hi Tech, vol. 41 no. 6
Type: Research Article
ISSN: 0737-8831

Keywords

Article
Publication date: 6 June 2023

Archana S.N. and Padmakumar P.K.

The purpose of this study was to understand the landscape of Indian research data repositories (RDRs) indexed in the re3data.org. The study analysed the metadata elements of…

Abstract

Purpose

The purpose of this study was to understand the landscape of Indian research data repositories (RDRs) indexed in the re3data.org. The study analysed the metadata elements of Indian RDRs to identify their disciplinary orientations, typology, standards adopted, foreign collaborations, etc. The study ascertained the current status of the Indian RDRs by visiting their respective websites and tried to identify and map the exact disciplinary orientation of each RDR.

Design/methodology/approach

The study used “content analysis” of the metadata elements extracted from re3data.org along with the information analysis of the respective websites of the registered RDRs.

Findings

The study identified that only 80% of the Indian RDRs listed by the re3data.org is currently active. Most of the Indian RDRs are hosted by the central and state governments and are almost equally distributed among Life Sciences, Natural Sciences and Social Sciences domains. The data provided by the re3data.org for the Indian RDRs are not complete and up-to-date.

Practical implications

The findings indicate the presence of a good number of inactive RDRs in the re3data.org. The study suggests using a revised version of the DFG subject classification scheme or considering a standard classification scheme for subject indexing.

Originality/value

To the best of the authors’ knowledge, this study is the first of its kind that critically analysed the metadata values extracted and moved further to identify the current status of Indian RDRs.

Details

Digital Library Perspectives, vol. 39 no. 4
Type: Research Article
ISSN: 2059-5816

Keywords

1 – 10 of over 6000