Search results

1 – 10 of 56
Article
Publication date: 2 May 2023

Giovanna Aracri, Antonietta Folino and Stefano Silvestri

The purpose of this paper is to propose a methodology for the enrichment and tailoring of a knowledge organization system (KOS), in order to support the information extraction…

Abstract

Purpose

The purpose of this paper is to propose a methodology for the enrichment and tailoring of a knowledge organization system (KOS), in order to support the information extraction (IE) task for the analysis of documents in the tourism domain. In particular, the KOS is used to develop a named entity recognition (NER) system.

Design/methodology/approach

A method to improve and customize an available thesaurus by leveraging documents related to the tourism in Italy is firstly presented. Then, the obtained thesaurus is used to create an annotated NER corpus, exploiting both distant supervision, deep learning and a light human supervision.

Findings

The study shows that a customized KOS can effectively support IE tasks when applied to documents belonging to the same domains and types used for its construction. Moreover, it is very useful to support and ease the annotation task using the proposed methodology, allowing to annotate a corpus with a fraction of the effort required for a manual annotation.

Originality/value

The paper explores an alternative use of a KOS, proposing an innovative NER corpus annotation methodology. Moreover, the KOS and the annotated NER data set will be made publicly available.

Details

Journal of Documentation, vol. 79 no. 6
Type: Research Article
ISSN: 0022-0418

Keywords

Article
Publication date: 13 October 2023

Judit Gárdos, Julia Egyed-Gergely, Anna Horváth, Balázs Pataki, Roza Vajda and András Micsik

The present study is about generating metadata to enhance thematic transparency and facilitate research on interview collections at the Research Documentation Centre, Centre for…

Abstract

Purpose

The present study is about generating metadata to enhance thematic transparency and facilitate research on interview collections at the Research Documentation Centre, Centre for Social Sciences (TK KDK) in Budapest. It explores the use of artificial intelligence (AI) in producing, managing and processing social science data and its potential to generate useful metadata to describe the contents of such archives on a large scale.

Design/methodology/approach

The authors combined manual and automated/semi-automated methods of metadata development and curation. The authors developed a suitable domain-oriented taxonomy to classify a large text corpus of semi-structured interviews. To this end, the authors adapted the European Language Social Science Thesaurus (ELSST) to produce a concise, hierarchical structure of topics relevant in social sciences. The authors identified and tested the most promising natural language processing (NLP) tools supporting the Hungarian language. The results of manual and machine coding will be presented in a user interface.

Findings

The study describes how an international social scientific taxonomy can be adapted to a specific local setting and tailored to be used by automated NLP tools. The authors show the potential and limitations of existing and new NLP methods for thematic assignment. The current possibilities of multi-label classification in social scientific metadata assignment are discussed, i.e. the problem of automated selection of relevant labels from a large pool.

Originality/value

Interview materials have not yet been used for building manually annotated training datasets for automated indexing of scientifically relevant topics in a data repository. Comparing various automated-indexing methods, this study shows a possible implementation of a researcher tool supporting custom visualizations and the faceted search of interview collections.

Article
Publication date: 27 July 2023

Navodana Rodrigo, Hossein Omrany, Ruidong Chang and Jian Zuo

This study aims to investigate the literature related to the use of digital technologies for promoting circular economy (CE) in the construction industry.

Abstract

Purpose

This study aims to investigate the literature related to the use of digital technologies for promoting circular economy (CE) in the construction industry.

Design/methodology/approach

A comprehensive approach was adopted, involving bibliometric analysis, text-mining analysis and content analysis to meet three objectives (1) to unveil the evolutionary progress of the field, (2) to identify the key research themes in the field and (3) to identify challenges hindering the implementation of digital technologies for CE.

Findings

A total of 365 publications was analysed. The results revealed eight key digital technologies categorised into two main clusters including “digitalisation and advanced technologies” and “sustainable construction technologies”. The former involved technologies, namely machine learning, artificial intelligence, deep learning, big data analytics and object detection and computer vision that were used for (1) forecasting construction and demolition (C&D) waste generation, (2) waste identification and classification and (3) computer vision for waste management. The latter included technologies such as Internet of Things (IoT), blockchain and building information modelling (BIM) that help optimise resource use, enhance transparency and sustainability practices in the industry. Overall, these technologies show great potential for improving waste management and enabling CE in construction.

Originality/value

This research employs a holistic approach to provide a status-quo understanding of the digital technologies that can be utilised to support the implementation of CE in construction. Further, this study underlines the key challenges associated with adopting digital technologies, whilst also offering opportunities for future improvement of the field.

Details

Smart and Sustainable Built Environment, vol. 13 no. 1
Type: Research Article
ISSN: 2046-6099

Keywords

Article
Publication date: 20 July 2023

Elaheh Hosseini, Kimiya Taghizadeh Milani and Mohammad Shaker Sabetnasab

This research aimed to visualize and analyze the co-word network and thematic clusters of the intellectual structure in the field of linked data during 1900–2021.

Abstract

Purpose

This research aimed to visualize and analyze the co-word network and thematic clusters of the intellectual structure in the field of linked data during 1900–2021.

Design/methodology/approach

This applied research employed a descriptive and analytical method, scientometric indicators, co-word techniques, and social network analysis. VOSviewer, SPSS, Python programming, and UCINet software were used for data analysis and network structure visualization.

Findings

The top ranks of the Web of Science (WOS) subject categorization belonged to various fields of computer science. Besides, the USA was the most prolific country. The keyword ontology had the highest frequency of co-occurrence. Ontology and semantic were the most frequent co-word pairs. In terms of the network structure, nine major topic clusters were identified based on co-occurrence, and 29 thematic clusters were identified based on hierarchical clustering. Comparisons between the two clustering techniques indicated that three clusters, namely semantic bioinformatics, knowledge representation, and semantic tools were in common. The most mature and mainstream thematic clusters were natural language processing techniques to boost modeling and visualization, context-aware knowledge discovery, probabilistic latent semantic analysis (PLSA), semantic tools, latent semantic indexing, web ontology language (OWL) syntax, and ontology-based deep learning.

Originality/value

This study adopted various techniques such as co-word analysis, social network analysis network structure visualization, and hierarchical clustering to represent a suitable, visual, methodical, and comprehensive perspective into linked data.

Details

Library Hi Tech, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0737-8831

Keywords

Article
Publication date: 13 November 2023

Ziyoung Park

This study aims to collect distributed knowledge organization systems (KOSs) from various domains, enrich each with meta information and link them to the multilingual KOS…

Abstract

Purpose

This study aims to collect distributed knowledge organization systems (KOSs) from various domains, enrich each with meta information and link them to the multilingual KOS registry, facilitating integrated search alongside KOSs from various languages and regions.

Design/methodology/approach

This research involved collecting and organizing KOS information through three primary steps. The initial phase involved finding KOSs from Web search results, supplemented by the Korea ON-line E-Procurement System (KONEPS) and the National R&D Integrated Notification Service. After obtaining these KOSs, they were enriched by structuring contextual meta information using Basic Register of Thesauri, Ontologies and Classification (BARTOC) metadata elements and established dedicated media wiki pages for each. Finally, the KOSs were linked to the multilingual KOS registry, BARTOC, ensuring seamless integration with KOSs from various languages and regions and creating connections between each registry entry and its associated KOS wiki page.

Findings

The research findings revealed several insights, as follows: (1) importance of a stable source for collecting KOS: no national body currently oversees KOS registration, underscoring the need for a systematic approach to collect dispersed KOSs. For Korean KOSs (K-KOSs), KONEPS and National R&D Integrated Notification Service are effective data sources. (2) Importance of enhanced metadata: merely collecting KOSs were not enough. Enhanced metadata bridges access gaps and dedicated wiki pages aid user identification and understanding. (3) Observations from multilingual registry uploads: When adding KOSs to a multilingual registry, similarities were observed across languages and regions. Recognizing this, the K-KOSs were linked with their international counterparts, fostering potential global collaboration.

Research limitations/implications

Due to the absence of a dedicated KOS registry agency, the study might have missed KOSs from certain fields or potentially over-collected from others. Furthermore, this study primarily focused on K-KOSs and their integration into the BARTOC registry, which might influence the methods and perspectives on collecting and establishing links among analogous KOSs in the registry.

Originality/value

This research pursued a stable method to detect KOS development and revisions across various fields. To facilitate this, we used the integrated e-procurement and R&D notification system and added meta information to aid in the identification and understanding of KOSs, which includes media wiki pages. Furthermore, link information was provided between the BARTOC registry and the Korean KOS websites and media wiki pages.

Details

The Electronic Library , vol. 41 no. 6
Type: Research Article
ISSN: 0264-0473

Keywords

Article
Publication date: 18 September 2023

Dongyuan Zhao, Zhongjun Tang and Fengxia Sun

This paper investigates the semantic association mechanisms of weak demand signals that facilitate innovative product development in terms of conceptual and temporal precedence…

Abstract

Purpose

This paper investigates the semantic association mechanisms of weak demand signals that facilitate innovative product development in terms of conceptual and temporal precedence, despite their inherent ambiguity and uncertainty.

Design/methodology/approach

To address this challenge, a domain ontology approach is proposed to construct a customer demand scenario-based framework that eliminates the blind spots in weak demand signal identification. The framework provides a basis for identifying such signals and introduces evaluation indices, such as depth, novelty and association, which are integrated to propose a three-dimensional weak signal recognition model based on domain ontology that outperforms existing research.

Findings

Empirical analysis is carried out based on customer comments of new energy vehicles on car platform such as “Auto Home” and “Bitauto”. Results demonstrate that in terms of recognition quantity, the three-dimensional weak demand signal recognition model, based on domain ontology, can accurately identify six demand weak signals. Conversely, the keyword analysis method exhibits a recognition quantity of four weak signals; in terms of recognition quality, the three-dimensional weak demand signal recognition model based on domain ontology can exclude non-demand signals such as “charging technology”, while keyword analysis methods cannot. Overall, the model proposed in this paper has higher sensitivity.

Originality/value

This paper proposes a novel method for identifying weak demand signals that considers the frequency of the signal's novelty, depth and relevance to the target demand. To verify its effectiveness, customer review data for new energy vehicles is used. The results provide a theoretical reference for formulating government policies and identifying weak demand signals for businesses.

Details

Kybernetes, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0368-492X

Keywords

Article
Publication date: 31 October 2023

Nadin Augustiniok, Claudine Houbart, Bie Plevoets and Koenraad Van Cleempoel

Adaptive reuse processes aim to preserve heritage values while creating new values through the architectural interventions that have become necessary. This claim provokes a…

Abstract

Purpose

Adaptive reuse processes aim to preserve heritage values while creating new values through the architectural interventions that have become necessary. This claim provokes a discussion about the meaning of values, how we can preserve them in practice and how we can translate them into architectural qualities that users experience. Riegl's understanding of the different perspectives of heritage values in the past and present opens up the possibility of identifying present values as a reflection of current social, material and political conditions in the architectural discourse.

Design/methodology/approach

This qualitative and practical study compares two Belgian projects to trace the use of values in adaptive reuse projects from an architectural design perspective. The Predikherenklooster, a 17th-century monastery in Mechelen that now houses the public library, and the C-Mine cultural centre in Genk, a former 20th-century coal mine, are compared. The starting point is Flemish legislation, which defines significance through values, distinguishing between 13 heritage values.

Findings

The study demonstrates the opportunities that axiological questions offer during the design process of an adaptive reuse project. They provide an overarching framework for tangible and intangible aspects that need to be discussed, particularly in terms of the link between what exists, the design strategy and their effect.

Originality/value

Adaptive reuse can draw on approaches from both heritage conservation and contemporary architecture and explore values as a tool for “re-designing” built heritage.

Details

Journal of Cultural Heritage Management and Sustainable Development, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2044-1266

Keywords

Open Access
Article
Publication date: 5 April 2024

Miquel Centelles and Núria Ferran-Ferrer

Develop a comprehensive framework for assessing the knowledge organization systems (KOSs), including the taxonomy of Wikipedia and the ontologies of Wikidata, with a specific…

Abstract

Purpose

Develop a comprehensive framework for assessing the knowledge organization systems (KOSs), including the taxonomy of Wikipedia and the ontologies of Wikidata, with a specific focus on enhancing management and retrieval with a gender nonbinary perspective.

Design/methodology/approach

This study employs heuristic and inspection methods to assess Wikipedia’s KOS, ensuring compliance with international standards. It evaluates the efficiency of retrieving non-masculine gender-related articles using the Catalan Wikipedian category scheme, identifying limitations. Additionally, a novel assessment of Wikidata ontologies examines their structure and coverage of gender-related properties, comparing them to Wikipedia’s taxonomy for advantages and enhancements.

Findings

This study evaluates Wikipedia’s taxonomy and Wikidata’s ontologies, establishing evaluation criteria for gender-based categorization and exploring their structural effectiveness. The evaluation process suggests that Wikidata ontologies may offer a viable solution to address Wikipedia’s categorization challenges.

Originality/value

The assessment of Wikipedia categories (taxonomy) based on KOS standards leads to the conclusion that there is ample room for improvement, not only in matters concerning gender identity but also in the overall KOS to enhance search and retrieval for users. These findings bear relevance for the design of tools to support information retrieval on knowledge-rich websites, as they assist users in exploring topics and concepts.

Article
Publication date: 28 December 2023

Na Xu, Yanxiang Liang, Chaoran Guo, Bo Meng, Xueqing Zhou, Yuting Hu and Bo Zhang

Safety management plays an important part in coal mine construction. Due to complex data, the implementation of the construction safety knowledge scattered in standards poses a…

Abstract

Purpose

Safety management plays an important part in coal mine construction. Due to complex data, the implementation of the construction safety knowledge scattered in standards poses a challenge. This paper aims to develop a knowledge extraction model to automatically and efficiently extract domain knowledge from unstructured texts.

Design/methodology/approach

Bidirectional encoder representations from transformers (BERT)-bidirectional long short-term memory (BiLSTM)-conditional random field (CRF) method based on a pre-training language model was applied to carry out knowledge entity recognition in the field of coal mine construction safety in this paper. Firstly, 80 safety standards for coal mine construction were collected, sorted out and marked as a descriptive corpus. Then, the BERT pre-training language model was used to obtain dynamic word vectors. Finally, the BiLSTM-CRF model concluded the entity’s optimal tag sequence.

Findings

Accordingly, 11,933 entities and 2,051 relationships in the standard specifications texts of this paper were identified and a language model suitable for coal mine construction safety management was proposed. The experiments showed that F1 values were all above 60% in nine types of entities such as security management. F1 value of this model was more than 60% for entity extraction. The model identified and extracted entities more accurately than conventional methods.

Originality/value

This work completed the domain knowledge query and built a Q&A platform via entities and relationships identified by the standard specifications suitable for coal mines. This paper proposed a systematic framework for texts in coal mine construction safety to improve efficiency and accuracy of domain-specific entity extraction. In addition, the pretraining language model was also introduced into the coal mine construction safety to realize dynamic entity recognition, which provides technical support and theoretical reference for the optimization of safety management platforms.

Details

Engineering, Construction and Architectural Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0969-9988

Keywords

Article
Publication date: 18 October 2023

L.P. Coladangelo

Through examination of the Library Reference Model (LRM) specifications for nomen and the potential challenges visual nomen might present for their description and use in…

Abstract

Purpose

Through examination of the Library Reference Model (LRM) specifications for nomen and the potential challenges visual nomen might present for their description and use in information systems, the purpose of this study was to investigate two questions: (1) how do nonlinguistic or nonalphanumeric signs or symbols act as nomen to identify entities? and (2) what details or attributes are relevant to describe and classify such nomen to integrate them into information systems?

Design/methodology/approach

This research was built on an exploratory, qualitative instrumental case study design using multiple (or comparative) cases. Using the International Federation of Library Associations and Institutions LRM conceptualization of nomen as the basis, this research explored the similarities and differences between the LRM definition, its attributes and the use of nonlinguistic and nonalphanumeric “strings” for visual nomen to represent a res, moving iteratively between the LRM documentation, visual nomen identified in previous research and additional examples. This study used a constant comparative method to conduct a structured, focused comparison across different cases found in the source survey.

Findings

A close review of the history of the development of the nomen entity was made to understand the semiotic relationship between entities and their symbolic representation, how those symbols are then reified to be further classified and described and how such definitions in the LRM offer a path forward for better understanding the role and function of visual nomen. Based on the foundation of the nomen entity and its attributes established in the LRM, this research then looked at visual representations of concepts and entities to suggest a nascent framework for describing aspects of visual nomen which may be relevant to their use and application

Originality/value

This exploratory study of the use of supralinguistic ways of referencing entities delineates novel insights into a potential framework for describing and using visual nomen as a way of labeling or naming entities represented in information systems. By examining the specifications of the nomen entity and its attributes as delineated by the LRM, this study reinforces the applicability of LRM-defined attributes in the use of visual nomen in addition to offering other attributes or dimensions.

Details

The Electronic Library , vol. 41 no. 6
Type: Research Article
ISSN: 0264-0473

Keywords

1 – 10 of 56