Search results

Open Access
Article
Publication date: 23 January 2024

Luís Jacques de Sousa, João Poças Martins, Luís Sanhudo and João Santos Baptista

Abstract

Purpose

This study aims to review recent advances towards the implementation of artificial neural network (ANN) and natural language processing (NLP) applications during the budgeting phase of the construction process. During this phase, construction companies must assess the scope of each task and map the client’s expectations to an internal database of tasks, resources and costs. Quantity surveyors carry out this assessment manually, with little to no computer aid and under very tight time constraints, even though the results determine the company’s bid quality and are contractually binding.

Design/methodology/approach

This paper seeks to compile applications of machine learning (ML) and natural language processing in the architectural engineering and construction sector to find which methodologies can assist this assessment. The paper carries out a systematic literature review, following the preferred reporting items for systematic reviews and meta-analyses guidelines, to survey the main scientific contributions within the topic of text classification (TC) for budgeting in construction.
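To make the text classification (TC) task concrete, here is a minimal sketch of mapping a free-text task description to an internal task catalogue by bag-of-words cosine similarity. It is not a method taken from the review. The catalogue codes, descriptions and function names are all hypothetical and exist only for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical internal task catalogue: codes and descriptions are invented.
TASK_CATALOGUE = {
    "CONC-01": "cast in place reinforced concrete foundation",
    "MAS-02": "brick masonry wall with mortar joints",
    "PNT-03": "interior wall painting two coats",
}


def tokenize(text):
    """Lower-case the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def classify(description):
    """Return the best-matching catalogue code and its similarity score."""
    query = Counter(tokenize(description))
    scores = {
        code: cosine(query, Counter(tokenize(text)))
        for code, text in TASK_CATALOGUE.items()
    }
    best = max(scores, key=scores.get)
    return best, scores[best]


code, score = classify("Painting of interior walls, two coats")
```

A score close to zero would be a natural trigger for the expert validation that the review says is still needed.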

Findings

This work concludes that it is necessary to develop data sets that represent the variety of tasks in construction, achieve higher-accuracy algorithms, widen the scope of their application and reduce the need for expert validation of the results. Although full automation is not within reach in the short term, TC algorithms can provide helpful support tools.

Originality/value

Given the increasing interest in ML for construction and recent developments, the findings disclosed in this paper contribute to the body of knowledge, provide a more automated perspective on budgeting in construction and break ground for further implementation of text-based ML in budgeting for construction.

Details

Construction Innovation, vol. 24 no. 7
Type: Research Article
ISSN: 1471-4175

Article
Publication date: 19 December 2022

Farshid Danesh and Somayeh Ghavidel

Abstract

Purpose

This longitudinal study examined the structure of the knowledge organization (KO) realm, its concept clusters and emerging KO events, based on co-occurrence analysis.

Design/methodology/approach

This longitudinal study uses co-occurrence analysis. The research population comprises the keywords of articles indexed in the Web of Science Core Collection for 1975–1999 and 2000–2018. Hierarchical clustering, multidimensional scaling and co-occurrence analysis were used to conduct the research. SPSS, UCINET, VOSviewer and NetDraw were used to analyze and visualize the data.
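As a rough illustration of the counting step behind keyword co-occurrence analysis, the sketch below tallies how often two keywords appear on the same article. The study itself used SPSS, UCINET, VOSviewer and NetDraw. The function name and the sample keyword lists here are assumptions made for illustration, not data from the study.

```python
from collections import Counter
from itertools import combinations


def cooccurrence(keyword_lists):
    """Count how many articles each unordered keyword pair shares."""
    pairs = Counter()
    for keywords in keyword_lists:
        # Normalize case and drop duplicates within one article.
        unique = sorted({k.lower() for k in keywords})
        for a, b in combinations(unique, 2):
            pairs[(a, b)] += 1
    return pairs


# Invented sample: one keyword list per article.
articles = [
    ["Information Retrieval", "Knowledge Management"],
    ["Information Retrieval", "Knowledge Management", "Information Literacy"],
    ["Information Literacy", "Information Retrieval"],
]
counts = cooccurrence(articles)
```

The resulting pair counts form the symmetric matrix that clustering and multidimensional scaling would take as input.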

Findings

“Information Technology” in 1975–1999 and “Information Literacy” in 2000–2018 had the highest frequencies and were identified as the most widely used KO keywords worldwide. In the first period, the cluster “Knowledge Management” had the highest centrality and the cluster “Strategic Planning” had the highest density; in 2000–2018, the cluster “Information Retrieval” had the highest centrality and density. The two-dimensional map of KO themes and the clustering of KO topics by cluster analysis indicate that, in the periods examined, thematic clusters overlapped considerably in concept and content.
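The abstract does not say how cluster centrality and density were computed. A common simplified reading, assumed here, takes density as the mean link weight between keywords inside a cluster and centrality as the total weight of links from the cluster to keywords outside it. The function names and the toy weights below are hypothetical.

```python
from itertools import combinations


def density(cluster, weights):
    """Mean co-occurrence weight over all keyword pairs inside the cluster."""
    pairs = list(combinations(sorted(cluster), 2))
    if not pairs:
        return 0.0
    return sum(weights.get(pair, 0) for pair in pairs) / len(pairs)


def centrality(cluster, weights):
    """Total weight of links joining a cluster keyword to an outside keyword."""
    total = 0
    for (a, b), weight in weights.items():
        # Exactly one endpoint inside the cluster means an external link.
        if (a in cluster) != (b in cluster):
            total += weight
    return total


# Toy co-occurrence weights, keyed by alphabetically sorted keyword pairs.
weights = {("a", "b"): 3, ("a", "c"): 1, ("b", "c"): 2, ("c", "d"): 4}
cluster = {"a", "b"}
```

Under these definitions, a cluster with high density is internally cohesive, while one with high centrality is strongly tied to the rest of the field.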

Originality/value

The present article uses a longitudinal study to examine KO publications over the past half-century, applying hierarchical clustering and multidimensional scaling methods. Studying the concepts and thematic trends in KO can inform the organization of information, which is the core activity of libraries, museums and archives. It can also help plan information organization and promote knowledge management. The results can help KO policymakers determine and design roadmaps, research planning, and micro and macro budgeting processes.

Details

Global Knowledge, Memory and Communication, vol. 73 no. 6/7
Type: Research Article
ISSN: 2514-9342

Article
Publication date: 31 January 2023

Mrinalini Luthra, Konstantin Todorov, Charles Jeurgens and Giovanni Colavizza

Abstract

Purpose

This paper aims to expand the scope and mitigate the biases of extant archival indexes.

Design/methodology/approach

The authors use automatic entity recognition on the archives of the Dutch East India Company to extract mentions of underrepresented people.

Findings

The authors release an annotated corpus and baselines for a shared task and show that the proposed goal is feasible.

Originality/value

Colonial archives are increasingly a focus of attention for historians and the public, and broadening access to them is a pressing need for archives.

Article
Publication date: 30 August 2024

Joseph Yaw Dawson and Ebenezer Agbozo

Abstract

Purpose

The purpose of this study is to provide an overview of artificial intelligence (AI) in the talent management sphere. The study seeks to contribute to the body of knowledge with respect to human resource management and AI by conducting a literature review on the integration of AI in talent management, synthesising existing approaches and frameworks, as well as emphasising potential benefits.

Design/methodology/approach

The study adopts desk research, computational literature review (CLR) and uses topic modelling [with bidirectional encoder representations from transformers (BERTopic)] to throw light on the diffusion of AI in talent management.

Findings

The study’s main finding is that AI in talent management is developing gradually, in tandem with the growth of AI. We deduced that there is a link between talent management practices (planning, recruitment, compensation and rewards, performance management, employee empowerment, employee engagement and organisational culture) and AI. Though there are some known fears with regard to using the innovation, the benefits outweigh the drawbacks.

Research limitations/implications

The current study has some limitations, chiefly the scope and size of the sample. No form of qualitative analytics was used; as a result, the information obtained was limited. The study provides a snapshot of AI in talent management and helps address the lack of literature spanning the two fields. It also gives practitioners and experts an overview of where to target investments and resources if need be.

Originality/value

The originality of this study comes from the combination of CLR methods and the use of topic modelling with BERTopic, which has not been used by previous reviews. In addition, the study identifies the salient machine learning algorithms, which other studies have not identified.

Details

Journal of Science and Technology Policy Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2053-4620

Article
Publication date: 5 June 2024

Azanzi Jiomekong and Sanju Tiwari

Abstract

Purpose

This paper aims to curate open research knowledge graph (ORKG) with papers related to ontology learning and define an approach using ORKG as a computer-assisted tool to organize key-insights extracted from research papers.

Design/methodology/approach

Action research was used to explore, test and evaluate the use of the Open Research Knowledge Graph as a computer-assisted tool for knowledge acquisition from scientific papers.

Findings

To extract, structure and describe research contributions, the granularity of information should be decided; to facilitate the comparison of scientific papers, one should design a common template that will be used to describe the state of the art of a domain.

Originality/value

This approach is currently used to document “food information engineering,” “tabular data to knowledge graph matching” and “question answering” research problems and the “neurosymbolic AI” domain. More than 200 papers are ingested in ORKG. From these papers, more than 800 contributions are documented and these contributions are used to build over 100 comparison tables. At the end of this work, we found that ORKG is a valuable tool that can reduce the working curve of state-of-the-art research.

Article
Publication date: 11 July 2024

Chunxiu Qin, Yulong Wang, XuBu Ma, Yaxi Liu and Jin Zhang

Abstract

Purpose

To address the shortcomings of existing academic user information needs identification methods, such as low efficiency and high subjectivity, this study aims to propose an automated method of identifying online academic user information needs.

Design/methodology/approach

This study’s method consists of two main parts: the first is the automatic classification of academic user information needs based on the bidirectional encoder representations from transformers (BERT) model. The second is the key content extraction of academic user information needs based on the improved MDERank key phrase extraction (KPE) algorithm. Finally, the applicability and effectiveness of the method are verified by an example of identifying the information needs of academic users in the field of materials science.

Findings

Experimental results show that the BERT-based information needs classification model achieved the highest weighted average F1 score of 91.61%. The improved MDERank KPE algorithm achieves the highest F1 score of 61%. The empirical analysis results reveal that the information needs of the categories “methods,” “experimental phenomena” and “experimental materials” are relatively high in the materials science field.
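For readers unfamiliar with the metric, a weighted average F1 score averages the per-class F1 scores, weighting each class by its share of the true labels. The sketch below shows that computation in plain Python. The function name and the sample labels are assumptions for illustration and are not the study's data.

```python
from collections import Counter


def weighted_f1(y_true, y_pred):
    """Per-class F1 averaged with weights proportional to class support."""
    support = Counter(y_true)
    total = len(y_true)
    score = 0.0
    for label, n in support.items():
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = n - tp
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (
            2 * precision * recall / (precision + recall)
            if precision + recall
            else 0.0
        )
        score += f1 * n / total
    return score


# Invented labels borrowing category names from the abstract.
y_true = ["methods", "methods", "materials", "phenomena"]
y_pred = ["methods", "materials", "materials", "phenomena"]
result = weighted_f1(y_true, y_pred)
```

Because the weights follow class support, frequent categories dominate the score, which is worth keeping in mind when categories are imbalanced.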

Originality/value

This study provides a solution for automated identification of academic user information needs. It helps online academic resource platforms to better understand their users’ information needs, which in turn facilitates the platform’s academic resource organization and services.

Details

The Electronic Library, vol. 42 no. 5
Type: Research Article
ISSN: 0264-0473

Article
Publication date: 27 February 2023

Dilawar Ali, Kenzo Milleville, Steven Verstockt, Nico Van de Weghe, Sally Chambers and Julie M. Birkholz

Abstract

Purpose

Historical newspaper collections provide a wealth of information about the past. Although the digitization of these collections significantly improves their accessibility, a large portion of digitized historical newspaper collections, such as those of KBR, the Royal Library of Belgium, are not yet searchable at article-level. However, recent developments in AI-based research methods, such as document layout analysis, have the potential for further enriching the metadata to improve the searchability of these historical newspaper collections. This paper aims to discuss the aforementioned issue.

Design/methodology/approach

In this paper, the authors explore how existing computer vision and machine learning approaches can be used to improve access to digitized historical newspapers. To do this, the authors propose a workflow, using computer vision and machine learning approaches to (1) provide article-level access to digitized historical newspaper collections using document layout analysis, (2) extract specific types of articles (e.g. feuilletons – literary supplements from Le Peuple from 1938), (3) conduct image similarity analysis using (un)supervised classification methods and (4) perform named entity recognition (NER) to link the extracted information to open data.

Findings

The results show that the proposed workflow improves the accessibility and searchability of digitized historical newspapers, and also contributes to the building of corpora for digital humanities research. The AI-based methods enable automatic extraction of feuilletons, clustering of similar images and dynamic linking of related articles.

Originality/value

The proposed workflow enables automatic extraction of articles, including detection of a specific type of article, such as a feuilleton or literary supplement. This is particularly valuable for humanities researchers as it improves the searchability of these collections and enables corpora to be built around specific themes. Article-level access to, and improved searchability of, KBR's digitized newspapers are demonstrated through the online tool (https://tw06v072.ugent.be/kbr/).

Article
Publication date: 19 July 2024

Kuoyi Lin, Xiaoyang Kan and Meilian Liu

Abstract

Purpose

This study develops and validates an innovative approach for extracting knowledge from online user reviews by integrating textual content and emojis. Recognizing the pivotal role emojis play in enhancing the expressiveness and emotional depth of digital communication, this study aims to address the significant gap in existing sentiment analysis models, which have largely overlooked the contribution of emojis in interpreting user preferences and sentiments. By constructing a comprehensive model that synergizes emotional and semantic information conveyed through emojis and text, this study seeks to provide a more nuanced understanding of user preferences, thereby enhancing the accuracy and depth of knowledge extraction from online reviews. The goal is to offer a robust framework that enables more effective and empathetic engagement with user-generated content on digital platforms, paving the way for improved service delivery, product development and customer satisfaction through informed insights into consumer behavior and sentiments.

Design/methodology/approach

This study uses a structured methodology to integrate and analyze text and emojis from online reviews for effective knowledge extraction, focusing on user preferences and sentiments. This methodology consists of four key stages. First, this study leverages high-frequency noun analysis to identify and extract product attributes mentioned in online user reviews. By focusing on nouns that appear frequently, the authors can systematically discern the primary features or aspects of products that users discuss, thereby providing a foundation for a more detailed sentiment and preference analysis. Second, a foundational sentiment dictionary is established that incorporates sentiment-bearing words, intensifiers and negation terms to analyze the textual part of the reviews. This dictionary is used to assign sentiment scores to phrases and sentences within reviews, allowing the quantification of textual sentiments based on the presence and combination of these predefined lexical items. Third, an emoticon sentiment dictionary is developed to address the emotional content conveyed through emojis. This dictionary categorizes emojis based on their associated sentiments, thus enabling the quantification of emotional expressions in reviews. The sentiment scores derived from the emojis are then integrated with those from the textual analysis. This integration considers the weights of text- and emoji-based emotions to compute a comprehensive attribute sentiment score that reflects a nuanced understanding of user sentiments and preferences. Finally, the authors conduct an empirical study to validate the effectiveness of the proposed methodology in mining user preferences from online reviews by applying the approach to a data set of online reviews and evaluating its ability to accurately identify product attributes and user sentiments. 
The validation process assessed the reliability and accuracy of the methodology in extracting meaningful insights from the complex interplay between text and emojis. This study offers a holistic and nuanced framework for knowledge extraction from online reviews, capturing both explicit and implicit sentiments expressed by users through text and emojis. By integrating these elements, this study seeks to provide a comprehensive understanding of user preferences, contributing to improved consumer insight and strategic decision-making for businesses and researchers.
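The second and third stages and the final weighting step can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' model. Every lexicon entry, the rule that negations and intensifiers modify the next sentiment word, and the 0.7/0.3 weights are invented for the example.

```python
# Hypothetical lexicons: all entries and scores are invented for illustration.
SENTIMENT = {"good": 1.0, "great": 2.0, "bad": -1.0, "terrible": -2.0}
INTENSIFIERS = {"very": 1.5, "slightly": 0.5}
NEGATIONS = {"not", "never"}
EMOJI_SENTIMENT = {"😀": 1.0, "😡": -2.0, "👍": 1.0}


def text_score(tokens):
    """Sum sentiment words, scaled by preceding intensifiers and negations."""
    score = 0.0
    multiplier = 1.0
    for token in tokens:
        if token in NEGATIONS:
            multiplier *= -1.0
        elif token in INTENSIFIERS:
            multiplier *= INTENSIFIERS[token]
        elif token in SENTIMENT:
            score += multiplier * SENTIMENT[token]
            multiplier = 1.0  # modifiers apply to one sentiment word only
    return score


def emoji_score(tokens):
    """Sum the sentiment scores of any emojis among the tokens."""
    return sum(EMOJI_SENTIMENT.get(token, 0.0) for token in tokens)


def attribute_score(tokens, text_weight=0.7, emoji_weight=0.3):
    """Weighted combination of text-based and emoji-based sentiment."""
    return text_weight * text_score(tokens) + emoji_weight * emoji_score(tokens)


review = "screen great 👍".split()
combined = attribute_score(review)
```

In practice the tokens would first be grouped by product attribute, using the high-frequency nouns from the first stage, so that each attribute gets its own score.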

Findings

The application of the proposed methodology for integrating emojis with text in online reviews yields significant findings that underscore the feasibility and value of extracting realistic user knowledge to gain insights from user-generated content. The analysis successfully captured consumer preferences, which are instrumental in informing service decisions and driving innovation. This achievement is largely attributed to the development and utilization of a comprehensive emotion-sentiment dictionary tailored to interpret the complex interplay between textual and emoji-based expressions in online reviews. By implementing a sentiment calculation model that intricately combines textual sentiment analysis with emoji sentiment analysis, this study was able to accurately determine the final attribute emotion for various product features discussed in the reviews. This model effectively characterized the emotional knowledge of online users and provided a nuanced understanding of their sentiments and preferences. The emotional knowledge extracted is not only quantifiable but also rich in context, offering deeper insights into consumer behavior and attitudes. Furthermore, a case analysis is conducted to rigorously test the validity of the proposed model in a real-world scenario. This practical examination revealed that the model is not only capable of accurately extracting and analyzing user preferences but is also adaptable to different contexts and product categories. The case analysis highlights the robustness and flexibility of the model, demonstrating its potential to enhance the precision of knowledge extraction processes significantly. Overall, the results confirm the effectiveness of the proposed approach in integrating text and emojis for comprehensive knowledge extraction from online reviews. 
The findings validate the model’s capability to offer actionable insights into consumer preferences, thereby supporting more informed and strategic decision-making by businesses. This study contributes to the broader field of sentiment analysis by showcasing the untapped potential of emojis as valuable indicators of user sentiments, opening new avenues for research and applications in digital marketing and consumer behavior analysis.

Originality/value

This study introduces a pioneering approach to extract knowledge from Web user interactions, notably through the integration of online reviews that incorporate both textual content and emoticons. This innovative methodology stands out because it holistically considers the dual channels of communication, text and emojis, to comprehensively mine Web user preferences. The key contribution of this study lies in its novel insights into the extraction of consumer preferences, advancing beyond traditional text-based analysis to embrace nuanced expressions conveyed through emoticons. The originality of this study is underpinned by its acknowledgment of emoticons as a significant and untapped source of sentiment and preference indicators in online reviews. By effectively merging emoticon analysis and emoji emotion scoring with textual sentiment analysis, this study enriches the understanding of Web user preferences and enhances the accuracy and depth of consumer preference insights. This dual-analysis approach represents a significant leap forward in sentiment analysis, setting a new standard for how digital communication can be leveraged to derive meaningful insights into consumer behavior. Furthermore, the results have practical implications to businesses and marketers. The insights gained from this integrated analytical approach offer a more granular and emotionally nuanced view of customer feedback, which can inform more effective marketing strategies, product development and customer service practices. By pioneering this comprehensive method of knowledge extraction, this study paves the way for future research and practice to interpret and respond more accurately to the complex landscape of online consumer expressions. 
This study’s originality and value lie in its innovative method of capturing and analyzing the rich tapestry of Web user communication, offering a ground-breaking perspective on consumer preference extraction that promises to enhance both academic research and practical applications in the digital era.

Details

Journal of Knowledge Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1367-3270

Article
Publication date: 8 November 2022

Yohanes Sigit Purnomo W.P., Yogan Jaya Kumar and Nur Zareen Zulkarnain

Abstract

Purpose

So far, the corpus for the quotation extraction and quotation attribution tasks in Indonesian remains limited in quantity and depth. This study aims to develop an Indonesian corpus of public figure statement attributions and a baseline model for attribution extraction, thereby fostering research in information extraction for the Indonesian language.

Design/methodology/approach

The methodology is divided into corpus development and extraction model development. During corpus development, data were collected and annotated. The development of the extraction model entails feature extraction, the definition of the model architecture, parameter selection and configuration, model training and evaluation, as well as model selection.

Findings

The Indonesian corpus of public figure statement attributions achieved a 90.06% agreement level between the annotator and experts and could serve as a gold standard corpus. Furthermore, the baseline model predicted most labels and achieved an F-score of 82.026%.
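The abstract does not state which agreement measure was used. Assuming simple percent agreement (the share of items on which two label sequences match), the calculation would look like the sketch below. The function name and the label values are hypothetical.

```python
def percent_agreement(labels_a, labels_b):
    """Percentage of positions where two annotators chose the same label."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both annotators must label the same items")
    if not labels_a:
        return 0.0
    matches = sum(a == b for a, b in zip(labels_a, labels_b))
    return 100.0 * matches / len(labels_a)


# Invented token-level labels for one annotated sentence.
annotator = ["SOURCE", "CUE", "CONTENT", "O"]
expert = ["SOURCE", "CUE", "O", "O"]
agreement = percent_agreement(annotator, expert)
```

Percent agreement does not correct for matches expected by chance, so chance-corrected statistics such as Cohen's kappa are often reported alongside it.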

Originality/value

To the best of the authors’ knowledge, the resulting corpus is the first corpus for attribution of public figures’ statements in the Indonesian language, which makes it a significant step for research on attribution extraction in the language. The resulting corpus and the baseline model can be used as a benchmark for further research. Other researchers could follow the methods presented in this paper to develop a new corpus and baseline model for other languages.

Details

Global Knowledge, Memory and Communication, vol. 73 no. 6/7
Type: Research Article
ISSN: 2514-9342

Article
Publication date: 13 August 2024

Wenshen Xu, Yifan Zhang, Xinhang Jiang, Jun Lian and Ye Lin

Abstract

Purpose

In the field of steel defect detection, the existing detection algorithms struggle to achieve a satisfactory balance between detection accuracy, computational cost and inference speed due to the interference from complex background information, the variety of defect types and significant variations in defect morphology. To solve this problem, this paper aims to propose an efficient detector based on multi-scale information extraction (MSI-YOLO), which uses YOLOv8s as the baseline model.

Design/methodology/approach

First, the authors introduce an efficient multi-scale convolution with different-sized convolution kernels, which enables the feature extraction network to accommodate significant variations in defect morphology. Furthermore, the authors introduce the channel prior convolutional attention mechanism, which allows the network to focus on defect areas and ignore complex background interference. Considering the lightweight design and accuracy improvement, the authors introduce a more lightweight feature fusion network (Slim-neck) to improve the fusion effect of feature maps.
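The idea of convolving the same input with kernels of several sizes and fusing the results can be shown in a dependency-free toy. The sketch below is a one-dimensional stand-in, not the MSI-YOLO module. The averaging kernels replace learned weights, and the mean fusion, kernel sizes and function names are all assumptions.

```python
def conv1d_same(signal, kernel):
    """Zero-padded 1D cross-correlation with same-length output (odd kernel)."""
    half = len(kernel) // 2
    padded = [0.0] * half + list(signal) + [0.0] * half
    return [
        sum(k * padded[i + j] for j, k in enumerate(kernel))
        for i in range(len(signal))
    ]


def multi_scale(signal, kernel_sizes=(1, 3, 5)):
    """Run one branch per kernel size, then fuse by element-wise mean."""
    branches = []
    for size in kernel_sizes:
        # Averaging kernel as a placeholder for learned convolution weights.
        kernel = [1.0 / size] * size
        branches.append(conv1d_same(signal, kernel))
    return [sum(values) / len(values) for values in zip(*branches)]


# A narrow spike: small kernels keep it sharp, large kernels spread it out.
features = multi_scale([0, 0, 3, 0, 0])
```

Small kernels respond to fine detail and large kernels to broader context, which is the intuition behind handling defects of very different sizes in one layer.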

Findings

MSI-YOLO achieves 79.9% mean average precision on the public Northeastern University (NEU)-DET data set, with a model size of only 19.0 MB and a speed of 62.5 frames per second. Compared with other state-of-the-art detectors, MSI-YOLO greatly improves recognition accuracy and has significant advantages in computational cost and inference speed. Additionally, the strong generalization ability of MSI-YOLO is verified on a steel data set collected at an industrial site.

Originality/value

This paper proposes an efficient steel defect detector with high accuracy, low computational cost, excellent detection speed and strong generalization ability, which is more valuable for practical applications in resource-limited industrial production.

Details

Robotic Intelligence and Automation, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2754-6969
