Search results

1 – 10 of 145
Article
Publication date: 2 January 2024

Tiara Kusumaningtiyas, Prasetyo Adi Nugroho and Nurul Aida Noor Azizi

The purpose of this paper is to explore the use of artificial intelligence (AI) in libraries, especially university libraries, which are faced with users from various countries…

Abstract

Purpose

The purpose of this paper is to explore the use of artificial intelligence (AI) in libraries, especially university libraries, which are faced with users from various countries who have different languages and cultures. Seamless M4T, which is being developed, has great potential for helping university librarians maximize library services by providing ease of communication.

Design/methodology/approach

Analyzing the possibility of developing Seamless M4T using natural language processing techniques and how to train language models to be smarter AI tools and can be used to break down language barriers between librarians and users.

Findings

The implementation of AI-based application Seamless M4T can help university librarians provide maximum service to users who are hampered by language and culture with advanced communication skills. Seamless M4T has an automatic speech recognition feature for dozens of languages, so it can translate speech-to-text, text-to-speech or both text and speech. To convert written words into verbal forms, this AI can also translate and transcribe text and speech in real-time without significant delays.

Originality/value

This paper emphasizes the use of AI in university libraries to improve services, especially in communication due to language differences between librarians and users. Advantages in using AI in libraries can support the collaboration and scholarly communication process.

Details

Library Hi Tech News, vol. 41 no. 4
Type: Research Article
ISSN: 0741-9058

Keywords

Article
Publication date: 7 June 2024

Venus Chan

Studies on technology and interpreting have increasingly explored how technology influences the role and performance of interpreters in their practice; however, there is a lack of…

Abstract

Purpose

Studies on technology and interpreting have increasingly explored how technology influences the role and performance of interpreters in their practice; however, there is a lack of comprehensive reviews and analyses. This paper aims to synthetically review the state-of-the-art application and integration of various interpreting technologies, identify the key trends of recent studies, and evaluate the associated opportunities and challenges.

Design/methodology/approach

Adopting a systematic review approach, 40 articles on technology and interpreting practice from 2013 to 2024 were selected and analysed.

Findings

A growing number of empirical studies on technology-mediated remote interpreting and technology-supported interpreting is observed in public service settings, suggesting that mobile and emerging technologies have gained more attention alongside phone and video. In addition, mixed results are revealed with respect to the impact of technology on interpreting performance.

Originality/value

This review not only provides a bird’s-eye view of how interpreting practice has reconciled with different technologies but also offers insights into the changing role of interpreters, the need for training, and the direction for future research.

Details

Interactive Technology and Smart Education, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1741-5659

Keywords

Article
Publication date: 17 July 2024

Siqi Yi and Soo Young Rieh

This paper aims to critically review the intersection of searching and learning among children in the context of voice-based conversational agents (VCAs). This study presents the…

Abstract

Purpose

This paper aims to critically review the intersection of searching and learning among children in the context of voice-based conversational agents (VCAs). This study presents the opportunities and challenges around reconfiguring current VCAs for children to facilitate human learning, generate diverse data to empower VCAs, and assess children’s learning from voice search interactions.

Design/methodology/approach

The scope of this paper includes children’s use of VCAs for learning purposes with an emphasis on conceptualizing their VCA use from search as learning perspectives. This study selects representative works from three areas of literature: children’s perceptions of digital devices, children’s learning and searching, and children’s search as learning. This study also includes conceptual papers and empirical studies focusing on children from 3 to 11 because this age spectrum covers a vital transitional phase in children’s ability to understand and use VCAs.

Findings

This study proposes the concept of child-centered voice search systems and provides design recommendations for imbuing contextual information, providing communication breakdown repair strategies, scaffolding information interactions, integrating emotional intelligence, and providing explicit feedback. This study presents future research directions for longitudinal and observational studies with more culturally diverse child participants.

Originality/value

This paper makes important contributions to the field of information and learning sciences and children’s searching as learning by proposing a new perspective where current VCAs are reconfigured as conversational voice search systems to enhance children’s learning.

Details

Information and Learning Sciences, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2398-5348

Keywords

Article
Publication date: 19 July 2024

Giulio Marchena Sekli

The aim of this study is to offer valuable insights to businesses and facilitate better understanding on transformer-based models (TBMs), which are among the widely employed…

Abstract

Purpose

The aim of this study is to offer valuable insights to businesses and facilitate better understanding on transformer-based models (TBMs), which are among the widely employed generative artificial intelligence (GAI) models, garnering substantial attention due to their ability to process and generate complex data.

Design/methodology/approach

Existing studies on TBMs tend to be limited in scope, either focusing on specific fields or being highly technical. To bridge this gap, this study conducts robust bibliometric analysis to explore the trends across journals, authors, affiliations, countries and research trajectories using science mapping techniques – co-citation, co-words and strategic diagram analysis.

Findings

Identified research gaps encompass the evolution of new closed and open-source TBMs; limited exploration across industries like education and disciplines like marketing; a lack of in-depth exploration on TBMs' adoption in the health sector; scarcity of research on TBMs' ethical considerations and potential TBMs' performance research in diverse applications, like image processing.

Originality/value

The study offers an updated TBMs landscape and proposes a theoretical framework for TBMs' adoption in organizations. Implications for managers and researchers along with suggested research questions to guide future investigations are provided.

Details

Kybernetes, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0368-492X

Keywords

Article
Publication date: 17 June 2024

Enayat Rajabi, Allu Niya George and Karishma Kumar

This study aims to investigate the applications of knowledge graphs in developing artificial intelligence (AI) assistants and chatbots by reviewing scholarly publications from…

Abstract

Purpose

This study aims to investigate the applications of knowledge graphs in developing artificial intelligence (AI) assistants and chatbots by reviewing scholarly publications from different lenses and dimensions. The authors also analyze the various AI approaches used for knowledge graph-driven chatbots and discuss how implementing these techniques makes a difference in technology.

Design/methodology/approach

Over recent years, chatbots have emerged as a transformational force in interacting with the digital world in various domains, including customer service and personal assistants. Recently, chatbots have become a revolutionary tool for interacting with the digital world in various contexts, such as personal assistants and customer support. Incorporating knowledge graphs considerably improved the capabilities of chatbots by allowing them access to massive knowledge bases and enhancing their ability to understand queries. Furthermore, knowledge graphs enable chatbots to understand semantic links between elements and improve response quality. This study highlights the role of knowledge graphs in chatbots following a systematic review approach. They have been integrated into major health-care, education and business domains. Beyond improving information retrieval, knowledge graphs enhance the user experience and increase the range of fields in which chatbots can be used. Improving and enriching chatbot answers was also identified as one of the main advantages of knowledge graphs. This enriched response can increase user confidence and improve the accuracy of chatbot interactions, making them more trustworthy information sources.

Findings

Knowledge graph-based chatbots leverage extensive data retrieval to provide accurate and enriched responses, increasing user confidence and experience without requiring extensive training. The three major domains where knowledge graph-based chatbots have been used are health care, education and business.

Practical implications

Knowledge graph-based chatbots can better comprehend user queries and respond with relevant information efficiently without extensive training. Furthermore, knowledge graphs enable chatbots to understand semantic links between elements, allowing them to answer complicated and multi-faceted questions. This semantic comprehension improves response quality, making chatbots more successful in providing accurate and valuable information in various domains. Furthermore, knowledge graphs enable chatbots to provide consumers with individualized experiences by storing and recalling individual preferences, history or previous encounters. This study analyzes the role of knowledge graphs in chatbots following a systematic review approach. This study reviewed state-of-the-art articles to understand where and how chatbots have used knowledge graphs. The authors found health care, business and education as three main areas in which knowledge-graph-based chatbots have been mostly used. Chatbots have been developed in text, voice and visuals using various machine learning models, particularly natural language pocessing, to develop recommender systems to recommend suitable items, content or services based on user preferences and item associations.

Originality/value

This paper provides a comprehensive review of the current state of the field in using knowledge graphs in chatbots, focusing on machine learning models, domains and communication channels. The study highlights the prevalence of text and voice channels over visual ones and identifies research gaps and future directions. The paper’s insights can inform the design and development of chatbots using knowledge graphs and benefit both researchers and practitioners in AI, natural language processing and human–computer interaction. The paper is of interest to professionals in domains such as health care, education and business.

Details

The Electronic Library , vol. 42 no. 3
Type: Research Article
ISSN: 0264-0473

Keywords

Article
Publication date: 29 August 2024

Yizhuo Zhang, Yunfei Zhang, Huiling Yu and Shen Shi

The anomaly detection task for oil and gas pipelines based on acoustic signals faces issues such as background noise coverage, lack of effective features, and small sample sizes…

Abstract

Purpose

The anomaly detection task for oil and gas pipelines based on acoustic signals faces issues such as background noise coverage, lack of effective features, and small sample sizes, resulting in low fault identification accuracy and slow efficiency. The purpose of this paper is to study an accurate and efficient method of pipeline anomaly detection.

Design/methodology/approach

First, to address the impact of background noise on the accuracy of anomaly signals, the adaptive multi-threshold center frequency variational mode decomposition method(AMTCF-VMD) method is used to eliminate strong noise in pipeline signals. Secondly, to address the strong data dependency and loss of local features in the Swin Transformer network, a Hybrid Pyramid ConvNet network with an Agent Attention mechanism is proposed. This compensates for the limitations of CNN’s receptive field and enhances the Swin Transformer’s global contextual feature representation capabilities. Thirdly, to address the sparsity and imbalance of anomaly samples, the SpecAugment and Scaper methods are integrated to enhance the model’s generalization ability.

Findings

In the pipeline anomaly audio and environmental datasets such as ESC-50, the AMTCF-VMD method shows more significant denoising effects compared to wavelet packet decomposition and EMD methods. Additionally, the model achieved 98.7% accuracy on the preprocessed anomaly audio dataset and 99.0% on the ESC-50 dataset.

Originality/value

This paper innovatively proposes and combines the AMTCF-VMD preprocessing method with the Agent-SwinPyramidNet model, addressing noise interference and low accuracy issues in pipeline anomaly detection, and providing strong support for oil and gas pipeline anomaly recognition tasks in high-noise environments.

Details

International Journal of Intelligent Computing and Cybernetics, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 24 September 2024

Eric Ohene, Gabriel Nani, Maxwell Fordjour Antwi-Afari, Amos Darko, Lydia Agyapomaa Addai and Edem Horvey

Unlocking the potential of Big Data Analytics (BDA) has proven to be a transformative factor for the Architecture, Engineering and Construction (AEC) industry. This has prompted…

Abstract

Purpose

Unlocking the potential of Big Data Analytics (BDA) has proven to be a transformative factor for the Architecture, Engineering and Construction (AEC) industry. This has prompted researchers to focus attention on BDA in the AEC industry (BDA-in-AECI) in recent years, leading to a proliferation of relevant research. However, an in-depth exploration of the literature on BDA-in-AECI remains scarce. As a result, this study seeks to systematically explore the state-of-the-art review on BDA-in-AECI and identify research trends and gaps in knowledge to guide future research.

Design/methodology/approach

This state-of-the-art review was conducted using a mixed-method systematic review. Relevant publications were retrieved from Scopus and then subjected to inclusion and exclusion criteria. A quantitative bibliometric analysis was conducted using VOSviewer software and Gephi to reveal the status quo of research in the domain. A further qualitative analysis was performed on carefully screened articles. Based on this mixed-method systematic review, knowledge gaps were identified and future research agendas of BDA-in-AECI were proposed.

Findings

The results show that BDA has been adopted to support AEC decision-making, safety and risk assessment, structural health monitoring, damage detection, waste management, project management and facilities management. BDA also plays a major role in achieving construction 4.0 and Industry 4.0. The study further revealed that data mining, cloud computing, predictive analytics, machine learning and artificial intelligence methods, such as deep learning, natural language processing and computer vision, are the key methods used for BDA-in-AECI. Moreover, several data acquisition platforms and technologies were identified, including building information modeling, Internet of Things (IoT), social networking and blockchain. Further studies are needed to examine the synergies between BDA and AI, BDA and Digital twin and BDA and blockchain in the AEC industry.

Originality/value

The study contributes to the BDA-in-AECI body of knowledge by providing a comprehensive scope of understanding and revealing areas for future research directions beneficial to the stakeholders in the AEC industry.

Details

Engineering, Construction and Architectural Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0969-9988

Keywords

Article
Publication date: 5 July 2024

Nouhaila Bensalah, Habib Ayad, Abdellah Adib and Abdelhamid Ibn El Farouk

The paper aims to enhance Arabic machine translation (MT) by proposing novel approaches: (1) a dimensionality reduction technique for word embeddings tailored for Arabic text…

Abstract

Purpose

The paper aims to enhance Arabic machine translation (MT) by proposing novel approaches: (1) a dimensionality reduction technique for word embeddings tailored for Arabic text, optimizing efficiency while retaining semantic information; (2) a comprehensive comparison of meta-embedding techniques to improve translation quality; and (3) a method leveraging self-attention and Gated CNNs to capture token dependencies, including temporal and hierarchical features within sentences, and interactions between different embedding types. These approaches collectively aim to enhance translation quality by combining different embedding schemes and leveraging advanced modeling techniques.

Design/methodology/approach

Recent works on MT in general and Arabic MT in particular often pick one type of word embedding model. In this paper, we present a novel approach to enhance Arabic MT by addressing three key aspects. Firstly, we propose a new dimensionality reduction technique for word embeddings, specifically tailored for Arabic text. This technique optimizes the efficiency of embeddings while retaining their semantic information. Secondly, we conduct an extensive comparison of different meta-embedding techniques, exploring the combination of static and contextual embeddings. Through this analysis, we identify the most effective approach to improve translation quality. Lastly, we introduce a novel method that leverages self-attention and Gated convolutional neural networks (CNNs) to capture token dependencies, including temporal and hierarchical features within sentences, as well as interactions between different types of embeddings. Our experimental results demonstrate the effectiveness of our proposed approach in significantly enhancing Arabic MT performance. It outperforms baseline models with a BLEU score increase of 2 points and achieves superior results compared to state-of-the-art approaches, with an average improvement of 4.6 points across all evaluation metrics.

Findings

The proposed approaches significantly enhance Arabic MT performance. The dimensionality reduction technique improves the efficiency of word embeddings while preserving semantic information. Comprehensive comparison identifies effective meta-embedding techniques, with the contextualized dynamic meta-embeddings (CDME) model showcasing competitive results. Integration of Gated CNNs with the transformer model surpasses baseline performance, leveraging both architectures' strengths. Overall, these findings demonstrate substantial improvements in translation quality, with a BLEU score increase of 2 points and an average improvement of 4.6 points across all evaluation metrics, outperforming state-of-the-art approaches.

Originality/value

The paper’s originality lies in its departure from simply fine-tuning the transformer model for a specific task. Instead, it introduces modifications to the internal architecture of the transformer, integrating Gated CNNs to enhance translation performance. This departure from traditional fine-tuning approaches demonstrates a novel perspective on model enhancement, offering unique insights into improving translation quality without solely relying on pre-existing architectures. The originality in dimensionality reduction lies in the tailored approach for Arabic text. While dimensionality reduction techniques are not new, the paper introduces a specific method optimized for Arabic word embeddings. By employing independent component analysis (ICA) and a post-processing method, the paper effectively reduces the dimensionality of word embeddings while preserving semantic information which has not been investigated before especially for MT task.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 17 no. 3
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 26 March 2024

Keyu Chen, Beiyu You, Yanbo Zhang and Zhengyi Chen

Prefabricated building has been widely applied in the construction industry all over the world, which can significantly reduce labor consumption and improve construction…

Abstract

Purpose

Prefabricated building has been widely applied in the construction industry all over the world, which can significantly reduce labor consumption and improve construction efficiency compared with conventional approaches. During the construction of prefabricated buildings, the overall efficiency largely depends on the lifting sequence and path of each prefabricated component. To improve the efficiency and safety of the lifting process, this study proposes a framework for automatically optimizing the lifting path of prefabricated building components using building information modeling (BIM), improved 3D-A* and a physic-informed genetic algorithm (GA).

Design/methodology/approach

Firstly, the industry foundation class (IFC) schema for prefabricated buildings is established to enrich the semantic information of BIM. After extracting corresponding component attributes from BIM, the models of typical prefabricated components and their slings are simplified. Further, the slings and elements’ rotations are considered to build a safety bounding box. Secondly, an efficient 3D-A* is proposed for element path planning by integrating both safety factors and variable step size. Finally, an efficient GA is designed to obtain the optimal lifting sequence that satisfies physical constraints.

Findings

The proposed optimization framework is validated in a physics engine with a pilot project, which enables better understanding. The results show that the framework can intuitively and automatically generate the optimal lifting path for each type of prefabricated building component. Compared with traditional algorithms, the improved path planning algorithm significantly reduces the number of nodes computed by 91.48%, resulting in a notable decrease in search time by 75.68%.

Originality/value

In this study, a prefabricated component path planning framework based on the improved A* algorithm and GA is proposed for the first time. In addition, this study proposes a safety-bounding box that considers the effects of torsion and slinging of components during lifting. The semantic information of IFC for component lifting is enriched by taking into account lifting data such as binding positions, lifting methods, lifting angles and lifting offsets.

Details

Engineering, Construction and Architectural Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0969-9988

Keywords

Article
Publication date: 7 July 2023

Wuyan Liang and Xiaolong Xu

In the COVID-19 era, sign language (SL) translation has gained attention in online learning, which evaluates the physical gestures of each student and bridges the communication…

Abstract

Purpose

In the COVID-19 era, sign language (SL) translation has gained attention in online learning, which evaluates the physical gestures of each student and bridges the communication gap between dysphonia and hearing people. The purpose of this paper is to devote the alignment between SL sequence and nature language sequence with high translation performance.

Design/methodology/approach

SL can be characterized as joint/bone location information in two-dimensional space over time, forming skeleton sequences. To encode joint, bone and their motion information, we propose a multistream hierarchy network (MHN) along with a vocab prediction network (VPN) and a joint network (JN) with the recurrent neural network transducer. The JN is used to concatenate the sequences encoded by the MHN and VPN and learn their sequence alignments.

Findings

We verify the effectiveness of the proposed approach and provide experimental results on three large-scale datasets, which show that translation accuracy is 94.96, 54.52, and 92.88 per cent, and the inference time is 18 and 1.7 times faster than listen-attend-spell network (LAS) and visual hierarchy to lexical sequence network (H2SNet) , respectively.

Originality/value

In this paper, we propose a novel framework that can fuse multimodal input (i.e. joint, bone and their motion stream) and align input streams with nature language. Moreover, the provided framework is improved by the different properties of MHN, VPN and JN. Experimental results on the three datasets demonstrate that our approaches outperform the state-of-the-art methods in terms of translation accuracy and speed.

Details

Data Technologies and Applications, vol. 58 no. 2
Type: Research Article
ISSN: 2514-9288

Keywords

1 – 10 of 145