Search results

1 – 10 of 231
Article
Publication date: 2 January 2024

Tiara Kusumaningtiyas, Prasetyo Adi Nugroho and Nurul Aida Noor Azizi

The purpose of this paper is to explore the use of artificial intelligence (AI) in libraries, especially university libraries, which are faced with users from various countries…

Abstract

Purpose

The purpose of this paper is to explore the use of artificial intelligence (AI) in libraries, especially university libraries, which are faced with users from various countries who have different languages and cultures. Seamless M4T, which is being developed, has great potential for helping university librarians maximize library services by providing ease of communication.

Design/methodology/approach

Analyzing the possibility of developing Seamless M4T using natural language processing techniques and how to train language models to be smarter AI tools and can be used to break down language barriers between librarians and users.

Findings

The implementation of AI-based application Seamless M4T can help university librarians provide maximum service to users who are hampered by language and culture with advanced communication skills. Seamless M4T has an automatic speech recognition feature for dozens of languages, so it can translate speech-to-text, text-to-speech or both text and speech. To convert written words into verbal forms, this AI can also translate and transcribe text and speech in real-time without significant delays.

Originality/value

This paper emphasizes the use of AI in university libraries to improve services, especially in communication due to language differences between librarians and users. Advantages in using AI in libraries can support the collaboration and scholarly communication process.

Details

Library Hi Tech News, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0741-9058

Keywords

Content available
Article
Publication date: 13 November 2023

Sheuli Paul

This paper presents a survey of research into interactive robotic systems for the purpose of identifying the state of the art capabilities as well as the extant gaps in this…

1037

Abstract

Purpose

This paper presents a survey of research into interactive robotic systems for the purpose of identifying the state of the art capabilities as well as the extant gaps in this emerging field. Communication is multimodal. Multimodality is a representation of many modes chosen from rhetorical aspects for its communication potentials. The author seeks to define the available automation capabilities in communication using multimodalities that will support a proposed Interactive Robot System (IRS) as an AI mounted robotic platform to advance the speed and quality of military operational and tactical decision making.

Design/methodology/approach

This review will begin by presenting key developments in the robotic interaction field with the objective of identifying essential technological developments that set conditions for robotic platforms to function autonomously. After surveying the key aspects in Human Robot Interaction (HRI), Unmanned Autonomous System (UAS), visualization, Virtual Environment (VE) and prediction, the paper then proceeds to describe the gaps in the application areas that will require extension and integration to enable the prototyping of the IRS. A brief examination of other work in HRI-related fields concludes with a recapitulation of the IRS challenge that will set conditions for future success.

Findings

Using insights from a balanced cross section of sources from the government, academic, and commercial entities that contribute to HRI a multimodal IRS in military communication is introduced. Multimodal IRS (MIRS) in military communication has yet to be deployed.

Research limitations/implications

Multimodal robotic interface for the MIRS is an interdisciplinary endeavour. This is not realistic that one can comprehend all expert and related knowledge and skills to design and develop such multimodal interactive robotic interface. In this brief preliminary survey, the author has discussed extant AI, robotics, NLP, CV, VDM, and VE applications that is directly related to multimodal interaction. Each mode of this multimodal communication is an active research area. Multimodal human/military robot communication is the ultimate goal of this research.

Practical implications

A multimodal autonomous robot in military communication using speech, images, gestures, VST and VE has yet to be deployed. Autonomous multimodal communication is expected to open wider possibilities for all armed forces. Given the density of the land domain, the army is in a position to exploit the opportunities for human–machine teaming (HMT) exposure. Naval and air forces will adopt platform specific suites for specially selected operators to integrate with and leverage this emerging technology. The possession of a flexible communications means that readily adapts to virtual training will enhance planning and mission rehearsals tremendously.

Social implications

Interaction, perception, cognition and visualization based multimodal communication system is yet missing. Options to communicate, express and convey information in HMT setting with multiple options, suggestions and recommendations will certainly enhance military communication, strength, engagement, security, cognition, perception as well as the ability to act confidently for a successful mission.

Originality/value

The objective is to develop a multimodal autonomous interactive robot for military communications. This survey reports the state of the art, what exists and what is missing, what can be done and possibilities of extension that support the military in maintaining effective communication using multimodalities. There are some separate ongoing progresses, such as in machine-enabled speech, image recognition, tracking, visualizations for situational awareness, and virtual environments. At this time, there is no integrated approach for multimodal human robot interaction that proposes a flexible and agile communication. The report briefly introduces the research proposal about multimodal interactive robot in military communication.

Article
Publication date: 2 March 2022

Yanli Fan and Liyan Liu

Deep learning (DL) technology is used to design a voice evaluation system to understand the impact of learning aids on DL and mobile platforms on students’ learning behavior.

Abstract

Purpose

Deep learning (DL) technology is used to design a voice evaluation system to understand the impact of learning aids on DL and mobile platforms on students’ learning behavior.

Design/methodology/approach

DL technology is used to design a speech evaluation system.

Findings

The experimental results show that the speech evaluation system designed has a high accuracy rate, the highest agreement rate with manual evaluation of pronunciation is 89.5%, and the correct speech recognition rate is 96.64%. The designed voice evaluation system and the manual voice rating system have a maximum error rate of 2%. The experimental results suggest that it is necessary to further optimize the learning aids for mobile platform. The learning aids of the mobile platform need to be further optimized to promote the improvement of student learning efficiency.

Originality/value

The results show that the speech evaluation system designed has good practical application value, and it provides a certain reference value for the future study of learning tools on DL.

Details

Library Hi Tech, vol. 41 no. 5
Type: Research Article
ISSN: 0737-8831

Keywords

Article
Publication date: 25 January 2024

Yaolin Zhou, Zhaoyang Zhang, Xiaoyu Wang, Quanzheng Sheng and Rongying Zhao

The digitalization of archival management has rapidly developed with the maturation of digital technology. With data's exponential growth, archival resources have transitioned…

Abstract

Purpose

The digitalization of archival management has rapidly developed with the maturation of digital technology. With data's exponential growth, archival resources have transitioned from single modalities, such as text, images, audio and video, to integrated multimodal forms. This paper identifies key trends, gaps and areas of focus in the field. Furthermore, it proposes a theoretical organizational framework based on deep learning to address the challenges of managing archives in the era of big data.

Design/methodology/approach

Via a comprehensive systematic literature review, the authors investigate the field of multimodal archive resource organization and the application of deep learning techniques in archive organization. A systematic search and filtering process is conducted to identify relevant articles, which are then summarized, discussed and analyzed to provide a comprehensive understanding of existing literature.

Findings

The authors' findings reveal that most research on multimodal archive resources predominantly focuses on aspects related to storage, management and retrieval. Furthermore, the utilization of deep learning techniques in image archive retrieval is increasing, highlighting their potential for enhancing image archive organization practices; however, practical research and implementation remain scarce. The review also underscores gaps in the literature, emphasizing the need for more practical case studies and the application of theoretical concepts in real-world scenarios. In response to these insights, the authors' study proposes an innovative deep learning-based organizational framework. This proposed framework is designed to navigate the complexities inherent in managing multimodal archive resources, representing a significant stride toward more efficient and effective archival practices.

Originality/value

This study comprehensively reviews the existing literature on multimodal archive resources organization. Additionally, a theoretical organizational framework based on deep learning is proposed, offering a novel perspective and solution for further advancements in the field. These insights contribute theoretically and practically, providing valuable knowledge for researchers, practitioners and archivists involved in organizing multimodal archive resources.

Details

Aslib Journal of Information Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2050-3806

Keywords

Article
Publication date: 20 October 2023

Komal Ghafoor, Tauqir Ahmad, Muhammad Aslam and Samyan Wahla

Assistive technology has been developed to assist the visually impaired individuals in their social interactions. Specifically designed to enhance communication skills, facilitate…

Abstract

Purpose

Assistive technology has been developed to assist the visually impaired individuals in their social interactions. Specifically designed to enhance communication skills, facilitate social engagement and improve the overall quality of life, conversational assistive technologies include speech recognition APIs, text-to-speech APIs and various communication tools that are real. Enable real-time interaction. Using natural language processing (NLP) and machine learning algorithms, the technology analyzes spoken language and provides appropriate responses, offering an immersive experience through voice commands, audio feedback and vibration alerts.

Design/methodology/approach

These technologies have demonstrated their ability to promote self-confidence and self-reliance in visually impaired individuals during social interactions. Moreover, they promise to improve social competence and foster better relationships. In short, assistive technology in conversation stands as a promising tool that empowers the visually impaired individuals, elevating the quality of their social engagement.

Findings

The main benefit of assistive communication technology is that it will help visually impaired people overcome communication barriers in social contexts. This technology helps them communicate effectively with acquaintances, family, co-workers and even strangers in public places. By enabling smoother and more natural communication, it works to reduce feelings of isolation and increase overall quality of life.

Originality/value

Research findings include successful activity recognition, aligning with activities on which the VGG-16 model was trained, such as hugging, shaking hands, talking, walking, waving and more. The originality of this study lies in its approach to address the challenges faced by the visually impaired individuals in their social interactions through modern technology. Research adds to the body of knowledge in the area of assistive technologies, which contribute to the empowerment and social inclusion of the visually impaired individuals.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 17 no. 1
Type: Research Article
ISSN: 1756-378X

Keywords

Open Access
Article
Publication date: 19 December 2023

Qinxu Ding, Ding Ding, Yue Wang, Chong Guan and Bosheng Ding

The rapid rise of large language models (LLMs) has propelled them to the forefront of applications in natural language processing (NLP). This paper aims to present a comprehensive…

1464

Abstract

Purpose

The rapid rise of large language models (LLMs) has propelled them to the forefront of applications in natural language processing (NLP). This paper aims to present a comprehensive examination of the research landscape in LLMs, providing an overview of the prevailing themes and topics within this dynamic domain.

Design/methodology/approach

Drawing from an extensive corpus of 198 records published between 1996 to 2023 from the relevant academic database encompassing journal articles, books, book chapters, conference papers and selected working papers, this study delves deep into the multifaceted world of LLM research. In this study, the authors employed the BERTopic algorithm, a recent advancement in topic modeling, to conduct a comprehensive analysis of the data after it had been meticulously cleaned and preprocessed. BERTopic leverages the power of transformer-based language models like bidirectional encoder representations from transformers (BERT) to generate more meaningful and coherent topics. This approach facilitates the identification of hidden patterns within the data, enabling authors to uncover valuable insights that might otherwise have remained obscure. The analysis revealed four distinct clusters of topics in LLM research: “language and NLP”, “education and teaching”, “clinical and medical applications” and “speech and recognition techniques”. Each cluster embodies a unique aspect of LLM application and showcases the breadth of possibilities that LLM technology has to offer. In addition to presenting the research findings, this paper identifies key challenges and opportunities in the realm of LLMs. It underscores the necessity for further investigation in specific areas, including the paramount importance of addressing potential biases, transparency and explainability, data privacy and security, and responsible deployment of LLM technology.

Findings

The analysis revealed four distinct clusters of topics in LLM research: “language and NLP”, “education and teaching”, “clinical and medical applications” and “speech and recognition techniques”. Each cluster embodies a unique aspect of LLM application and showcases the breadth of possibilities that LLM technology has to offer. In addition to presenting the research findings, this paper identifies key challenges and opportunities in the realm of LLMs. It underscores the necessity for further investigation in specific areas, including the paramount importance of addressing potential biases, transparency and explainability, data privacy and security, and responsible deployment of LLM technology.

Practical implications

This classification offers practical guidance for researchers, developers, educators, and policymakers to focus efforts and resources. The study underscores the importance of addressing challenges in LLMs, including potential biases, transparency, data privacy, and responsible deployment. Policymakers can utilize this information to shape regulations, while developers can tailor technology development based on the diverse applications identified. The findings also emphasize the need for interdisciplinary collaboration and highlight ethical considerations, providing a roadmap for navigating the complex landscape of LLM research and applications.

Originality/value

This study stands out as the first to examine the evolution of LLMs across such a long time frame and across such diversified disciplines. It provides a unique perspective on the key areas of LLM research, highlighting the breadth and depth of LLM’s evolution.

Details

Journal of Electronic Business & Digital Economics, vol. 3 no. 1
Type: Research Article
ISSN: 2754-4214

Keywords

Book part
Publication date: 1 February 2024

Seden Dogan

Voice command and voice search are becoming increasingly popular in the hospitality and tourism industry, with many hotels and travel companies investing in voice-enabled…

Abstract

Voice command and voice search are becoming increasingly popular in the hospitality and tourism industry, with many hotels and travel companies investing in voice-enabled technology to improve guest experiences and streamline operations. This technology allows travellers to manage their travel plans, request services and get information through natural voice commands on any voice-enabled device. Voice assistants are also multilingual, allowing hotels to customise responses to guests who do not speak the local language. Angie, a multilingual, in-room voice assistant, is an example of this technology. It can fulfil guest requests, answer common questions about the property and create streamlined access to a wide range of hotel amenities, such as ordering room service or requesting extra towels. Hotels can control questions and responses to assist stretched staff and provide upsell and advertising revenue through digital promotions or recommended onsite amenities or discounts. In addition, voice command technology can be used to book travel and find things to do at a destination. Google Assistant can help with travel plans like booking a hotel, checking flight status and finding things to do at a destination. In conclusion, voice command and voice search technology are transforming the hospitality and tourism industry by improving guest experiences, reducing operational costs and increasing revenue.

Open Access
Article
Publication date: 13 September 2023

Shuxin Ding, Tao Zhang, Kai Sheng, Yuanyuan Chen and Zhiming Yuan

The intelligent Central Traffic Control (CTC) system plays a vital role in establishing an intelligent high-speed railway (HSR) system. As the core of HSR transportation command…

Abstract

Purpose

The intelligent Central Traffic Control (CTC) system plays a vital role in establishing an intelligent high-speed railway (HSR) system. As the core of HSR transportation command, the intelligent CTC system is a new HSR dispatching command system that integrates the widely used CTC in China with the practical service requirements of intelligent dispatching. This paper aims to propose key technologies and applications for intelligent dispatching command in HSR in China.

Design/methodology/approach

This paper first briefly introduces the functions and configuration of the intelligent CTC system. Some new servers, terminals and interfaces are introduced, which are plan adjustment server/terminal, interface for automatic train operation (ATO), interface for Dynamic Monitoring System of Train Control Equipment (DMS), interface for Power Supervisory Control and Data Acquisition (PSCADA), interface for Disaster Monitoring, etc.

Findings

The key technologies applied in the intelligent CTC system include automatic adjustment of train operation plans, safety control of train routes and commands, traffic information data platform, integrated simulation of traffic dispatching and ATO function. These technologies have been applied in the Beijing-Zhangjiakou HSR, which commenced operations at the end of 2019. Implementing these key intelligent functions has improved the train dispatching command capacity, ensured the safe operation of intelligent HSR, reduced the labor intensity of dispatching operators and enhanced the intelligence level of China's dispatching system.

Originality/value

This paper provides further challenges and research directions for the intelligent dispatching command of HSR. To achieve the objectives, new measures need to be conducted, including the development of advanced technologies for intelligent dispatching command, coping with new requirements with the development of China's railway signaling system, the integration of traffic dispatching and train control and the application of AI and data-driven modeling and methods.

Article
Publication date: 16 May 2022

Parag Bhatt and Ashutosh Muduli

The presented research explored artificial intelligence (AI) application in the learning and development (L&D) function. Although a few studies reported AI and the people…

2445

Abstract

Purpose

The presented research explored artificial intelligence (AI) application in the learning and development (L&D) function. Although a few studies reported AI and the people management processes, a systematic and structured study that evaluates the integration of AI with L&D focusing on scope, adoption and affecting factors is mainly absent. This study aims to explore L&D-related AI innovations, AI’s role in L&D processes, advantages of AI adoption and factors leading to effective AI-based learning following the analyse, design, develop, implement and evaluate approach.

Design/methodology/approach

The presented research has adopted a systematic literature review method to critically analyse, synthesise and map the extant research by identifying the broad themes involved. The review approach includes determining a time horizon, database selection, article selection and article classification. Databases from Emerald, Sage, Francis and Taylor, etc. were used, and the 81 research articles published between 1996 and 2022 were identified for analysis.

Findings

The result shows that AI innovations such as natural language processing, artificial neural networks, interactive voice response and text to speech, speech to text, technology-enhanced learning and robots can improve L&D process efficiency. One can achieve this by facilitating the articulation of learning module, identifying learners through face recognition and speech recognition systems, completing course work, etc. Further, the result also shows that AI can be adopted in evaluating learning aptitude, testing learners’ memory, tracking learning progress, measuring learning effectiveness, helping learners identify mistakes and suggesting corrections. Finally, L&D professionals can use AI to facilitate a quicker, more accurate and cheaper learning process, suitable for a large learning audience at a time, flexible, efficient, convenient and less expensive for learners.

Originality/value

In the absence of any systematic research on AI in L&D function, the result of this study may provide useful insights to researchers and practitioners.

Details

European Journal of Training and Development, vol. 47 no. 7/8
Type: Research Article
ISSN: 2046-9012

Keywords

1 – 10 of 231