Search results

1 – 10 of 74
Article
Publication date: 13 March 2024

Rong Jiang, Bin He, Zhipeng Wang, Xu Cheng, Hongrui Sang and Yanmin Zhou

Compared with traditional methods relying on manual teaching or system modeling, data-driven learning methods, such as deep reinforcement learning and imitation learning, show…

Abstract

Purpose

Compared with traditional methods relying on manual teaching or system modeling, data-driven learning methods, such as deep reinforcement learning and imitation learning, show more promising potential to cope with the challenges brought by increasingly complex tasks and environments, which have become the hot research topic in the field of robot skill learning. However, the contradiction between the difficulty of collecting robot–environment interaction data and the low data efficiency causes all these methods to face a serious data dilemma, which has become one of the key issues restricting their development. Therefore, this paper aims to comprehensively sort out and analyze the cause and solutions for the data dilemma in robot skill learning.

Design/methodology/approach

First, this review analyzes the causes of the data dilemma based on the classification and comparison of data-driven methods for robot skill learning; Then, the existing methods used to solve the data dilemma are introduced in detail. Finally, this review discusses the remaining open challenges and promising research topics for solving the data dilemma in the future.

Findings

This review shows that simulation–reality combination, state representation learning and knowledge sharing are crucial for overcoming the data dilemma of robot skill learning.

Originality/value

To the best of the authors’ knowledge, there are no surveys that systematically and comprehensively sort out and analyze the data dilemma in robot skill learning in the existing literature. It is hoped that this review can be helpful to better address the data dilemma in robot skill learning in the future.

Details

Robotic Intelligence and Automation, vol. 44 no. 2
Type: Research Article
ISSN: 2754-6969

Keywords

Article
Publication date: 1 April 2024

Tao Pang, Wenwen Xiao, Yilin Liu, Tao Wang, Jie Liu and Mingke Gao

This paper aims to study the agent learning from expert demonstration data while incorporating reinforcement learning (RL), which enables the agent to break through the…

Abstract

Purpose

This paper aims to study the agent learning from expert demonstration data while incorporating reinforcement learning (RL), which enables the agent to break through the limitations of expert demonstration data and reduces the dimensionality of the agent’s exploration space to speed up the training convergence rate.

Design/methodology/approach

Firstly, the decay weight function is set in the objective function of the agent’s training to combine both types of methods, and both RL and imitation learning (IL) are considered to guide the agent's behavior when updating the policy. Second, this study designs a coupling utilization method between the demonstration trajectory and the training experience, so that samples from both aspects can be combined during the agent’s learning process, and the utilization rate of the data and the agent’s learning speed can be improved.

Findings

The method is superior to other algorithms in terms of convergence speed and decision stability, avoiding training from scratch for reward values, and breaking through the restrictions brought by demonstration data.

Originality/value

The agent can adapt to dynamic scenes through exploration and trial-and-error mechanisms based on the experience of demonstrating trajectories. The demonstration data set used in IL and the experience samples obtained in the process of RL are coupled and used to improve the data utilization efficiency and the generalization ability of the agent.

Details

International Journal of Web Information Systems, vol. 20 no. 3
Type: Research Article
ISSN: 1744-0084

Keywords

Open Access
Article
Publication date: 4 April 2024

Bassem T. ElHassan and Alya A. Arabi

The purpose of this paper is to illuminate the ethical concerns associated with the use of artificial intelligence (AI) in the medical sector and to provide solutions that allow…

Abstract

Purpose

The purpose of this paper is to illuminate the ethical concerns associated with the use of artificial intelligence (AI) in the medical sector and to provide solutions that allow deriving maximum benefits from this technology without compromising ethical principles.

Design/methodology/approach

This paper provides a comprehensive overview of AI in medicine, exploring its technical capabilities, practical applications, and ethical implications. Based on our expertise, we offer insights from both technical and practical perspectives.

Findings

The study identifies several advantages of AI in medicine, including its ability to improve diagnostic accuracy, enhance surgical outcomes, and optimize healthcare delivery. However, there are pending ethical issues such as algorithmic bias, lack of transparency, data privacy issues, and the potential for AI to deskill healthcare professionals and erode humanistic values in patient care. Therefore, it is important to address these issues as promptly as possible to make sure that we benefit from the AI’s implementation without causing any serious drawbacks.

Originality/value

This paper gains its value from the combined practical experience of Professor Elhassan gained through his practice at top hospitals worldwide, and the theoretical expertise of Dr. Arabi acquired from international institutes. The shared experiences of the authors provide valuable insights that are beneficial for raising awareness and guiding action in addressing the ethical concerns associated with the integration of artificial intelligence in medicine.

Details

International Journal of Ethics and Systems, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9369

Keywords

Article
Publication date: 12 April 2024

Youwei Li and Jian Qu

The purpose of this research is to achieve multi-task autonomous driving by adjusting the network architecture of the model. Meanwhile, after achieving multi-task autonomous…

Abstract

Purpose

The purpose of this research is to achieve multi-task autonomous driving by adjusting the network architecture of the model. Meanwhile, after achieving multi-task autonomous driving, the authors found that the trained neural network model performs poorly in untrained scenarios. Therefore, the authors proposed to improve the transfer efficiency of the model for new scenarios through transfer learning.

Design/methodology/approach

First, the authors achieved multi-task autonomous driving by training a model combining convolutional neural network and different structured long short-term memory (LSTM) layers. Second, the authors achieved fast transfer of neural network models in new scenarios by cross-model transfer learning. Finally, the authors combined data collection and data labeling to improve the efficiency of deep learning. Furthermore, the authors verified that the model has good robustness through light and shadow test.

Findings

This research achieved road tracking, real-time acceleration–deceleration, obstacle avoidance and left/right sign recognition. The model proposed by the authors (UniBiCLSTM) outperforms the existing models tested with model cars in terms of autonomous driving performance. Furthermore, the CMTL-UniBiCL-RL model trained by the authors through cross-model transfer learning improves the efficiency of model adaptation to new scenarios. Meanwhile, this research proposed an automatic data annotation method, which can save 1/4 of the time for deep learning.

Originality/value

This research provided novel solutions in the achievement of multi-task autonomous driving and neural network model scenario for transfer learning. The experiment was achieved on a single camera with an embedded chip and a scale model car, which is expected to simplify the hardware for autonomous driving.

Details

Data Technologies and Applications, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 16 July 2024

Peng Wu, Heng Su, Hao Dong, Tengfei Liu, Min Li and Zhihao Chen

Robotic arms play a crucial role in various industrial operations, such as sorting, assembly, handling and spraying. However, traditional robotic arm control algorithms often…

Abstract

Purpose

Robotic arms play a crucial role in various industrial operations, such as sorting, assembly, handling and spraying. However, traditional robotic arm control algorithms often struggle to adapt when faced with the challenge of dynamic obstacles. This paper aims to propose a dynamic obstacle avoidance method based on reinforcement learning to address real-time processing of dynamic obstacles.

Design/methodology/approach

This paper introduces an innovative method that introduces a feature extraction network that integrates gating mechanisms on the basis of traditional reinforcement learning algorithms. Additionally, an adaptive dynamic reward mechanism is designed to optimize the obstacle avoidance strategy.

Findings

Validation through the CoppeliaSim simulation environment and on-site testing has demonstrated the method's capability to effectively evade randomly moving obstacles, with a significant improvement in the convergence speed compared to traditional algorithms.

Originality/value

The proposed dynamic obstacle avoidance method based on Reinforcement Learning not only accomplishes the task of dynamic obstacle avoidance efficiently but also offers a distinct advantage in terms of convergence speed. This approach provides a novel solution to the obstacle avoidance methods for robotic arms.

Details

Industrial Robot: the international journal of robotics research and application, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0143-991X

Keywords

Article
Publication date: 9 July 2024

Anjali Tiwari, Saleena Khan, Renju Chandran and Alok Tewari

This study dwells into the crucial aspects of gig workers' jobs that are absent, with specific focus on the work life of food delivery workers (FDWs) and how it impacts their work…

Abstract

Purpose

This study dwells into the crucial aspects of gig workers' jobs that are absent, with specific focus on the work life of food delivery workers (FDWs) and how it impacts their work happiness.

Design/methodology/approach

To create a conceptual model, 21 delivery workers were first interviewed, and the data gathered were scrutinized. Subsequently, a questionnaire was sent to 493 delivery partners, probing about their opinions of work factors that could affect their level of happiness. The collected data were put to study by the authors using AMOS and SPSS.

Findings

Five missing work components were revealed by qualitative investigation. The absence of voice, recognition, career growth, work satisfaction, and dignity at work contributed to unhappiness of the workers. The qualitative analysis was supported by quantitative findings. Additionally, company policy moderated the relationship between absence of voice, absence of career growth, absence of job satisfaction and absence of work happiness.

Originality/value

To the best of the authors’ knowledge, this is one of the first studies using a mixed-method approach to identify factors contributing to the unhappiness of FDWs in work. The originality of this study also lies in establishing the moderating influence of company policy on the relationship between the absence of voice, absence of career growth, absence of work satisfaction and absence of workplace happiness among the workers.

Details

Employee Relations: The International Journal, vol. 46 no. 5
Type: Research Article
ISSN: 0142-5455

Keywords

Open Access
Article
Publication date: 30 April 2024

Armando Di Meglio, Nicola Massarotti and Perumal Nithiarasu

In this study, the authors propose a novel digital twinning approach specifically designed for controlling transient thermal systems. The purpose of this study is to harness the…

Abstract

Purpose

In this study, the authors propose a novel digital twinning approach specifically designed for controlling transient thermal systems. The purpose of this study is to harness the combined power of deep learning (DL) and physics-based methods (PBM) to create an active virtual replica of the physical system.

Design/methodology/approach

To achieve this goal, we introduce a deep neural network (DNN) as the digital twin and a Finite Element (FE) model as the physical system. This integrated approach is used to address the challenges of controlling an unsteady heat transfer problem with an integrated feedback loop.

Findings

The results of our study demonstrate the effectiveness of the proposed digital twinning approach in regulating the maximum temperature within the system under varying and unsteady heat flux conditions. The DNN, trained on stationary data, plays a crucial role in determining the heat transfer coefficients necessary to maintain temperatures below a defined threshold value, such as the material’s melting point. The system is successfully controlled in 1D, 2D and 3D case studies. However, careful evaluations should be conducted if such a training approach, based on steady-state data, is applied to completely different transient heat transfer problems.

Originality/value

The present work represents one of the first examples of a comprehensive digital twinning approach to transient thermal systems, driven by data. One of the noteworthy features of this approach is its robustness. Adopting a training based on dimensionless data, the approach can seamlessly accommodate changes in thermal capacity and thermal conductivity without the need for retraining.

Details

International Journal of Numerical Methods for Heat & Fluid Flow, vol. 34 no. 6
Type: Research Article
ISSN: 0961-5539

Keywords

Article
Publication date: 12 July 2024

Peng Guo, Weiyong Si and Chenguang Yang

The purpose of this paper is to enhance the performance of robots in peg-in-hole assembly tasks, enabling them to swiftly and robustly accomplish the task. It also focuses on the…

69

Abstract

Purpose

The purpose of this paper is to enhance the performance of robots in peg-in-hole assembly tasks, enabling them to swiftly and robustly accomplish the task. It also focuses on the robot’s ability to generalize across assemblies with different hole sizes.

Design/methodology/approach

Human behavior in peg-in-hole assembly serves as inspiration, where individuals visually locate the hole firstly and then continuously adjust the peg pose based on force/torque feedback during the insertion process. This paper proposes a novel framework that integrate visual servo and adjustment based on force/torque feedback, the authors use deep neural network (DNN) and image processing techniques to determine the pose of hole, then an incremental learning approach based on a broad learning system (BLS) is used to simulate human learning ability, the number of adjustments required for insertion process is continuously reduced.

Findings

The author conducted experiments on visual servo, adjustment based on force/torque feedback, and the proposed framework. Visual servo inferred the pixel position and orientation of the target hole in only about 0.12 s, and the robot achieved peg insertion with 1–3 adjustments based on force/torque feedback. The success rate for peg-in-hole assembly using the proposed framework was 100%. These results proved the effectiveness of the proposed framework.

Originality/value

This paper proposes a framework for peg-in-hole assembly that combines visual servo and adjustment based on force/torque feedback. The assembly tasks are accomplished using DNN, image processing and BLS. To the best of the authors’ knowledge, no similar methods were found in other people’s work. Therefore, the authors believe that this work is original.

Details

Robotic Intelligence and Automation, vol. 44 no. 5
Type: Research Article
ISSN: 2754-6969

Keywords

Article
Publication date: 15 August 2023

Amit Jain

This study aims to develop a model of learning-by-hiring in which knowledge gains may occur at the time of recruitment but also after recruitment when other incumbent…

Abstract

Purpose

This study aims to develop a model of learning-by-hiring in which knowledge gains may occur at the time of recruitment but also after recruitment when other incumbent organizational members assimilate a recruit’s knowledge. The author’s model predicts that experienced recruits are more likely to catalyze change to their organization’s core technological capabilities.

Design/methodology/approach

The continuous-time parametric hazard rate regressions predict core technological change in a long panel (1970–2017) of US biotechnology industry patent data. The author uses over 140,000 patents to model the evolution of knowledge of over 52,000 scientists and over 4,450 firms. To address endogeneity concerns, the author uses the Heckman selection method and does robustness tests using a difference-in-difference analysis.

Findings

The author finds that a hire’s prior research and development (R&D) experience helps overcome inertia arising from her or his new-to-an-organization “distant” knowledge to increase the likelihood of core technological change. In addition, while the author finds that incumbent organizational members resist technological change, experienced hires may effectively induce them to adopt new ways of doing things. This is particularly the case when hires collaborate with incumbents in R&D projects. Understanding the effects of hiring on core technological change, therefore, benefits from an assessment of hire R&D experience and its effects on incumbent inertia in an organization.

Practical implications

First, the author does not recommend managers to hire scientists with considerable distant knowledge only as this may be detrimental to core technological change. Second, the author recommends organizations striving to effectuate technological change to hire people with considerable prior R&D experience as this confers them with the ability to influence other members and socialize incumbent members. Third, the author recommends that managers hire people with both significant levels of prior experience and distant knowledge as they are complements. Finally, the author recommends managers to encourage collaboration between highly experienced hired scientists and long-tenured incumbent organizational members to facilitate incumbent learning, socialization and adoption of new ways of doing things.

Originality/value

This study develops a model of learning-by-hiring, which, to the best of the authors’ knowledge, is the first to propose, test and advance KM literature by showing the effectiveness of experienced hires to stimulate knowledge diffusion and core technological change over time after being hired. This study contributes to innovation, organizational learning and strategy literatures.

Article
Publication date: 9 August 2024

Reza Basiri, Mansour Abedian, Saeed Aghasi and Zahra Dashtaali

Over the last years, powerful advances in the area of dynamic games have enriched game theory and made it more applicable to the modeling of real-world competitive strategies. The…

Abstract

Purpose

Over the last years, powerful advances in the area of dynamic games have enriched game theory and made it more applicable to the modeling of real-world competitive strategies. The study of strategic behaviors of firms in an oligopoly market has received little attention, even though real firms have been shown to compete in output and in price in a single industry. The purpose of this study is to propose a game-theoretic approach to studying strategic behaviors of firms in an oligopoly market structure.

Design/methodology/approach

This approach was developed to study market dynamics and pricing strategic behavior of firms that have the possibility of deciding to be one of the two types (price-maker or price-taker) and reconsider the choice overtime on the basis of their current insights and knowledge and their experience. Firms try to improve their performance in the competitive market in a strategic way, by considering their steady-state profits and choosing the best type given the other firms’ types, actions and interactions.

Findings

The results of the present study confirm the previous study that the Cournot market is a stable market, where each firm can be a price-maker and enjoy individual learning as well as social learning. On the contrary, the market with price-takers only is never stable, and, therefore, the Walrasian equilibrium may not be supported in some instances. The Cournot market loses its stability as the number of firms in the market increases due to the fact that it will be more profitable for a firm to switch to price-taking when the number of firms is high enough. In such a situation, when the number of price-takers increases, there are no stable markets and price dynamics are destabilized.

Originality/value

The study and modeling of real-world competitive strategies would enhance the understanding of oligopoly markets. The study of strategic behaviors of firms in an oligopoly market has received little attention, even though real firms have been shown to compete in output and in price in a single industry as price-takers and price-makers.

Details

Journal of Modelling in Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1746-5664

Keywords

1 – 10 of 74