Search results

1 – 10 of 10
Article
Publication date: 14 August 2017

Neha Verma and Jatinder Singh

The purpose of this paper is to explore various limitations of conventional mining systems in extracting useful buying patterns from retail transactional databases flooded with…

1868

Abstract

Purpose

The purpose of this paper is to explore various limitations of conventional mining systems in extracting useful buying patterns from retail transactional databases flooded with Big Data. The key objective is to assist retail business owners to better understand the purchase needs of their customers and hence to attract customers to physical retail stores away from competitor e-commerce websites.

Design/methodology/approach

This paper employs a systematic and category-based review of relevant literature to explore the challenges possessed by Big Data for retail industry followed by discussion and implementation of association between MapReduce based Apriori association mining and Hadoop-based intelligent cloud architecture.

Findings

The findings reveal that conventional mining algorithms have not evolved to support Big Data analysis as required by modern retail businesses. They require a lot of resources such as memory and computational engines. This study aims to develop MR-Apriori algorithm in the form of IRM tool to address all these issues in an efficient manner.

Research limitations/implications

The paper suggests that a lot of research is yet to be done in market basket analysis, if full potential of cloud-based Big Data framework is required to be utilized.

Originality/value

This research arms the retail business owners with innovative IRM tool to easily extract comprehensive knowledge of useful buying patterns of customers to increase profits. This study experimentally verifies the effectiveness of proposed algorithm.

Details

Industrial Management & Data Systems, vol. 117 no. 7
Type: Research Article
ISSN: 0263-5577

Keywords

Article
Publication date: 21 December 2021

Laouni Djafri

This work can be used as a building block in other settings such as GPU, Map-Reduce, Spark or any other. Also, DDPML can be deployed on other distributed systems such as P2P…

384

Abstract

Purpose

This work can be used as a building block in other settings such as GPU, Map-Reduce, Spark or any other. Also, DDPML can be deployed on other distributed systems such as P2P networks, clusters, clouds computing or other technologies.

Design/methodology/approach

In the age of Big Data, all companies want to benefit from large amounts of data. These data can help them understand their internal and external environment and anticipate associated phenomena, as the data turn into knowledge that can be used for prediction later. Thus, this knowledge becomes a great asset in companies' hands. This is precisely the objective of data mining. But with the production of a large amount of data and knowledge at a faster pace, the authors are now talking about Big Data mining. For this reason, the authors’ proposed works mainly aim at solving the problem of volume, veracity, validity and velocity when classifying Big Data using distributed and parallel processing techniques. So, the problem that the authors are raising in this work is how the authors can make machine learning algorithms work in a distributed and parallel way at the same time without losing the accuracy of classification results. To solve this problem, the authors propose a system called Dynamic Distributed and Parallel Machine Learning (DDPML) algorithms. To build it, the authors divided their work into two parts. In the first, the authors propose a distributed architecture that is controlled by Map-Reduce algorithm which in turn depends on random sampling technique. So, the distributed architecture that the authors designed is specially directed to handle big data processing that operates in a coherent and efficient manner with the sampling strategy proposed in this work. This architecture also helps the authors to actually verify the classification results obtained using the representative learning base (RLB). In the second part, the authors have extracted the representative learning base by sampling at two levels using the stratified random sampling method. This sampling method is also applied to extract the shared learning base (SLB) and the partial learning base for the first level (PLBL1) and the partial learning base for the second level (PLBL2). The experimental results show the efficiency of our solution that the authors provided without significant loss of the classification results. Thus, in practical terms, the system DDPML is generally dedicated to big data mining processing, and works effectively in distributed systems with a simple structure, such as client-server networks.

Findings

The authors got very satisfactory classification results.

Originality/value

DDPML system is specially designed to smoothly handle big data mining classification.

Details

Data Technologies and Applications, vol. 56 no. 4
Type: Research Article
ISSN: 2514-9288

Keywords

Book part
Publication date: 4 December 2020

Gauri Rajendra Virkar and Supriya Sunil Shinde

Predictive analytics is the science of decision-making that eliminates guesswork out of the decision-making process and applies proven scientific procedures to find right…

Abstract

Predictive analytics is the science of decision-making that eliminates guesswork out of the decision-making process and applies proven scientific procedures to find right solutions. Predictive analytics provides ideas on the occurrences of future downtimes and rejections thereby aids in taking preventive actions before abnormalities occur. Considering these advantages, predictive analytics is adopted in various diverse fields such as health care, finance, education, marketing, automotive, etc. Predictive analytics tools can be used to predict various behaviors and patterns, thereby saving the time and money of its users. Many open-source predictive analysis tools namely R, scikit-learn, Konstanz Information Miner (KNIME), Orange, RapidMiner, Waikato Environment for Knowledge Analysis (WEKA), etc. are freely available for the users. This chapter aims to reveal the best accurate tools and techniques for the classification task that aid in decision-making. Our experimental results show that no specific tool provides the best results in all scenarios; rather it depends upon the datasets and the classifier.

Article
Publication date: 30 August 2022

Pinsheng Duan, Jianliang Zhou and Wenhan Fan

Effective construction safety training has been considered to play a significant role in reducing the incidence of accidents. However, the current safety training methods pay less…

Abstract

Purpose

Effective construction safety training has been considered to play a significant role in reducing the incidence of accidents. However, the current safety training methods pay less attention to the relationship between workers' personalized characteristics and their learning needs, which results in workers' low learning participation and poor training effect. The purpose of this paper is to improve the participation and effect of safety training for construction workers with a persona-based approach.

Design/methodology/approach

This paper presents a persona-based approach to safety tag generation and training material recommendation. By extracting the demographic characteristics and behavior patterns tags of construction workers, a neural network algorithm is introduced to calculate the learning needs tags of workers, and the collaborative filtering recommendation method is integrated to enrich the innovation of recommendation results. Offline experiments and online experiments are designed to verify the rationality of the proposed method.

Findings

The results show that the learning needs of workers are closely related to their background. The proposed method can effectively improve workers' interest in materials and the training effect compared with conventional safety training methods. The research provides a theoretical and practical reference for promoting active safety management and achieving worker-centered safety management.

Originality/value

First, a persona-based approach is introduced to establish a novel framework for solving the problem of personalized construction safety management. Second, an artificial intelligence algorithm is used to automatically extract the learning needs tag values and design a hybrid recommendation method for construction workers' personalized safety training. The collaborative filtering method is integrated to enrich the innovation of recommendation results.

Details

Engineering, Construction and Architectural Management, vol. 31 no. 1
Type: Research Article
ISSN: 0969-9988

Keywords

Open Access
Article
Publication date: 26 July 2021

Weifei Hu, Tongzhou Zhang, Xiaoyu Deng, Zhenyu Liu and Jianrong Tan

Digital twin (DT) is an emerging technology that enables sophisticated interaction between physical objects and their virtual replicas. Although DT has recently gained significant…

12113

Abstract

Digital twin (DT) is an emerging technology that enables sophisticated interaction between physical objects and their virtual replicas. Although DT has recently gained significant attraction in both industry and academia, there is no systematic understanding of DT from its development history to its different concepts and applications in disparate disciplines. The majority of DT literature focuses on the conceptual development of DT frameworks for a specific implementation area. Hence, this paper provides a state-of-the-art review of DT history, different definitions and models, and six types of key enabling technologies. The review also provides a comprehensive survey of DT applications from two perspectives: (1) applications in four product-lifecycle phases, i.e. product design, manufacturing, operation and maintenance, and recycling and (2) applications in four categorized engineering fields, including aerospace engineering, tunneling and underground engineering, wind engineering and Internet of things (IoT) applications. DT frameworks, characteristic components, key technologies and specific applications are extracted for each DT category in this paper. A comprehensive survey of the DT references reveals the following findings: (1) The majority of existing DT models only involve one-way data transfer from physical entities to virtual models and (2) There is a lack of consideration of the environmental coupling, which results in the inaccurate representation of the virtual components in existing DT models. Thus, this paper highlights the role of environmental factor in DT enabling technologies and in categorized engineering applications. In addition, the review discusses the key challenges and provides future work for constructing DTs of complex engineering systems.

Details

Journal of Intelligent Manufacturing and Special Equipment, vol. 2 no. 1
Type: Research Article
ISSN: 2633-6596

Keywords

Article
Publication date: 25 October 2018

Yoon-Sung Kim, Hae-Chang Rim and Do-Gil Lee

The purpose of this paper is to propose a methodology to analyze a large amount of unstructured textual data into categories of business environmental analysis frameworks.

1915

Abstract

Purpose

The purpose of this paper is to propose a methodology to analyze a large amount of unstructured textual data into categories of business environmental analysis frameworks.

Design/methodology/approach

This paper uses machine learning to classify a vast amount of unstructured textual data by category of business environmental analysis framework. Generally, it is difficult to produce high quality and massive training data for machine-learning-based system in terms of cost. Semi-supervised learning techniques are used to improve the classification performance. Additionally, the lack of feature problem that traditional classification systems have suffered is resolved by applying semantic features by utilizing word embedding, a new technique in text mining.

Findings

The proposed methodology can be used for various business environmental analyses and the system is fully automated in both the training and classifying phases. Semi-supervised learning can solve the problems with insufficient training data. The proposed semantic features can be helpful for improving traditional classification systems.

Research limitations/implications

This paper focuses on classifying sentences that contain the information of business environmental analysis in large amount of documents. However, the proposed methodology has a limitation on the advanced analyses which can directly help managers establish strategies, since it does not summarize the environmental variables that are implied in the classified sentences. Using the advanced summarization and recommendation techniques could extract the environmental variables among the sentences, and they can assist managers to establish effective strategies.

Originality/value

The feature selection technique developed in this paper has not been used in traditional systems for business and industry, so that the whole process can be fully automated. It also demonstrates practicality so that it can be applied to various business environmental analysis frameworks. In addition, the system is more economical than traditional systems because of semi-supervised learning, and can resolve the lack of feature problem that traditional systems suffer. This work is valuable for analyzing environmental factors and establishing strategies for companies.

Details

Industrial Management & Data Systems, vol. 119 no. 1
Type: Research Article
ISSN: 0263-5577

Keywords

Article
Publication date: 5 July 2023

Yuxiang Shan, Qin Ren, Gang Yu, Tiantian Li and Bin Cao

Internet marketing underground industry users refer to people who use technology means to simulate a large number of real consumer behaviors to obtain marketing activities rewards…

Abstract

Purpose

Internet marketing underground industry users refer to people who use technology means to simulate a large number of real consumer behaviors to obtain marketing activities rewards illegally, which leads to increased cost of enterprises and reduced effect of marketing. Therefore, this paper aims to construct a user risk assessment model to identify potential underground industry users to protect the interests of real consumers and reduce the marketing costs of enterprises.

Design/methodology/approach

Method feature extraction is based on two aspects. The first aspect is based on traditional statistical characteristics, using density-based spatial clustering of applications with noise clustering method to obtain user-dense regions. According to the total number of users in the region, the corresponding risk level of the receiving address is assigned. So that high-quality address information can be extracted. The second aspect is based on the time period during which users participate in activities, using frequent item set mining to find multiple users with similar operations within the same time period. Extract the behavior flow chart according to the user participation, so that the model can mine the deep relationship between the participating behavior and the underground industry users.

Findings

Based on the real underground industry user data set, the features of the data set are extracted by the proposed method. The features are experimentally verified by different models such as random forest, fully-connected layer network, SVM and XGBOST, and the proposed method is comprehensively evaluated. Experimental results show that in the best case, our method can improve the F1-score of traditional models by 55.37%.

Originality/value

This paper investigates the relative importance of static information and dynamic behavior characteristics of users in predicting underground industry users, and whether the absence of features of these categories affects the prediction results. This investigation can go a long way in aiding further research on this subject and found the features which improved the accuracy of predicting underground industry users.

Details

International Journal of Web Information Systems, vol. 19 no. 2
Type: Research Article
ISSN: 1744-0084

Keywords

Article
Publication date: 8 February 2019

Chao Wang, Longfeng Zhao, André L.M. Vilela and Ming K. Lim

The purpose of this paper is to examine publication characteristics and dynamic evolution of the Industrial Management & Data Systems (IMDS) over the past 25 years from volume 94…

840

Abstract

Purpose

The purpose of this paper is to examine publication characteristics and dynamic evolution of the Industrial Management & Data Systems (IMDS) over the past 25 years from volume 94, issue 1, in 1994 through volume 118, issue 9, in 2018, using a bibliometric analysis, and identify the leading trends that have affected the journal during this time frame.

Design/methodology/approach

A bibliometric approach was used to provide a basic overview of the IMDS, including distribution of publication and citations, articles citing the IMDS, top-cited papers and publication patterns. Then, a complex network analysis was employed to present the most productive, influential and active authors, institutes and countries/regions. In addition, cluster analysis and alluvial diagram were used to analyze author keywords.

Findings

This study presents the basic bibliometric results for the IMDS and focuses on exploring its performance over the last 25 years. And it reveals the most productive, influential and active authors, institutes and countries/regions in IMDS. Moreover, this study detects the existence of at least five different keywords clusters and discovers how themes have evolved through the intricate citation relationships in IMDS.

Originality/value

The main contribution of this paper is the use of multiple analysis techniques from a complex network paradigm to emphasize the time evolving nature of the co-occurrence networks and to explore the variation of the collaboration networks in the IMDS. For the first time, the evolution of research themes is revealed with a purely data-driven approach.

Details

Industrial Management & Data Systems, vol. 119 no. 1
Type: Research Article
ISSN: 0263-5577

Keywords

Article
Publication date: 17 March 2021

Rakesh Raut, Vaibhav Narwane, Sachin Kumar Mangla, Vinay Surendra Yadav, Balkrishna Eknath Narkhede and Sunil Luthra

This study initially aims to identify the barriers to the big data analytics (BDA) initiative and further evaluates the barriers for knowing their interrelations and priority in…

1039

Abstract

Purpose

This study initially aims to identify the barriers to the big data analytics (BDA) initiative and further evaluates the barriers for knowing their interrelations and priority in improving the performance of manufacturing firms.

Design/methodology/approach

A total of 15 barriers to BDA adoption were identified through literature review and expert opinions. Data were collected from three types of industries: automotive, machine tools and electronics manufacturers in India. The grey-decision-making trial and evaluation laboratory (DEMATEL) method was employed to explore the cause–effect relationship amongst barriers. Further, the barrier's influences were outranked and cross-validated through analytic network process (ANP).

Findings

The results showed that “lack of data storage facility”, “lack of IT infrastructure”, “lack of organisational strategy” and “uncertain about benefits and long terms usage” were most common barriers to adopt BDA practices in all three industries.

Practical implications

The findings of the study can assist service providers, industrial managers and government organisations in understanding the barriers and subsequently evaluating interrelationships and ranks of barriers in the successful adoption of BDA in a manufacturing organisation context.

Originality/value

The paper is one of the initial efforts in evaluating the barriers to BDA in improving the performance of manufacturing firms in India.

Details

Industrial Management & Data Systems, vol. 121 no. 9
Type: Research Article
ISSN: 0263-5577

Keywords

Open Access
Article
Publication date: 22 December 2023

Tao Xu, Hanning Shi, Yongjiang Shi and Jianxin You

The purpose of this paper is to explore the concept of data assets and how companies can assetize their data. Using the literature review methodology, the paper first summarizes…

968

Abstract

Purpose

The purpose of this paper is to explore the concept of data assets and how companies can assetize their data. Using the literature review methodology, the paper first summarizes the conceptual controversies over data assets in the existing literature. Subsequently, the paper defines the concept of data assets. Finally, keywords from the existing research literature are presented visually and a foundational framework for achieving data assetization is proposed.

Design/methodology/approach

This paper uses a systematic literature review approach to discuss the conceptual evolution and strategic imperatives of data assets. To establish a robust research methodology, this paper takes into account two main aspects. First, it conducts a comprehensive review of the existing literature on digital technology and data assets, which enables the derivation of an evolutionary path of data assets and the development of a clear and concise definition of the concept. Second, the paper uses Citespace, a widely used software for literature review, to examine the research framework of enterprise data assetization.

Findings

The paper offers pivotal insights into the realm of data assets. It highlights the changing perceptions of data assets with digital progression and addresses debates on data asset categorization, value attributes and ownership. The study introduces a definitive concept of data assets as electronically recorded data resources with real or potential value under legal parameters. Moreover, it delineates strategic imperatives for harnessing data assets, presenting a practical framework that charts the stages of “resource readiness, capacity building, and data application”, guiding businesses in optimizing their data throughout its lifecycle.

Originality/value

This paper comprehensively explores the issue of data assets, clarifying controversial concepts and categorizations and bridging gaps in the existing literature. The paper introduces a clear conceptualization of data assets, bridging the gap between academia and practice. In addition, the study proposes a strategic framework for data assetization. This study not only helps to promote a unified understanding among academics and professionals but also helps businesses to understand the process of data assetization.

Details

Asia Pacific Journal of Innovation and Entrepreneurship, vol. 18 no. 1
Type: Research Article
ISSN: 2071-1395

Keywords

1 – 10 of 10