Search results

1 – 10 of over 162000
Article
Publication date: 13 September 2019

Zirui Jia and Zengli Wang

Frequent itemset mining (FIM) is a basic topic in data mining. Most FIM methods build itemset database containing all possible itemsets, and use predefined thresholds to determine…

Abstract

Purpose

Frequent itemset mining (FIM) is a basic topic in data mining. Most FIM methods build itemset database containing all possible itemsets, and use predefined thresholds to determine whether an itemset is frequent. However, the algorithm has some deficiencies. It is more fit for discrete data rather than ordinal/continuous data, which may result in computational redundancy, and some of the results are difficult to be interpreted. The purpose of this paper is to shed light on this gap by proposing a new data mining method.

Design/methodology/approach

Regression pattern (RP) model will be introduced, in which the regression model and FIM method will be combined to solve the existing problems. Using a survey data of computer technology and software professional qualification examination, the multiple linear regression model is selected to mine associations between items.

Findings

Some interesting associations mined by the proposed algorithm and the results show that the proposed method can be applied in ordinal/continuous data mining area. The experiment of RP model shows that, compared to FIM, the computational redundancy decreased and the results contain more information.

Research limitations/implications

The proposed algorithm is designed for ordinal/continuous data and is expected to provide inspiration for data stream mining and unstructured data mining.

Practical implications

Compared to FIM, which mines associations between discrete items, RP model could mine associations between ordinal/continuous data sets. Importantly, RP model performs well in saving computational resource and mining meaningful associations.

Originality/value

The proposed algorithms provide a novelty view to define and mine association.

Details

Data Technologies and Applications, vol. 54 no. 3
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 5 May 2021

Shanshan Wang, Jiahui Xu, Youli Feng, Meiling Peng and Kaijie Ma

This study aims to overcome the problem of traditional association rules relying almost entirely on expert experience to set relevant interest indexes in mining. Second, this…

Abstract

Purpose

This study aims to overcome the problem of traditional association rules relying almost entirely on expert experience to set relevant interest indexes in mining. Second, this project can effectively solve the problem of four types of rules being present in the database at the same time. The traditional association algorithm can only mine one or two types of rules and cannot fully explore the database knowledge in the decision-making process for library recommendation.

Design/methodology/approach

The authors proposed a Markov logic network method to reconstruct association rule-mining tasks for library recommendation and compared the method proposed in this paper to traditional Apriori, FP-Growth, Inverse, Sporadic and UserBasedCF algorithms on two history library data sets and the Chess and Accident data sets.

Findings

The method used in this project had two major advantages. First, the authors were able to mine four types of rules in an integrated manner without having to set interest measures. In addition, because it represents the relevance of mining in the network, decision-makers can use network visualization tools to fully understand the results of mining in library recommendation and data sets from other fields.

Research limitations/implications

The time cost of the project is still high for large data sets. The authors will solve this problem by mapping books, items, or attributes to higher granularity to reduce the computational complexity in the future.

Originality/value

The authors believed that knowledge of complex real-world problems can be well captured from a network perspective. This study can help researchers to avoid setting interest metrics and to comprehensively extract frequent, rare, positive, and negative rules in an integrated manner.

Details

Information Discovery and Delivery, vol. 50 no. 1
Type: Research Article
ISSN: 2398-6247

Keywords

Article
Publication date: 6 February 2019

Ganjar Alfian, Muhammad Fazal Ijaz, Muhammad Syafrudin, M. Alex Syaekhoni, Norma Latif Fitriyani and Jongtae Rhee

The purpose of this paper is to propose customer behavior analysis based on real-time data processing and association rule for digital signage-based online store (DSOS). The…

3169

Abstract

Purpose

The purpose of this paper is to propose customer behavior analysis based on real-time data processing and association rule for digital signage-based online store (DSOS). The real-time data processing based on big data technology (such as NoSQL MongoDB and Apache Kafka) is utilized to handle the vast amount of customer behavior data.

Design/methodology/approach

In order to extract customer behavior patterns, customers’ browsing history and transactional data from digital signage (DS) could be used as the input for decision making. First, the authors developed a DSOS and installed it in different locations, so that customers could have the experience of browsing and buying a product. Second, the real-time data processing system gathered customers’ browsing history and transaction data as it occurred. In addition, the authors utilized the association rule to extract useful information from customer behavior, so it may be used by the managers to efficiently enhance the service quality.

Findings

First, as the number of customers and DS increases, the proposed system was capable of processing a gigantic amount of input data conveniently. Second, the data set showed that as the number of visit and shopping duration increases, the chance of products being purchased also increased. Third, by combining purchasing and browsing data from customers, the association rules from the frequent transaction pattern were achieved. Thus, the products will have a high possibility to be purchased if they are used as recommendations.

Research limitations/implications

This research empirically supports the theory of association rule that frequent patterns, correlations or causal relationship found in various kinds of databases. The scope of the present study is limited to DSOS, although the findings can be interpreted and generalized in a global business scenario.

Practical implications

The proposed system is expected to help management in taking decisions such as improving the layout of the DS and providing better product suggestions to the customer.

Social implications

The proposed system may be utilized to promote green products to the customer, having a positive impact on sustainability.

Originality/value

The key novelty of the present study lies in system development based on big data technology to handle the enormous amounts of data as well as analyzing the customer behavior in real time in the DSOS. The real-time data processing based on big data technology (such as NoSQL MongoDB and Apache Kafka) is used to handle the vast amount of customer behavior data. In addition, the present study proposed association rule to extract useful information from customer behavior. These results can be used for promotion as well as relevant product recommendations to DSOS customers. Besides in today’s changing retail environment, analyzing the customer behavior in real time in DSOS helps to attract and retain customers more efficiently and effectively, and retailers can get a competitive advantage over their competitors.

Details

Asia Pacific Journal of Marketing and Logistics, vol. 31 no. 1
Type: Research Article
ISSN: 1355-5855

Keywords

Article
Publication date: 19 May 2020

Praveen Kumar Gopagoni and Mohan Rao S K

Association rule mining generates the patterns and correlations from the database, which requires large scanning time, and the cost of computation associated with the generation…

Abstract

Purpose

Association rule mining generates the patterns and correlations from the database, which requires large scanning time, and the cost of computation associated with the generation of the rules is quite high. On the other hand, the candidate rules generated using the traditional association rules mining face a huge challenge in terms of time and space, and the process is lengthy. In order to tackle the issues of the existing methods and to render the privacy rules, the paper proposes the grid-based privacy association rule mining.

Design/methodology/approach

The primary intention of the research is to design and develop a distributed elephant herding optimization (EHO) for grid-based privacy association rule mining from the database. The proposed method of rule generation is processed as two steps: in the first step, the rules are generated using apriori algorithm, which is the effective association rule mining algorithm. In general, the extraction of the association rules from the input database is based on confidence and support that is replaced with new terms, such as probability-based confidence and holo-entropy. Thus, in the proposed model, the extraction of the association rules is based on probability-based confidence and holo-entropy. In the second step, the generated rules are given to the grid-based privacy rule mining, which produces privacy-dependent rules based on a novel optimization algorithm and grid-based fitness. The novel optimization algorithm is developed by integrating the distributed concept in EHO algorithm.

Findings

The experimentation of the method using the databases taken from the Frequent Itemset Mining Dataset Repository to prove the effectiveness of the distributed grid-based privacy association rule mining includes the retail, chess, T10I4D100K and T40I10D100K databases. The proposed method outperformed the existing methods through offering a higher degree of privacy and utility, and moreover, it is noted that the distributed nature of the association rule mining facilitates the parallel processing and generates the privacy rules without much computational burden. The rate of hiding capacity, the rate of information preservation and rate of the false rules generated for the proposed method are found to be 0.4468, 0.4488 and 0.0654, respectively, which is better compared with the existing rule mining methods.

Originality/value

Data mining is performed in a distributed manner through the grids that subdivide the input data, and the rules are framed using the apriori-based association mining, which is the modification of the standard apriori with the holo-entropy and probability-based confidence replacing the support and confidence in the standard apriori algorithm. The mined rules do not assure the privacy, and hence, the grid-based privacy rules are employed that utilize the adaptive elephant herding optimization (AEHO) for generating the privacy rules. The AEHO inherits the adaptive nature in the standard EHO, which renders the global optimal solution.

Details

Data Technologies and Applications, vol. 54 no. 3
Type: Research Article
ISSN: 2514-9288

Keywords

Open Access
Article
Publication date: 6 October 2023

Renata Konrad, Solomiya Sorokotyaha and Daniel Walker

Conflict and violence are the main drivers of globally escalating humanitarian needs. Local grassroots initiatives are pivotal in distributing humanitarian supplies in the acute…

Abstract

Purpose

Conflict and violence are the main drivers of globally escalating humanitarian needs. Local grassroots initiatives are pivotal in distributing humanitarian supplies in the acute response phase until more established humanitarian aid organizations can enter. Nevertheless, scant research exists regarding the role of grassroots associations in providing humanitarian assistance during a military conflict. The purpose of this paper is to understand the role of grassroots associations and identify important themes for effective operations.

Design/methodology/approach

This paper adopts a case-study approach of three Ukrainian grassroots associations that began operating in the immediate days of the full-scale invasion of Ukraine. The findings are based on analyzing primary sources, including interviews with Ukrainian volunteers, and are supported by secondary sources.

Findings

Grassroots associations have local contacts and a contextual understanding of population needs and can respond more rapidly and effectively than large intergovernmental agencies. Four critical themes regarding the operations of grassroots associations emerged: information management, inventory management, coordination and performance measurement. Grassroots humanitarian response operations during conflict are challenged by personal security risks, the unpredictability of unsolicited supplies, emerging volunteer roles, dynamic transportation routes and shifting demands.

Originality/value

Grassroots responses are central to humanitarian responses during the acute phase of a military conflict. By examining the operations of grassroots associations in the early months of the 2022 war in Ukraine, the authors provide a unique perspective on humanitarian logistics. Nonetheless, more inclusive models of humanitarian responses are needed to harness the capacities and resilience of grassroots operations in practice.

Details

Journal of Humanitarian Logistics and Supply Chain Management, vol. 14 no. 2
Type: Research Article
ISSN: 2042-6747

Keywords

Article
Publication date: 16 May 2019

Mohamed Marzouk and Mohamed Enaba

The purpose of this paper is to expand the benefits of building information modeling (BIM) to include data analytics to analyze construction project performance. BIM is a great…

1017

Abstract

Purpose

The purpose of this paper is to expand the benefits of building information modeling (BIM) to include data analytics to analyze construction project performance. BIM is a great tool which improves communication and information flow between construction project parties. This research aims to integrate different types of data within the BIM environment, then, to perform descriptive data analytics. Data analytics helps in identifying hidden patterns and detecting relationships between different attributes in the database.

Design/methodology/approach

This research is considered to be an inductive research that starts with an observation of integrating BIM and descriptive data analytics. Thus, the project’s correspondence, daily progress reports and inspection requests are integrated within the project 5D BIM model. Subsequently, data mining comprising association analysis, clustering and trend analysis is performed. The research hypothesis is that descriptive data analytics and BIM have a great leverage to analyze construction project performance. Finally, a case study for a construction project is carried out to test the research hypothesis.

Findings

The research finds that integrating BIM and descriptive data analytics helps in improving project communication performance, in terms of integrating project data in a structured format, efficiently retrieving useful information from project raw data and visualizing analytics results within the BIM environment.

Originality/value

The research develops a dynamic model that helps in detecting hidden patterns and different progress attributes from construction project raw data.

Details

Built Environment Project and Asset Management, vol. 9 no. 4
Type: Research Article
ISSN: 2044-124X

Keywords

Article
Publication date: 8 January 2018

Mahmood Reza Khabbazi, Jan Wikander, Mauro Onori and Antonio Maffei

This paper introduces a schema for the product assembly feature data in an object-oriented and module-based format using Unified Modeling Language (UML). To link production with…

Abstract

Purpose

This paper introduces a schema for the product assembly feature data in an object-oriented and module-based format using Unified Modeling Language (UML). To link production with product design, it is essential to determine at an early stage which entities of product design and development are involved and used at the automated assembly planning and operations. To this end, it is absolutely reasonable to assign meaningful attributes to the parts’ design entities (assembly features) in a systematic and structured way. As such, this approach empowers processes such as motion planning and sequence planning in assembly design.

Design/methodology/approach

The assembly feature data requirements are studied and definitions are analyzed and redefined. Using object-oriented techniques, the assembly feature data structure and relationships are modeled based on the identified requirements as five UML packages (Part, three-dimensional (3D) models, Mating, Joint and Handling). All geometric and non-geometric design data entities endorsed with assembly design perspective are extracted or assigned from 3D models and realized through the featured entity interface class. The featured entities are then associated (used) with the mating, handling and joints features. The AssemblyFeature interface is realized through mating, handling and joint packages related to the assembly and part classes. Each package contains all relevant classes which further classify the important attributes of the main class.

Findings

This paper sets out to provide an explanatory approach using object-oriented techniques to model the schema of assembly features association and artifacts at the product design level, all of which are essential in several subsequent and parallel steps of the assembly planning process, as well as assembly feature entity assignments in design improvement cycle.

Practical implications

The practical implication based on the identified advantages can be classified in three main features: module-based design, comprehensive classification, integration. These features help the automation and solution development processes based on the proposed models much easier and systematic.

Originality/value

The proposed schema’s comprehensiveness and reliability are verified through comparisons with other works and the advantages are discussed in detail.

Details

Assembly Automation, vol. 38 no. 1
Type: Research Article
ISSN: 0144-5154

Keywords

Article
Publication date: 6 December 2023

Qing Fan

The purpose of this article is to contribute to the digital development and utilization of China’s intangible cultural heritage resources, research on the theft of intangible…

Abstract

Purpose

The purpose of this article is to contribute to the digital development and utilization of China’s intangible cultural heritage resources, research on the theft of intangible cultural heritage resources and knowledge integration based on linked data is proposed to promote the standardized description of intangible cultural heritage knowledge and realize the digital dissemination and development of intangible cultural heritage.

Design/methodology/approach

In this study, firstly, the knowledge organization theory and semantic Web technology are used to describe the intangible cultural heritage digital resource objects in metadata specifications. Secondly, the ontology theory and technical methods are used to build a conceptual model of the intangible cultural resources field and determine the concept sets and hierarchical relationships in this field. Finally, the semantic Web technology is used to establish semantic associations between intangible cultural heritage resource knowledge.

Findings

The study findings indicate that the knowledge organization of intangible cultural heritage resources constructed in this study provides a solution for the digital development of intangible cultural heritage in China. It also provides semantic retrieval with better knowledge granularity and helps to visualize the knowledge content of intangible cultural heritage.

Originality/value

This study summarizes and provides significant theoretical and practical value for the digital development of intangible cultural heritage and the resource description and knowledge fusion of intangible cultural heritage can help to discover the semantic relationship of intangible cultural heritage in multiple dimensions and levels.

Details

The Electronic Library , vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0264-0473

Keywords

Article
Publication date: 1 January 2016

Cheng-Hsiung Weng

The paper aims to understand the book subscription characteristics of the students at each college and help the library administrators to conduct efficient library management…

Abstract

Purpose

The paper aims to understand the book subscription characteristics of the students at each college and help the library administrators to conduct efficient library management plans for books in the library. Unlike the traditional association rule mining (ARM) techniques which mine patterns from a single data set, this paper proposes a model, recency-frequency-college (RFC) model, to analyse book subscription characteristics of library users and then discovers interesting association rules from equivalence-class RFC segments.

Design/methodology/approach

A framework which integrates the RFC model and ARM technique is proposed to analyse book subscription characteristics of library users. First, the author applies the RFC model to determine library users’ RFC values. After that, the author clusters library users’ transactions into several RFC segments by their RFC values. Finally, the author discovers RFC association rules and analyses book subscription characteristics of RFC segments (library users).

Findings

The paper provides experimental results from the survey data. It shows that the precision of the frequent itemsets discovered by the proposed RFC model outperforms the traditional approach in predicting library user subscription itemsets in the following time periods. Besides, the proposed approach can discover interesting and valuable patterns from library book circulation transactions.

Research limitations/implications

Because RFC thresholds were assigned based on expert opinion in this paper, it is an acquisition bottleneck. Therefore, researchers are encouraged to automatically infer the RFC thresholds from the library book circulation transactions.

Practical implications

The paper includes implications for the library administrators in conducting library book management plans for different library users.

Originality/value

This paper proposes a model, the RFC model, to analyse book subscription characteristics of library users.

Details

The Electronic Library, vol. 34 no. 5
Type: Research Article
ISSN: 0264-0473

Keywords

Article
Publication date: 16 July 2021

Navid Nezafati, Shokouh Razaghi, Hossein Moradi, Sajjad Shokouhyar and Sepideh Jafari

This paper aims to identify the impact of demographical and organizational variables such as age, gender, experiences use of knowledge management system (KMS), education and job…

Abstract

Purpose

This paper aims to identify the impact of demographical and organizational variables such as age, gender, experiences use of knowledge management system (KMS), education and job level on knowledge sharing (KS) performance of knowledge workers in knowledge activities of a KMS. Specifically, it seeks to explore that is there any relationship between the KS behavior patterns of high KS performance knowledge workers with their performance. Furthermore, this study using its conceptual attitude model aims to show that whether knowledge workers’ behavior patterns in sharing information and knowledge throughout a KMS have any specific effect or not.

Design/methodology/approach

This paper proposed a framework to mine knowledge workers’ raw data using data mining techniques such as clustering and association rules mining. Also, this research uses a case-based approach to a knowledge-intensive company in Iran that works in the field of information technology with 730 numbers of workers.

Findings

Findings suggest that demographical and organizational variables such as age, education and experience use of KMS have positive effects on knowledge worker’s KS behavior in KMSs. In fact, people who have lower age, higher education degrees and more experience use of KMS, have more participation in KS in KMS. Also, results depict that the experienced use of KMS has the most impact on the intention of KS in this KMS. Findings emphasize on the importance of the influence of the behavioral, organizational environments and psychological factors such as reward system, top management support, openness and trust, on KS performance of knowledge workers in the KMS. In fact, according to data, the KMS reward system caused to increasing participation of the users in KS, also in each knowledge activity that top managers participate in, the scores were higher.

Practical implications

This research helps top managers in designing policies and strategies to improve the participation of knowledge workers in KS and helps human resource managers to improve their membership policies. Also, assist Information Technology (IT) managers to enhance KMSs’ design to leverage with organization strategies in the field of improving KS and encourage people to participate in KMS.

Originality/value

This research has two key values. First, this paper applies a data mining framework to mining and analyzing data and this paper uses actual data of a KMS in a specialist company in Iran, with about 27,740 real data points. Second, this paper investigates the impact of demographical and organizational attributes on KS behavior, which little is empirically known about the impact of demographical variables on KS intention.

Details

VINE Journal of Information and Knowledge Management Systems, vol. 53 no. 4
Type: Research Article
ISSN: 2059-5891

Keywords

1 – 10 of over 162000