Search results

1 – 10 of over 2000
Open Access
Article
Publication date: 2 February 2018

Wil van der Aalst

Abstract

Purpose

Process mining provides a generic collection of techniques to turn event data into valuable insights, improvement ideas, predictions, and recommendations. This paper uses spreadsheets as a metaphor to introduce process mining as an essential tool for data scientists and business analysts. The purpose of this paper is to illustrate that process mining can do with events what spreadsheets can do with numbers.

Design/methodology/approach

The paper discusses the main concepts in both spreadsheets and process mining. Using a concrete data set as a running example, the different types of process mining are explained. Where spreadsheets work with numbers, process mining starts from event data with the aim of analyzing processes.
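As a rough illustration of what starting from event data can look like, the sketch below counts directly-follows relations, a common first step in process discovery. It is not taken from the paper: the event log, its activity names and the `directly_follows` helper are hypothetical and invented for this example.

```python
from collections import Counter, defaultdict

# Hypothetical event log (invented for illustration): (case id, activity, timestamp).
EVENT_LOG = [
    ("c1", "register", 1), ("c1", "check", 2), ("c1", "pay", 3),
    ("c2", "register", 1), ("c2", "pay", 2),
    ("c3", "register", 1), ("c3", "check", 2), ("c3", "check", 3), ("c3", "pay", 4),
]

def directly_follows(event_log):
    """Count how often activity a is directly followed by activity b within a case."""
    traces = defaultdict(list)
    for case_id, activity, timestamp in event_log:
        traces[case_id].append((timestamp, activity))
    counts = Counter()
    for events in traces.values():
        # Order each case by timestamp, then pair consecutive activities.
        ordered = [activity for _, activity in sorted(events)]
        counts.update(zip(ordered, ordered[1:]))
    return counts

dfg = directly_follows(EVENT_LOG)
for (a, b), n in sorted(dfg.items()):
    print(f"{a} -> {b}: {n}")
```

The resulting counts form the edges of a directly-follows graph; dedicated process mining tools build far richer models on top of this kind of relation.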

Findings

Differences and commonalities between spreadsheets and process mining are described. Unlike process mining tools like ProM, spreadsheet programs cannot be used to discover processes, check compliance, analyze bottlenecks, animate event data, or provide operational process support. Pointers to existing process mining tools and their functionality are given.

Practical implications

Event logs and operational processes can be found everywhere and process mining techniques are not limited to specific application domains. Comparable to spreadsheet software widely used in finance, production, sales, education, and sports, process mining software can be used in a broad range of organizations.

Originality/value

The paper provides an original view on process mining by relating it to spreadsheets. The value of spreadsheet-like technology tailored toward the analysis of behavior rather than numbers is illustrated by the more than 20 commercial process mining tools available today and the growing adoption in a variety of application domains.

Details

Business Process Management Journal, vol. 24 no. 1
Type: Research Article
ISSN: 1463-7154

Open Access
Article
Publication date: 21 May 2021

Yue Huang, Hu Liu and Jing Pan

Abstract

Purpose

Identifying the frontiers of a specific research field is one of the most basic tasks in bibliometrics, and research published in leading conferences is crucial to the data mining research community, yet few studies have focused on it. The purpose of this study is to detect the intellectual structure of data mining based on conference papers.

Design/methodology/approach

This study takes as its sample papers from the nine top-ranked conferences in the data mining field according to Google Scholar Metrics. Based on paper counts, it first examines the annual volume of published documents and their distribution across conferences. Then, from a keyword perspective, CiteSpace was used to analyze the conference papers and identify the frontiers of data mining, focusing on keyword term frequency, keyword betweenness centrality, keyword clustering and burst keywords.
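To make two of these keyword measures concrete, the sketch below computes term frequency and betweenness centrality on a tiny keyword co-occurrence network. It is an illustration only and does not reproduce CiteSpace: the paper keyword lists are hypothetical, and the `betweenness` helper is a plain-Python version of Brandes' algorithm for unweighted, undirected graphs.

```python
from collections import Counter, defaultdict, deque
from itertools import combinations

# Hypothetical keyword lists, one per paper (invented for illustration).
PAPERS = [
    ["clustering", "classification"],
    ["classification", "recommendation"],
    ["clustering", "classification"],
]

# Term frequency: number of papers in which each keyword appears.
term_frequency = Counter(k for kws in PAPERS for k in set(kws))

# Co-occurrence network: two keywords are linked if they share a paper.
adjacency = defaultdict(set)
for kws in PAPERS:
    for a, b in combinations(sorted(set(kws)), 2):
        adjacency[a].add(b)
        adjacency[b].add(a)

def betweenness(adj):
    """Brandes' algorithm for unweighted, undirected graphs (unnormalized)."""
    bc = dict.fromkeys(adj, 0.0)
    for s in adj:
        stack, preds = [], {v: [] for v in adj}
        sigma = dict.fromkeys(adj, 0)
        sigma[s] = 1
        dist = dict.fromkeys(adj, -1)
        dist[s] = 0
        queue = deque([s])
        while queue:  # breadth-first search counting shortest paths
            v = queue.popleft()
            stack.append(v)
            for w in adj[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = dict.fromkeys(adj, 0.0)
        while stack:  # accumulate dependencies in reverse BFS order
            w = stack.pop()
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                bc[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: b / 2 for v, b in bc.items()}

centrality = betweenness(adjacency)
print(term_frequency.most_common())
print(centrality)
```

In this toy network, "classification" is the only keyword lying on a shortest path between two others, so it alone has non-zero betweenness. Clustering and burst detection are not covered by this sketch.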

Findings

The results showed that research interest in data mining followed a linear upward trend between 2007 and 2016. The frontier identification based on the conference papers showed that there were five research hotspots in data mining: clustering, classification, recommendation, social network analysis and community detection. The research content of the conference papers was also very rich.

Originality/value

This study detected the research frontier from leading data mining conference papers. Based on the keyword co-occurrence network, and using four dimensions (keyword term frequency, betweenness centrality, clustering analysis and burst analysis), this paper identified and analyzed the research frontiers of the data mining discipline from 2007 to 2016.

Details

International Journal of Crowd Science, vol. 5 no. 2
Type: Research Article
ISSN: 2398-7294

Open Access
Article
Publication date: 23 December 2022

Patrick Ajibade and Ndakasharwa Muchaonyerwa

Abstract

Purpose

This study aims to promote the need for advanced skills acquisition within the LIS and academic libraries. This study focuses on the importance of library management systems and the need for graduates to be equipped with analytics skills. Combined with basic data, text mining and analytics, knowledge classification and information audit skills would benefit libraries and improve resource allocation. In this big data era, the success of agile institutional libraries hinges on the ability to perform in-depth analytics of both data and text to generate useful insights for information literacy training and information governance.

Design/methodology/approach

This paper adopted a living-lab methodology to use existing technology to conduct a system analysis and LMS audit of an academic library of one of the highly ranked universities in the world. One of the benefits of this approach is the ability to apply technological innovation and tools to carry out research that is relevant to the context of LIS or other research fields such as management, education, humanities and social sciences. The techniques allow us to gain access to publicly available information through the system audits that were performed. The level of responsiveness of the online library was assessed, and basic information audits were conducted.

Findings

This study indicated skill gaps in LIS training and in academic libraries in response to fourth industrial technologies. This study discussed the role of skill acquisition and how it can foster data-driven library management operations. Hence, data mining, text mining and analytics are needed to probe the massive volumes of data housed in the various libraries’ repositories. This study, however, indicated that without retraining librarians or including analytics programming in the LIS curriculum, libraries would not be able to reap the benefits these techniques provide.

Research limitations/implications

This paper covered research within the general and academic libraries and the broader LIS fields. The same principles and concepts are very important for both public and private libraries with substantial usage and patrons.

Practical implications

This paper indicated that librarianship training must fill the gaps within the LIS training. This can be done by including data mining, data analytics, text mining and processing in the curriculum. These skills will enable new graduates to assist library managers in making informed decisions based on user-generated content (UGC), LMS system audits and information audits. Thus, this paper provided practical insights and suggested solutions for academic libraries to improve the agility of information services.

Social implications

The academic librarian can improve institutional and LMS management through insights that are generated from the user. This study indicated that libraries' UGC could serve as robust insights into library management.

Originality/value

This paper argued that librarians' expertise transcends information literacy and knowledge classification, and discussed the interweaving of LMS with data analytics, text mining and analysis as a way to improve resource efficiency and training.

Details

Library Hi Tech News, vol. 40 no. 4
Type: Research Article
ISSN: 0741-9058

Open Access
Article
Publication date: 3 July 2017

Rahila Umer, Teo Susnjak, Anuradha Mathrani and Suriadi Suriadi

Abstract

Purpose

The purpose of this paper is to propose a process mining approach to help in making early predictions to improve students’ learning experience in massive open online courses (MOOCs). It investigates the impact of various machine learning techniques in combination with process mining features to measure effectiveness of these techniques.

Design/methodology/approach

Students’ data (e.g. assessment grades, demographic information) and weekly interaction data based on event logs (e.g. video lecture interaction, solution submission time, time spent weekly) have guided this design. This study evaluates four machine learning classification techniques used in the literature (logistic regression (LR), Naïve Bayes (NB), random forest (RF) and K-nearest neighbor) to monitor weekly progression of students’ performance and to predict their overall performance outcome. Two data sets have been used: one with traditional features and a second with features obtained from process conformance testing.

Findings

The results show that the techniques used in the study are able to make predictions on the performance of students. Overall accuracy (F1-score, area under curve) of machine learning techniques can be improved by integrating process mining features with standard features. Specifically, the LR and NB classifiers outperform the other techniques in a statistically significant way.

Practical implications

Although MOOCs provide a platform for learning in a highly scalable and flexible manner, they are prone to early dropout and low completion rates. This study outlines a data-driven approach to improve students’ learning experience and decrease the dropout rate.

Social implications

Early predictions based on individual’s participation can help educators provide support to students who are struggling in the course.

Originality/value

This study outlines the innovative use of process mining techniques in education data mining to help educators gather data-driven insight on student performances in the enrolled courses.

Details

Journal of Research in Innovative Teaching & Learning, vol. 10 no. 2
Type: Research Article
ISSN: 2397-7604

Open Access
Article
Publication date: 11 June 2024

Julian Rott, Markus Böhm and Helmut Krcmar

Abstract

Purpose

Process mining (PM) has emerged as a leading technology for gaining data-based insights into organizations’ business processes. As processes increasingly cross organizational boundaries, firms need to conduct PM jointly with multiple organizations to optimize their operations. However, current knowledge on cross-organizational process mining (coPM) is widely dispersed. Therefore, we synthesize current knowledge on coPM, identify challenges and enablers of coPM, and build a socio-technical framework and agenda for future research.

Design/methodology/approach

We conducted a literature review of 66 articles and summarized the findings according to the framework for Information Technology (IT)-enabled inter-organizational coordination (IOC) and the refined PM framework. The former states that within inter-organizational relationships, uncertainty sources determine information processing needs and coordination mechanisms determine information processing capabilities, while the fit between needs and capabilities determines the relationships’ performance. The latter distinguishes three categories of PM activities: cartography, auditing and navigation.

Findings

Past literature focused on coPM techniques, for example, algorithms for ensuring privacy and PM for cartography. Future research should focus on socio-technical aspects and follow four steps: First, determine uncertainty sources within coPM. Second, design, develop and evaluate coordination mechanisms. Third, investigate how the mechanisms assist with handling uncertainty. Fourth, analyze the impact on coPM performance. In addition, we present 18 challenges (e.g. integrating distributed data) and 9 enablers (e.g. aligning different strategies) for coPM application.

Originality/value

This is the first article to systematically investigate the status quo of coPM research and lay out a socio-technical research agenda building upon the well-established framework for IT-enabled IOC.

Details

Business Process Management Journal, vol. 30 no. 8
Type: Research Article
ISSN: 1463-7154

Open Access
Article
Publication date: 9 December 2019

Zhiwen Pan, Jiangtian Li, Yiqiang Chen, Jesus Pacheco, Lianjun Dai and Jun Zhang

Abstract

Purpose

The General Society Survey (GSS) is a kind of government-funded survey which aims at examining the socio-economic status, quality of life, and structure of contemporary society. The GSS data set is regarded as one of the authoritative sources for government and organization practitioners to make data-driven policies. Previous analytic approaches for the GSS data set were designed by combining expert knowledge and simple statistics. By utilizing emerging data mining algorithms, we propose a comprehensive data management and data mining approach for GSS data sets.

Design/methodology/approach

The approach is designed to operate in two phases: a data management phase, which improves the quality of GSS data by performing attribute pre-processing and filter-based attribute selection; and a data mining phase, which extracts hidden knowledge from the data set by performing prediction analysis, classification analysis, association analysis and clustering analysis.
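As one possible reading of filter-based attribute selection, the sketch below ranks attributes by information gain with respect to a target attribute and keeps the top k. The abstract does not say which filter criterion the authors used, so information gain is an assumption here, and the survey records, attribute names and helper functions are all hypothetical.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, attribute, target):
    """Reduction in target entropy after splitting the rows on one attribute."""
    base = entropy([row[target] for row in rows])
    groups = {}
    for row in rows:
        groups.setdefault(row[attribute], []).append(row[target])
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return base - remainder

def select_attributes(rows, target, k):
    """Filter step: keep the k attributes with the highest information gain."""
    candidates = [a for a in rows[0] if a != target]
    ranked = sorted(candidates, key=lambda a: information_gain(rows, a, target), reverse=True)
    return ranked[:k]

# Hypothetical survey records (invented for illustration).
ROWS = [
    {"education": "high", "region": "north", "satisfied": "yes"},
    {"education": "high", "region": "south", "satisfied": "yes"},
    {"education": "low", "region": "north", "satisfied": "no"},
    {"education": "low", "region": "south", "satisfied": "no"},
]

selected = select_attributes(ROWS, "satisfied", k=1)
print(selected)
```

A filter of this kind scores each attribute independently of any downstream model, which is what distinguishes it from wrapper-style selection.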

Findings

According to the experimental evaluation results, the paper has the following findings: performing attribute selection on the GSS data set can increase the performance of both classification analysis and clustering analysis; all the data mining analyses can effectively extract hidden knowledge from the GSS data set; and the knowledge generated by the different data mining analyses can, to some extent, cross-validate each other.

Originality/value

By leveraging the power of data mining techniques, the proposed approach can explore knowledge in a fine-grained manner with minimum human interference. Experiments on the Chinese General Social Survey data set are conducted to evaluate the performance of the approach.

Details

International Journal of Crowd Science, vol. 3 no. 3
Type: Research Article
ISSN: 2398-7294

Open Access
Article
Publication date: 28 July 2023

Jeremias De Klerk and Bernard Swart

Abstract

Background: Amid increasing leadership failures in the global business context, the mining industry is one of the industries with many adverse incidents, affecting employee safety, the environment, and surrounding communities. Emerging economies tend to have unique socio-economic challenges and greater relative economic dependence on mining, presenting unique challenges to leaders. The purpose of this research was to study the realities of responsible leadership in the mining industry in an emerging economy.

Methods: A qualitative research study, consisting of semi-structured interviews was conducted. Nine senior mine managers were selected to represent perspectives from different operations and mining houses. Data was gathered from August to October 2020 in South Africa, an emerging economy with significant mining operations. A thematic analysis of interview transcripts was conducted through the use of software, rendering five themes, with 12 sub-themes.

Results: The research found that requirements on mining leaders in emerging economies demand consistent balancing of a complex set of competing risks, whilst attending to paradoxical requirements among operations, and internal and external stakeholders. Leaders face several competing requirements from stakeholders, the environment, mining practices, and time frames. Responsible leaders must navigate a paradoxical maze of needs and time horizons, with several conflicting forces and dilemmas, and dichotomous relationships. Responsible leadership in the mining industry of an emerging economy is a proverbial minefield of paradoxes and dilemmas between responsible intentions and practical realities. These paradoxes and dilemmas are particularly acute in the context of emerging economies due to the dire socio-economic situations. A total of 10 competencies emerged as essential responsible leadership requirements in this context.

Conclusions: The study provides an in-depth understanding of the intricacies of responsible leadership in the mining industry of an emerging economy. This understanding will contribute to capacitating leaders in the mining industries of emerging economies to act responsibly.

Details

Emerald Open Research, vol. 1 no. 11
Type: Research Article
ISSN: 2631-3952

Open Access
Article
Publication date: 25 October 2023

Christian Novak, Lukas Pfahlsberger, Saimir Bala, Kate Revoredo and Jan Mendling

Abstract

Purpose

Digitalization, innovation and changing customer requirements drive the continuous improvement of an organization's business processes. IT demand management (ITDM) as a methodology supports the holistic governance of IT and the corresponding business process change (BPC), by allocating resources to meet a company's requirements and strategic objectives. As ITDM decision-makers are not fully aware of how the as-is business processes operate and interact, making informed decisions that positively impact the to-be process is a key challenge.

Design/methodology/approach

In this paper, the authors address this challenge by developing a novel approach that integrates process mining and ITDM. To this end, the authors conduct an action research study where the researchers participated in the design, creation and evaluation of the approach. The proposed approach is illustrated using two sample demands of an insurance claims process. These demands are used to construct the artefact in multiple research cycles and to validate the approach in practice. The authors applied learning and reflection methods for incrementally adjusting this study’s approach.

Findings

The study shows that the utilization of process mining activities during process changes on an operational level contributes to (1) increasing accuracy and efficiency of ITDM; (2) timely identification of potential risks and dependencies and (3) support of testing and acceptance of IT demands.

Originality/value

The implementation of this study’s approach improved ITDM practice. It appropriately addressed the information needs of decision-makers and unveiled the effects and consequences of process changes. Furthermore, providing a clearer picture of the process dependencies clarified the responsibilities and the interfaces at the intra- and inter-process level.

Details

Business Process Management Journal, vol. 29 no. 8
Type: Research Article
ISSN: 1463-7154

Open Access
Article
Publication date: 3 April 2023

Kateryna Kubrak, Fredrik Milani and Alexander Nolte

Abstract

Purpose

When improving business processes, process analysts can use data-driven methods, such as process mining, to identify improvement opportunities. However, despite being supported by data, process analysts decide which changes to implement. Analysts often use process visualisations to assess and determine which changes to pursue. This paper helps explore how process mining visualisations can aid process analysts in their work to identify, prioritise and communicate business process improvement opportunities.

Design/methodology/approach

The study follows the design science methodology to create and evaluate an artefact for visualising identified improvement opportunities (IRVIN).

Findings

A set of principles to facilitate the visualisation of process mining outputs for analysts to work with improvement opportunities was suggested. Particularly, insights into identifying, prioritising and communicating process improvement opportunities from visual representation are outlined.

Originality/value

Prior work focuses on visualisation from the perspectives – among others – of process exploration, process comparison and performance analysis. This study, however, considers process mining visualisation that aids in analysing process improvement opportunities.

Details

Business Process Management Journal, vol. 29 no. 8
Type: Research Article
ISSN: 1463-7154

Open Access
Article
Publication date: 19 August 2022

Bedour M. Alshammari, Fairouz Aldhmour, Zainab M. AlQenaei and Haidar Almohri

Abstract

Purpose

There is a gap in knowledge about the Gulf Cooperation Council (GCC) because most studies are undertaken in countries outside the Gulf region – such as China, India, the US and Taiwan. The stock market contains rich, valuable and considerable data, and these data need careful analysis for good decisions to be made that can lead to increases in the efficiency of a business. Data mining techniques offer data processing tools and applications used to enhance decision-maker decisions. This study aims to predict the Kuwait stock market by applying big data mining.

Design/methodology/approach

The methodology is quantitative, relying on mathematical and statistical models that describe a wide array of relationships among variables. Four techniques were implemented to predict the direction of stock market returns: logistic regression, decision trees, support vector machine and random forest.

Findings

The results show that all variables are statistically significant at the 5% level except gold price and oil price. Also, the variables that do not influence the direction of the rate of return of Boursa Kuwait are money supply and gold price, unlike the Kuwait index, which has the highest coefficient. Furthermore, the variable with the highest score for affecting the direction of the rate of return is the firms, and the overall accuracy of the four models is nearly 50%.

Research limitations/implications

Some of the limitations identified for this study are as follows: (1) location limitation: Kuwait Stock Exchange; (2) time limitation: the study was completed within the academic years 2019-2020 and 2020-2021, and the coronavirus pandemic (COVID-19) in 2020 was a major obstacle during data collection and analysis; (3) data limitation: the Kuwait Stock Exchange data were collected from May 2019 to March 2020, while the data on factors affecting the stock exchange were collected in July 2020 due to the pandemic.

Originality/value

The study used new titles, variables and techniques, such as data mining to predict the Kuwait stock market. There are no adequate studies that predict the stock market using data mining in the GCC, especially in Kuwait. There is a gap in knowledge in the GCC as most studies are conducted in other countries, such as China, India, the US and Taiwan.

Details

Arab Gulf Journal of Scientific Research, vol. 40 no. 2
Type: Research Article
ISSN: 1985-9899
