Search results

1 – 10 of over 23000
Article
Publication date: 26 September 2023

Mohammed Ayoub Ledhem and Warda Moussaoui

This paper aims to apply several data mining techniques for predicting the daily precision improvement of Jakarta Islamic Index (JKII) prices based on big data of symmetric…

Abstract

Purpose

This paper aims to apply several data mining techniques for predicting the daily precision improvement of Jakarta Islamic Index (JKII) prices based on big data of symmetric volatility in Indonesia’s Islamic stock market.

Design/methodology/approach

This research uses big data mining techniques to predict daily precision improvement of JKII prices by applying the AdaBoost, K-nearest neighbor, random forest and artificial neural networks. This research uses big data with symmetric volatility as inputs in the predicting model, whereas the closing prices of JKII were used as the target outputs of daily precision improvement. For choosing the optimal prediction performance according to the criteria of the lowest prediction errors, this research uses four metrics of mean absolute error, mean squared error, root mean squared error and R-squared.

Findings

The experimental results determine that the optimal technique for predicting the daily precision improvement of the JKII prices in Indonesia’s Islamic stock market is the AdaBoost technique, which generates the optimal predicting performance with the lowest prediction errors, and provides the optimum knowledge from the big data of symmetric volatility in Indonesia’s Islamic stock market. In addition, the random forest technique is also considered another robust technique in predicting the daily precision improvement of the JKII prices as it delivers closer values to the optimal performance of the AdaBoost technique.

Practical implications

This research is filling the literature gap of the absence of using big data mining techniques in the prediction process of Islamic stock markets by delivering new operational techniques for predicting the daily stock precision improvement. Also, it helps investors to manage the optimal portfolios and to decrease the risk of trading in global Islamic stock markets based on using big data mining of symmetric volatility.

Originality/value

This research is a pioneer in using big data mining of symmetric volatility in the prediction of an Islamic stock market index.

Details

Journal of Modelling in Management, vol. 19 no. 3
Type: Research Article
ISSN: 1746-5664

Keywords

Article
Publication date: 7 August 2017

Wei-Chao Lin, Shih-Wen Ke and Chih-Fong Tsai

Data mining is widely considered necessary in many business applications for effective decision-making. The importance of business data mining is reflected by the existence of…

1900

Abstract

Purpose

Data mining is widely considered necessary in many business applications for effective decision-making. The importance of business data mining is reflected by the existence of numerous surveys in the literature focusing on the investigation of related works using data mining techniques for solving specific business problems. The purpose of this paper is to answer the following question: What are the widely used data mining techniques in business applications?

Design/methodology/approach

The aim of this paper is to examine related surveys in the literature and thus to identify the frequently applied data mining techniques. To ensure the recent relevance and quality of the conclusions, the criterion for selecting related studies are that the works be published in reputed journals within the past 10 years.

Findings

There are 33 different data mining techniques employed in eight different application areas. Most of them are supervised learning techniques and the application area where such techniques are most often seen is bankruptcy prediction, followed by the areas of customer relationship management, fraud detection, intrusion detection and recommender systems. Furthermore, the widely used ten data mining techniques for business applications are the decision tree (including C4.5 decision tree and classification and regression tree), genetic algorithm, k-nearest neighbor, multilayer perceptron neural network, naïve Bayes and support vector machine as the supervised learning techniques and association rule, expectation maximization and k-means as the unsupervised learning techniques.

Originality/value

The originality of this paper is to survey the recent 10 years of related survey and review articles about data mining in business applications to identify the most popular techniques.

Details

Kybernetes, vol. 46 no. 7
Type: Research Article
ISSN: 0368-492X

Keywords

Article
Publication date: 1 July 2021

Mohammed Ayoub Ledhem

The purpose of this paper is to apply various data mining techniques for predicting the financial performance of Islamic banking in Indonesia through the main exogenous…

Abstract

Purpose

The purpose of this paper is to apply various data mining techniques for predicting the financial performance of Islamic banking in Indonesia through the main exogenous determinants of profitability by choosing the best data mining technique based on the criteria of the highest accuracy score of testing and training.

Design/methodology/approach

This paper used data mining techniques to predict the financial performance of Islamic banking by applying all of LASSO regression, random forest (RF), artificial neural networks and k-nearest neighbor (KNN) over monthly data sets of all the full-fledged Islamic banks working in Indonesia from January 2011 until March 2020. This study used return on assets as a real measurement of financial performance, whereas the capital adequacy ratio, asset quality and liquidity management were used as exogenous determinants of financial performance.

Findings

The experimental results showed that the optimal task for predicting the financial performance of Islamic banking in Indonesia is the KNN technique, which affords the best-predicting accuracy, and gives the optimal knowledge from the financial performance of Islamic banking determinants in Indonesia. As well, the RF provides closer values to the optimal accuracy of the KNN, which makes it another robust technique in predicting the financial performance of Islamic banking.

Research limitations/implications

This paper restricted modeling the financial performance of Islamic banking to profitability through the main determinants of return of assets in Indonesia. Future research could consider enlarging the modeling of financial performance using other models such as CAMELS and Z-Score to predict the financial performance of Islamic banking under data mining techniques.

Practical implications

Owing to the lack of using data mining techniques in the Islamic banking sector, this paper would fill the literature gap by providing new effective techniques for predicting financial performance in the Islamic banking sector using data mining approaches, which can be efficient tools in business and management modeling for financial researchers and decision-makers in the Islamic banking sector.

Originality/value

According to the author’s knowledge, this paper is the first that provides data mining techniques for predicting the financial performance of the Islamic banking sector in Indonesia.

Details

Journal of Modelling in Management, vol. 17 no. 3
Type: Research Article
ISSN: 1746-5664

Keywords

Article
Publication date: 21 December 2020

Sudha Cheerkoot-Jalim and Kavi Kumar Khedo

This work shows the results of a systematic literature review on biomedical text mining. The purpose of this study is to identify the different text mining approaches used in…

Abstract

Purpose

This work shows the results of a systematic literature review on biomedical text mining. The purpose of this study is to identify the different text mining approaches used in different application areas of the biomedical domain, the common tools used and the challenges of biomedical text mining as compared to generic text mining algorithms. This study will be of value to biomedical researchers by allowing them to correlate text mining approaches to specific biomedical application areas. Implications for future research are also discussed.

Design/methodology/approach

The review was conducted following the principles of the Kitchenham method. A number of research questions were first formulated, followed by the definition of the search strategy. The papers were then selected based on a list of assessment criteria. Each of the papers were analyzed and information relevant to the research questions were extracted.

Findings

It was found that researchers have mostly harnessed data sources such as electronic health records, biomedical literature, social media and health-related forums. The most common text mining technique was natural language processing using tools such as MetaMap and Unstructured Information Management Architecture, alongside the use of medical terminologies such as Unified Medical Language System. The main application area was the detection of adverse drug events. Challenges identified included the need to deal with huge amounts of text, the heterogeneity of the different data sources, the duality of meaning of words in biomedical text and the amount of noise introduced mainly from social media and health-related forums.

Originality/value

To the best of the authors’ knowledge, other reviews in this area have focused on either specific techniques, specific application areas or specific data sources. The results of this review will help researchers to correlate most relevant and recent advances in text mining approaches to specific biomedical application areas by providing an up-to-date and holistic view of work done in this research area. The use of emerging text mining techniques has great potential to spur the development of innovative applications, thus considerably impacting on the advancement of biomedical research.

Details

Journal of Knowledge Management, vol. 25 no. 3
Type: Research Article
ISSN: 1367-3270

Keywords

Article
Publication date: 20 October 2021

Sumeer Gul, Shohar Bano and Taseen Shah

Data mining along with its varied technologies like numerical mining, textual mining, multimedia mining, web mining, sentiment analysis and big data mining proves itself as an…

1002

Abstract

Purpose

Data mining along with its varied technologies like numerical mining, textual mining, multimedia mining, web mining, sentiment analysis and big data mining proves itself as an emerging field and manifests itself in the form of different techniques such as information mining; big data mining; big data mining and Internet of Things (IoT); and educational data mining. This paper aims to discuss how these technologies and techniques are used to derive information and, eventually, knowledge from data.

Design/methodology/approach

An extensive review of literature on data mining and its allied techniques was carried to ascertain the emerging procedures and techniques in the domain of data mining. Clarivate Analytic’s Web of Science and Sciverse Scopus were explored to discover the extent of literature published on Data Mining and its varied facets. Literature was searched against various keywords such as data mining; information mining; big data; big data and IoT; and educational data mining. Further, the works citing the literature on data mining were also explored to visualize a broad gamut of emerging techniques about this growing field.

Findings

The study validates that knowledge discovery in databases has rendered data mining as an emerging field; the data present in these databases paves the way for data mining techniques and analytics. This paper provides a unique view about the usage of data, and logical patterns derived from it, how new procedures, algorithms and mining techniques are being continuously upgraded for their multipurpose use for the betterment of human life and experiences.

Practical implications

The paper highlights different aspects of data mining, its different technological approaches, and how these emerging data technologies are used to derive logical insights from data and make data more meaningful.

Originality/value

The paper tries to highlight the current trends and facets of data mining.

Details

Digital Library Perspectives, vol. 37 no. 4
Type: Research Article
ISSN: 2059-5816

Keywords

Article
Publication date: 2 November 2015

Ana Rocío Cárdenas Maita, Lucas Corrêa Martins, Carlos Ramón López Paz, Sarajane Marques Peres and Marcelo Fantinato

Process mining is a research area used to discover, monitor and improve real business processes by extracting knowledge from event logs available in process-aware information…

4050

Abstract

Purpose

Process mining is a research area used to discover, monitor and improve real business processes by extracting knowledge from event logs available in process-aware information systems. The purpose of this paper is to evaluate the application of artificial neural networks (ANNs) and support vector machines (SVMs) in data mining tasks in the process mining context. The goal was to understand how these computational intelligence techniques are currently being applied in process mining.

Design/methodology/approach

The authors conducted a systematic literature review with three research questions formulated to evaluate the use of ANNs and SVMs in process mining.

Findings

The authors identified 11 papers as primary studies according to the criteria established in the review protocol. Most of them deal with process mining enhancement, mainly using ANNs. Regarding the data mining task, the authors identified three types of tasks used: categorical prediction (or classification); numeric prediction, considering the “regression” type, and clustering analysis.

Originality/value

Although there is scientific interest in process mining, little attention has been specifically given to ANNs and SVM. This scenario does not reflect the general context of data mining, where these two techniques are widely used. This low use may be possibly due to a relative lack of knowledge about their potential for this type of problem, which the authors seek to reverse with the completion of this study.

Details

Business Process Management Journal, vol. 21 no. 6
Type: Research Article
ISSN: 1463-7154

Keywords

Article
Publication date: 1 February 2001

Sang Jun Lee and Keng Siau

Terabytes of data are generated everyday in many organizations. To extract hidden predictive information from large volumes of data, data mining (DM) techniques are needed…

15181

Abstract

Terabytes of data are generated everyday in many organizations. To extract hidden predictive information from large volumes of data, data mining (DM) techniques are needed. Organizations are starting to realize the importance of data mining in their strategic planning and successful application of DM techniques can be an enormous payoff for the organizations. This paper discusses the requirements and challenges of DM, and describes major DM techniques such as statistics, artificial intelligence, decision tree approach, genetic algorithm, and visualization.

Details

Industrial Management & Data Systems, vol. 101 no. 1
Type: Research Article
ISSN: 0263-5577

Keywords

Article
Publication date: 1 April 2004

Hian Chye Koh and Chan Kee Low

Going concern is a fundamental concept in accounting and auditing and the assessment of a firm's going concern status is not an easy task. Several going concern prediction models…

5763

Abstract

Going concern is a fundamental concept in accounting and auditing and the assessment of a firm's going concern status is not an easy task. Several going concern prediction models based on statistical methods to assist auditors have been suggested in the literature. This study explores and compares the usefulness of neural networks, decision trees and logistic regression in predicting a firm's going concern status. The sample data comprise financial ratios for 165 going concerns and 165 matched non‐going concerns. The classification results indicate the potential usefulness of data mining techniques in a going concern prediction context. Further, the decision tree going concern prediction model outperforms the logistic regression and neural network models. Data mining techniques such as neural networks and decision trees are powerful for analysing complex non‐linear and interaction relationships, and hence can supplement and complement traditional statistical methods in constructing going concern prediction models.

Details

Managerial Auditing Journal, vol. 19 no. 3
Type: Research Article
ISSN: 0268-6902

Keywords

Open Access
Article
Publication date: 3 July 2017

Rahila Umer, Teo Susnjak, Anuradha Mathrani and Suriadi Suriadi

The purpose of this paper is to propose a process mining approach to help in making early predictions to improve students’ learning experience in massive open online courses…

6230

Abstract

Purpose

The purpose of this paper is to propose a process mining approach to help in making early predictions to improve students’ learning experience in massive open online courses (MOOCs). It investigates the impact of various machine learning techniques in combination with process mining features to measure effectiveness of these techniques.

Design/methodology/approach

Student’s data (e.g. assessment grades, demographic information) and weekly interaction data based on event logs (e.g. video lecture interaction, solution submission time, time spent weekly) have guided this design. This study evaluates four machine learning classification techniques used in the literature (logistic regression (LR), Naïve Bayes (NB), random forest (RF) and K-nearest neighbor) to monitor weekly progression of students’ performance and to predict their overall performance outcome. Two data sets – one, with traditional features and second, with features obtained from process conformance testing – have been used.

Findings

The results show that techniques used in the study are able to make predictions on the performance of students. Overall accuracy (F1-score, area under curve) of machine learning techniques can be improved by integrating process mining features with standard features. Specifically, the use of LR and NB classifiers outperforms other techniques in a statistical significant way.

Practical implications

Although MOOCs provide a platform for learning in highly scalable and flexible manner, they are prone to early dropout and low completion rate. This study outlines a data-driven approach to improve students’ learning experience and decrease the dropout rate.

Social implications

Early predictions based on individual’s participation can help educators provide support to students who are struggling in the course.

Originality/value

This study outlines the innovative use of process mining techniques in education data mining to help educators gather data-driven insight on student performances in the enrolled courses.

Details

Journal of Research in Innovative Teaching & Learning, vol. 10 no. 2
Type: Research Article
ISSN: 2397-7604

Keywords

Article
Publication date: 6 February 2009

Jayanthi Ranjan

The amount of data getting generated in any sector at present is enormous. The information flow in the pharma industry is huge. Pharma firms are progressing into increased…

2735

Abstract

Purpose

The amount of data getting generated in any sector at present is enormous. The information flow in the pharma industry is huge. Pharma firms are progressing into increased technology‐enabled products and services. Data mining, which is knowledge discovery from large sets of data, helps pharma firms to discover patterns in improving the quality of drug discovery and delivery methods. The paper aims to present how data mining is useful in the pharma industry, how its techniques can yield good results in pharma sector, and to show how data mining can really enhance in making decisions using pharmaceutical data.

Design/methodology/approach

This conceptual paper is written based on secondary study, research and observations from magazines, reports and notes. The author has listed the types of patterns that can be discovered using data mining in pharma data.

Findings

The paper shows how data mining is useful in the pharma industry and how its techniques can yield good results in pharma sector.

Research limitations/implications

Although much work can be produced for discovering knowledge in pharma data using data mining, the paper is limited to conceptualizing the ideas and view points at this stage; future work may include applying data mining techniques to pharma data based on primary research using the available, famous significant data mining tools.

Originality/value

Research papers and conceptual papers related to data mining in Pharma industry are rare; this is the motivation for the paper.

Details

International Journal of Health Care Quality Assurance, vol. 22 no. 1
Type: Research Article
ISSN: 0952-6862

Keywords

1 – 10 of over 23000