Search results

1 – 10 of over 11000
Article
Publication date: 20 February 2009

Che‐Chern Lin, Hung‐Jen Yang and Lung‐Hsing Kuo

The purpose of this paper is to explore teachers' behaviours in completing an internet survey using decision trees. Furthermore, to reduce the complexity of the decision trees, a…

1404

Abstract

Purpose

The purpose of this paper is to explore teachers' behaviours in completing an internet survey using decision trees. Furthermore, to reduce the complexity of the decision trees, a statistical technique was used to decrease the number of input variables in the decision trees.

Design/methodology/approach

A dataset of 47,647 samples was used to build the decision trees. These samples were collected from an internet survey of teachers in Taiwan. The output of the decision trees was the answering time (the time taken to complete the internet questionnaire). Eight variables were selected as the inputs for the decision trees. Two techniques were employed to build the decision trees – the exhaustive chi‐squared automatic interaction detector (ECHAID) and classification and regression tree (CRT) analysis. To reduce the complexity of the decision models, factor analysis technique was used to decrease the data dimensions (number of input variables) and to obtain a simplified decision model. One‐way ANOVA was used to validate the effects of the dimension reduction.

Findings

From the results of the factor analysis, a simplified decision tree is recommended using four input variables – teaching years, school level, sex and area. The classification accuracy of the simplified model is statistically equivalent to that of the original one, which used eight input variables.

Originality/value

The complexity of decision trees theoretically depends on the number of input variables. This study used a statistical technique to decrease the number of input variables and thereby reduce the complexity of the decision trees. A statistical technique was employed to validate that the classification accuracy is not statistically different between the original decision model and the simplified one. The decision models proposed in this paper can be applied in estimating the answering time for completing a questionnaire during an internet survey.

Details

Online Information Review, vol. 33 no. 1
Type: Research Article
ISSN: 1468-4527

Keywords

Article
Publication date: 19 February 2020

Shashidhar Kaparthi and Daniel Bumblauskas

The after-sale service industry is estimated to contribute over 8 percent to the US GDP. For use in this considerably large service management industry, this article provides…

2647

Abstract

Purpose

The after-sale service industry is estimated to contribute over 8 percent to the US GDP. For use in this considerably large service management industry, this article provides verification in the application of decision tree-based machine learning algorithms for optimal maintenance decision-making. The motivation for this research arose from discussions held with a large agricultural equipment manufacturing company interested in increasing the uptime of their expensive machinery and in helping their dealer network.

Design/methodology/approach

We propose a general strategy for the design of predictive maintenance systems using machine learning techniques. Then, we present a case study where multiple machine learning algorithms are applied to a particular example situation for an illustration of the proposed strategy and evaluation of its performance.

Findings

We found progressive improvements using such machine learning techniques in terms of accuracy in predictions of failure, demonstrating that the proposed strategy is successful.

Research limitations/implications

This approach is scalable to a wide variety of applications to aid in failure prediction. These approaches are generalizable to many systems irrespective of the underlying physics. Even though we focus on decision tree-based machine learning techniques in this study, the general design strategy proposed can be used with all other supervised learning techniques like neural networks, boosting algorithms, support vector machines, and statistical methods.

Practical implications

This approach is applicable to many different types of systems that require maintenance and repair decision-making. A case is provided for a cloud data storage provider. The methods described in the case can be used in any number of systems and industrial applications, making this a very scalable case for industry practitioners. This scalability is possible as the machine learning techniques learn the correspondence between machine conditions and outcome state irrespective of the underlying physics governing the systems.

Social implications

Sustainable systems and operations require allocating and utilizing resources efficiently and effectively. This approach can help asset managers decide how to sustainably allocate resources by increasing uptime and utilization for expensive equipment.

Originality/value

This is a novel application and case study for decision tree-based machine learning that will aid researchers in developing tools and techniques in this area as well as those working in the artificial intelligence and service management space.

Details

International Journal of Quality & Reliability Management, vol. 37 no. 4
Type: Research Article
ISSN: 0265-671X

Keywords

Article
Publication date: 1 April 2004

Hian Chye Koh and Chan Kee Low

Going concern is a fundamental concept in accounting and auditing and the assessment of a firm's going concern status is not an easy task. Several going concern prediction models…

5763

Abstract

Going concern is a fundamental concept in accounting and auditing and the assessment of a firm's going concern status is not an easy task. Several going concern prediction models based on statistical methods to assist auditors have been suggested in the literature. This study explores and compares the usefulness of neural networks, decision trees and logistic regression in predicting a firm's going concern status. The sample data comprise financial ratios for 165 going concerns and 165 matched non‐going concerns. The classification results indicate the potential usefulness of data mining techniques in a going concern prediction context. Further, the decision tree going concern prediction model outperforms the logistic regression and neural network models. Data mining techniques such as neural networks and decision trees are powerful for analysing complex non‐linear and interaction relationships, and hence can supplement and complement traditional statistical methods in constructing going concern prediction models.

Details

Managerial Auditing Journal, vol. 19 no. 3
Type: Research Article
ISSN: 0268-6902

Keywords

Article
Publication date: 15 June 2021

Manogna R.L. and Aswini Kumar Mishra

Determining the relevant information using financial measures is of great interest for various stakeholders to analyze the performance of the firm. This paper aims at identifying…

Abstract

Purpose

Determining the relevant information using financial measures is of great interest for various stakeholders to analyze the performance of the firm. This paper aims at identifying these financial measures (ratios) which critically affect the firm performance. The authors specifically focus on discovering the most prominent ratios using a two-step process. First, the authors use an exploratory factor analysis to identify the underlying dimensions of these ratios, followed by predictive modeling techniques to identify the potential relationship between measures and performance.

Design/methodology/approach

The study uses data of 25 financial variables for a sample of 1923 Indian manufacturing firms which exist continuously between 2011 and 2018. For prediction models, four popular decision tree algorithms [Chi-squared automatic interaction detector (CHAID), classification and regression trees (C&RT), C5.0 and quick, unbiased, efficient statistical tree (QUEST)] were investigated, and the information fusion-based sensitivity analyses were performed to identify the relative importance of these input measures.

Findings

Results show that C5.0 and CHAID algorithms produced the best predictive results. The fusion sensitivity results find that net profit margin and total assets turnover rate are the most critical factors determining the firm performance in an Indian manufacturing context. These findings may enable managers in their decision-making process and also have vital implications for investors in assessing the performance of the firm.

Originality/value

To the best of the authors’ knowledge, the current paper is the first to address the application of decision tree algorithms to predict the performance of manufacturing firms in an emerging economy such as India, with the latest data. This practical perspective helps the organizations in managing the critical parameters for the firm’s growth.

Details

Measuring Business Excellence, vol. 26 no. 3
Type: Research Article
ISSN: 1368-3047

Keywords

Open Access
Article
Publication date: 19 August 2022

Bedour M. Alshammari, Fairouz Aldhmour, Zainab M. AlQenaei and Haidar Almohri

There is a gap in knowledge about the Gulf Cooperation Council (GCC) because most studies are undertaken in countries outside the Gulf region – such as China, India, the US and…

4638

Abstract

Purpose

There is a gap in knowledge about the Gulf Cooperation Council (GCC) because most studies are undertaken in countries outside the Gulf region – such as China, India, the US and Taiwan. The stock market contains rich, valuable and considerable data, and these data need careful analysis for good decisions to be made that can lead to increases in the efficiency of a business. Data mining techniques offer data processing tools and applications used to enhance decision-maker decisions. This study aims to predict the Kuwait stock market by applying big data mining.

Design/methodology/approach

The methodology used is quantitative techniques, which are mathematical and statistical models that describe a various array of the relationships of variables. Quantitative methods used to predict the direction of the stock market returns by using four techniques were implemented: logistic regression, decision trees, support vector machine and random forest.

Findings

The results are all variables statistically significant at the 5% level except gold price and oil price. Also, the variables that do not have an influence on the direction of the rate of return of Boursa Kuwait are money supply and gold price, unlike the Kuwait index, which has the highest coefficient. Furthermore, the height score of the variable that affects the direction of the rate of return is the firms, and the accuracy of the overall performance of the four models is nearly 50%.

Research limitations/implications

Some of the limitations identified for this study are as follows: (1) location limitation: Kuwait Stock Exchange; (2) time limitation: the amount of time available to accomplish the study, where the period was completed within the academic year 2019-2020 and the academic year 2020-2021. During 2020, the coronavirus pandemic (COVID-19), which was a major obstacle, occurred during data collection and analysis; (3) data limitation: The Kuwait Stock Exchange data were collected from May 2019 to March 2020, while the factors affecting the stock exchange data were collected in July 2020 due to the corona pandemic.

Originality/value

The study used new titles, variables and techniques such as using data mining to predict the Kuwait stock market. There are no adequate studies that predict the stock market by data mining in the GCC, especially in Kuwait. There is a gap in knowledge in the GCC as most studies are in foreign countries, such as China, India, the US and Taiwan.

Details

Arab Gulf Journal of Scientific Research, vol. 40 no. 2
Type: Research Article
ISSN: 1985-9899

Keywords

Article
Publication date: 21 December 2023

Majid Rahi, Ali Ebrahimnejad and Homayun Motameni

Taking into consideration the current human need for agricultural produce such as rice that requires water for growth, the optimal consumption of this valuable liquid is…

Abstract

Purpose

Taking into consideration the current human need for agricultural produce such as rice that requires water for growth, the optimal consumption of this valuable liquid is important. Unfortunately, the traditional use of water by humans for agricultural purposes contradicts the concept of optimal consumption. Therefore, designing and implementing a mechanized irrigation system is of the highest importance. This system includes hardware equipment such as liquid altimeter sensors, valves and pumps which have a failure phenomenon as an integral part, causing faults in the system. Naturally, these faults occur at probable time intervals, and the probability function with exponential distribution is used to simulate this interval. Thus, before the implementation of such high-cost systems, its evaluation is essential during the design phase.

Design/methodology/approach

The proposed approach included two main steps: offline and online. The offline phase included the simulation of the studied system (i.e. the irrigation system of paddy fields) and the acquisition of a data set for training machine learning algorithms such as decision trees to detect, locate (classification) and evaluate faults. In the online phase, C5.0 decision trees trained in the offline phase were used on a stream of data generated by the system.

Findings

The proposed approach is a comprehensive online component-oriented method, which is a combination of supervised machine learning methods to investigate system faults. Each of these methods is considered a component determined by the dimensions and complexity of the case study (to discover, classify and evaluate fault tolerance). These components are placed together in the form of a process framework so that the appropriate method for each component is obtained based on comparison with other machine learning methods. As a result, depending on the conditions under study, the most efficient method is selected in the components. Before the system implementation phase, its reliability is checked by evaluating the predicted faults (in the system design phase). Therefore, this approach avoids the construction of a high-risk system. Compared to existing methods, the proposed approach is more comprehensive and has greater flexibility.

Research limitations/implications

By expanding the dimensions of the problem, the model verification space grows exponentially using automata.

Originality/value

Unlike the existing methods that only examine one or two aspects of fault analysis such as fault detection, classification and fault-tolerance evaluation, this paper proposes a comprehensive process-oriented approach that investigates all three aspects of fault analysis concurrently.

Details

International Journal of Intelligent Computing and Cybernetics, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1756-378X

Keywords

Book part
Publication date: 30 September 2020

Hera Khan, Ayush Srivastav and Amit Kumar Mishra

A detailed description will be provided of all the classification algorithms that have been widely used in the domain of medical science. The foundation will be laid by giving a…

Abstract

A detailed description will be provided of all the classification algorithms that have been widely used in the domain of medical science. The foundation will be laid by giving a comprehensive overview pertaining to the background and history of the classification algorithms. This will be followed by an extensive discussion regarding various techniques of classification algorithm in machine learning (ML) hence concluding with their relevant applications in data analysis in medical science and health care. To begin with, the initials of this chapter will deal with the basic fundamentals required for a profound understanding of the classification techniques in ML which will comprise of the underlying differences between Unsupervised and Supervised Learning followed by the basic terminologies of classification and its history. Further, it will include the types of classification algorithms ranging from linear classifiers like Logistic Regression, Naïve Bayes to Nearest Neighbour, Support Vector Machine, Tree-based Classifiers, and Neural Networks, and their respective mathematics. Ensemble algorithms such as Majority Voting, Boosting, Bagging, Stacking will also be discussed at great length along with their relevant applications. Furthermore, this chapter will also incorporate comprehensive elucidation regarding the areas of application of such classification algorithms in the field of biomedicine and health care and their contribution to decision-making systems and predictive analysis. To conclude, this chapter will devote highly in the field of research and development as it will provide a thorough insight to the classification algorithms and their relevant applications used in the cases of the healthcare development sector.

Details

Big Data Analytics and Intelligence: A Perspective for Health Care
Type: Book
ISBN: 978-1-83909-099-8

Keywords

Article
Publication date: 13 December 2021

Hamidreza Abbasianjahromi and Mehdi Aghakarimi

Unsafe behavior accounts for a major part of high accident rates in construction projects. The awareness of unsafe circumstances can help modify unsafe behaviors. To improve…

616

Abstract

Purpose

Unsafe behavior accounts for a major part of high accident rates in construction projects. The awareness of unsafe circumstances can help modify unsafe behaviors. To improve awareness in project teams, the present study proposes a framework for predicting safety performance before the implementation of projects.

Design/methodology/approach

The machine learning approach was adopted in this work. The proposed framework consists of two major phases: (1) data collection and (2) model development. The first phase involved several steps, including the identification of safety performance criteria, using a questionnaire to collect data, and converting the data into useful information. The second phase, on the other hand, included the use of the decision tree algorithm coupled with the k-Nearest Neighbors algorithm as the predictive tool along with the proposing modification strategies.

Findings

A total of nine safety performance criteria were identified. The results showed that safety employees, training, rule adherence and management commitment were key criteria for safety performance prediction. It was also found that the decision tree algorithm is capable of predicting safety performance.

Originality/value

The main novelty of the present study is developing an integrated model to propose strategies for the safety enhancement of projects in the case of incorrect predictions.

Details

Engineering, Construction and Architectural Management, vol. 30 no. 3
Type: Research Article
ISSN: 0969-9988

Keywords

Article
Publication date: 17 July 2009

Kamel Rouibah and Sulaiman Al‐Rafee

The purpose of this paper is to investigate the perceptions of 19 requirement engineering (RE) techniques in Kuwait in term of three criteria “awareness,” “use,” and “perceived…

1863

Abstract

Purpose

The purpose of this paper is to investigate the perceptions of 19 requirement engineering (RE) techniques in Kuwait in term of three criteria “awareness,” “use,” and “perceived value generated over past system development projects.” Also, this paper aims to examine possible relationships between these RE techniques and two information system development success factors.

Design/methodology/approach

This paper develops a questionnaire and tests with a sample of respondents from 175 organizations in Kuwait.

Findings

Results show that: Arab culture influence perception of RE techniques; most companies have good knowledge of different techniques; several different techniques for identifying and analyzing customer requirements are used; the most highly valued RE techniques are decision trees, goal oriented, prototyping, data flow diagram (DFD), and interviews; six techniques (tree analysis, role playing, unified modeling language, Kawakita Jiro method, flow charts, and Ishikawa) are found to have the least perceived value; and only two techniques (prototyping and decision tree) are highly correlated with the statement “Obtaining the right requirements is a critical success factor for system development,” while other three techniques (quality function deployment, DFD and role playing) are correlated with “We experienced problems during past system developments projects because of wrong requirements collection.”

Research limitations/implications

The study sheds light on perceptions on RE techniques perception in Kuwait where less is known about the subject from Western researchers.

Practical implications

This paper suggests re‐examining university curriculums in order to prepare students for familiarity with techniques that have proven their effectiveness elsewhere and call for more collaboration between academia and practitioners in order to appropriate research outcomes. In addition, this paper is of benefit to foreign consulting companies willing to penetrate the Gulf Cooperative Council.

Originality/value

This is the first Arab study that sheds light on system development practices in the Arab world.

Details

Information Management & Computer Security, vol. 17 no. 3
Type: Research Article
ISSN: 0968-5227

Keywords

Article
Publication date: 1 October 1995

Susan Coles and Jennifer Rowley

Explores, using appropriate examples, the ways in which decisiontrees can be used by the manager to assist in the longitudinaldecision‐making process. Since the mathematical…

3814

Abstract

Explores, using appropriate examples, the ways in which decision trees can be used by the manager to assist in the longitudinal decision‐making process. Since the mathematical concepts associated with decision trees are complex, managers can be reluctant to attempt to use decision tree models. A recognition that such models can be simply developed in a spreadsheet environment, and can then be used for sensitivity analysis using different decision criteria, demonstrates that decision trees can offer valuable insights into the structure of a strategic decision problem.

Details

Management Decision, vol. 33 no. 8
Type: Research Article
ISSN: 0025-1747

Keywords

1 – 10 of over 11000