Search results

1 – 10 of over 7000
Article
Publication date: 31 July 2019

Zhe Zhang and Yue Dai

For classification problems of customer relationship management (CRM), the purpose of this paper is to propose a method with interpretability of the classification results that…

Abstract

Purpose

For classification problems of customer relationship management (CRM), the purpose of this paper is to propose a method with interpretability of the classification results that combines multiple decision trees based on a genetic algorithm.

Design/methodology/approach

In the proposed method, multiple decision trees are combined in parallel. Subsequently, a genetic algorithm is used to optimize the weight matrix in the combination algorithm.

Findings

The method is applied to customer credit rating assessment and customer response behavior pattern recognition. The results demonstrate that compared to a single decision tree, the proposed combination method improves the predictive accuracy and optimizes the classification rules, while maintaining interpretability of the classification results.

Originality/value

The findings of this study contribute to research methodologies in CRM. It specifically focuses on a new method with interpretability by combining multiple decision trees based on genetic algorithms for customer classification.

Details

Asia Pacific Journal of Marketing and Logistics, vol. 32 no. 5
Type: Research Article
ISSN: 1355-5855

Keywords

Article
Publication date: 6 June 2008

Norbert Tóth and Béla Pataki

The purpose of this paper is to provide classification confidence value to every individual sample classified by decision trees and use this value to combine the classifiers.

Abstract

Purpose

The purpose of this paper is to provide classification confidence value to every individual sample classified by decision trees and use this value to combine the classifiers.

Design/methodology/approach

The proposed system is first theoretically explained, and then the use and effectiveness of the proposed system is demonstrated on sample datasets.

Findings

In this paper, a novel method is proposed to combine decision tree classifiers using calculated classification confidence values. This confidence in the classification is based on distance calculation to the relevant decision boundary (distance conditional), probability density estimation and (distance conditional) classification confidence estimation. It is shown that these values – provided by individual classification trees – can be integrated to derive a consensus decision.

Research limitations/implications

The proposed method is not limited to axis‐parallel trees, it is applicable not only to oblique trees, but also to any kind of classifier system that uses hyperplanes to cluster the input space.

Originality/value

A novel method is presented to extend decision tree like classifiers with confidence calculation and a voting system is proposed that uses this confidence information. The proposed system possesses several novelties (e.g. it not only gives class probabilities, but also classification confidences) and advantages over previous (traditional) approaches. The voting system does not require an auxiliary combiner or gating network, as in the mixture of experts structure and the method is not limited to decision trees with axis‐parallel splits; it is applicable to any kind of classifiers that use hyperplanes to cluster the input space.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 1 no. 2
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 5 May 2023

Nguyen Thi Dinh, Nguyen Thi Uyen Nhi, Thanh Manh Le and Thanh The Van

The problem of image retrieval and image description exists in various fields. In this paper, a model of content-based image retrieval and image content extraction based on the KD…

Abstract

Purpose

The problem of image retrieval and image description exists in various fields. In this paper, a model of content-based image retrieval and image content extraction based on the KD-Tree structure was proposed.

Design/methodology/approach

A Random Forest structure was built to classify the objects on each image on the basis of the balanced multibranch KD-Tree structure. From that purpose, a KD-Tree structure was generated by the Random Forest to retrieve a set of similar images for an input image. A KD-Tree structure is applied to determine a relationship word at leaves to extract the relationship between objects on an input image. An input image content is described based on class names and relationships between objects.

Findings

A model of image retrieval and image content extraction was proposed based on the proposed theoretical basis; simultaneously, the experiment was built on multi-object image datasets including Microsoft COCO and Flickr with an average image retrieval precision of 0.9028 and 0.9163, respectively. The experimental results were compared with those of other works on the same image dataset to demonstrate the effectiveness of the proposed method.

Originality/value

A balanced multibranch KD-Tree structure was built to apply to relationship classification on the basis of the original KD-Tree structure. Then, KD-Tree Random Forest was built to improve the classifier performance and retrieve a set of similar images for an input image. Concurrently, the image content was described in the process of combining class names and relationships between objects.

Details

Data Technologies and Applications, vol. 57 no. 4
Type: Research Article
ISSN: 2514-9288

Keywords

Open Access
Article
Publication date: 22 November 2022

Kedong Yin, Yun Cao, Shiwei Zhou and Xinman Lv

The purposes of this research are to study the theory and method of multi-attribute index system design and establish a set of systematic, standardized, scientific index systems…

Abstract

Purpose

The purposes of this research are to study the theory and method of multi-attribute index system design and establish a set of systematic, standardized, scientific index systems for the design optimization and inspection process. The research may form the basis for a rational, comprehensive evaluation and provide the most effective way of improving the quality of management decision-making. It is of practical significance to improve the rationality and reliability of the index system and provide standardized, scientific reference standards and theoretical guidance for the design and construction of the index system.

Design/methodology/approach

Using modern methods such as complex networks and machine learning, a system for the quality diagnosis of index data and the classification and stratification of index systems is designed. This guarantees the quality of the index data, realizes the scientific classification and stratification of the index system, reduces the subjectivity and randomness of the design of the index system, enhances its objectivity and rationality and lays a solid foundation for the optimal design of the index system.

Findings

Based on the ideas of statistics, system theory, machine learning and data mining, the focus in the present research is on “data quality diagnosis” and “index classification and stratification” and clarifying the classification standards and data quality characteristics of index data; a data-quality diagnosis system of “data review – data cleaning – data conversion – data inspection” is established. Using a decision tree, explanatory structural model, cluster analysis, K-means clustering and other methods, classification and hierarchical method system of indicators is designed to reduce the redundancy of indicator data and improve the quality of the data used. Finally, the scientific and standardized classification and hierarchical design of the index system can be realized.

Originality/value

The innovative contributions and research value of the paper are reflected in three aspects. First, a method system for index data quality diagnosis is designed, and multi-source data fusion technology is adopted to ensure the quality of multi-source, heterogeneous and mixed-frequency data of the index system. The second is to design a systematic quality-inspection process for missing data based on the systematic thinking of the whole and the individual. Aiming at the accuracy, reliability, and feasibility of the patched data, a quality-inspection method of patched data based on inversion thought and a unified representation method of data fusion based on a tensor model are proposed. The third is to use the modern method of unsupervised learning to classify and stratify the index system, which reduces the subjectivity and randomness of the design of the index system and enhances its objectivity and rationality.

Details

Marine Economics and Management, vol. 5 no. 2
Type: Research Article
ISSN: 2516-158X

Keywords

Article
Publication date: 5 June 2017

Hao Wu

This paper aims to inspect the defects of solder joints of printed circuit board in real-time production line, simple computing and high accuracy are primary consideration factors…

Abstract

Purpose

This paper aims to inspect the defects of solder joints of printed circuit board in real-time production line, simple computing and high accuracy are primary consideration factors for feature extraction and classification algorithm.

Design/methodology/approach

In this study, the author presents an ensemble method for the classification of solder joint defects. The new method is based on extracting the color and geometry features after solder image acquisition and using decision trees to guarantee the algorithm’s running executive efficiency. To improve algorithm accuracy, the author proposes an ensemble method of random forest which combined several trees for the classification of solder joints.

Findings

The proposed method has been tested using 280 samples of solder joints, including good and various defect types, for experiments. The results show that the proposed method has a high accuracy.

Originality/value

The author extracted the color and geometry features and used decision trees to guarantee the algorithm's running executive efficiency. To improve the algorithm accuracy, the author proposes using an ensemble method of random forest which combined several trees for the classification of solder joints. The results show that the proposed method has a high accuracy.

Details

Soldering & Surface Mount Technology, vol. 29 no. 3
Type: Research Article
ISSN: 0954-0911

Keywords

Article
Publication date: 21 December 2023

Majid Rahi, Ali Ebrahimnejad and Homayun Motameni

Taking into consideration the current human need for agricultural produce such as rice that requires water for growth, the optimal consumption of this valuable liquid is…

Abstract

Purpose

Taking into consideration the current human need for agricultural produce such as rice that requires water for growth, the optimal consumption of this valuable liquid is important. Unfortunately, the traditional use of water by humans for agricultural purposes contradicts the concept of optimal consumption. Therefore, designing and implementing a mechanized irrigation system is of the highest importance. This system includes hardware equipment such as liquid altimeter sensors, valves and pumps which have a failure phenomenon as an integral part, causing faults in the system. Naturally, these faults occur at probable time intervals, and the probability function with exponential distribution is used to simulate this interval. Thus, before the implementation of such high-cost systems, its evaluation is essential during the design phase.

Design/methodology/approach

The proposed approach included two main steps: offline and online. The offline phase included the simulation of the studied system (i.e. the irrigation system of paddy fields) and the acquisition of a data set for training machine learning algorithms such as decision trees to detect, locate (classification) and evaluate faults. In the online phase, C5.0 decision trees trained in the offline phase were used on a stream of data generated by the system.

Findings

The proposed approach is a comprehensive online component-oriented method, which is a combination of supervised machine learning methods to investigate system faults. Each of these methods is considered a component determined by the dimensions and complexity of the case study (to discover, classify and evaluate fault tolerance). These components are placed together in the form of a process framework so that the appropriate method for each component is obtained based on comparison with other machine learning methods. As a result, depending on the conditions under study, the most efficient method is selected in the components. Before the system implementation phase, its reliability is checked by evaluating the predicted faults (in the system design phase). Therefore, this approach avoids the construction of a high-risk system. Compared to existing methods, the proposed approach is more comprehensive and has greater flexibility.

Research limitations/implications

By expanding the dimensions of the problem, the model verification space grows exponentially using automata.

Originality/value

Unlike the existing methods that only examine one or two aspects of fault analysis such as fault detection, classification and fault-tolerance evaluation, this paper proposes a comprehensive process-oriented approach that investigates all three aspects of fault analysis concurrently.

Details

International Journal of Intelligent Computing and Cybernetics, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 1 October 2005

Simona Juvan, Tomaz Bartol and Bojana Boh

The article seeks to address a methodological procedure based on keyword analysis and the structuring of data into information systems in the field of functional foods, a…

1191

Abstract

Purpose

The article seeks to address a methodological procedure based on keyword analysis and the structuring of data into information systems in the field of functional foods, a newly‐emerging scientific field within the broader scope of food sciences and technology.

Design/methodology/approach

An experiment was undertaken by selection of a research field or research subject, selection of search profile, selection and processing of relevant databases, keyword analysis, and the arrangement of data (keywords) according to tree‐structures. Keyword analysis was employed to identify narrower research fields within the broader scientific field. The structuring of data into systems was used to classify the terms within the particular narrow field. Keywords with higher and lower frequency were identified. A classification tree was set up, based on keywords (thesaurus‐based descriptors) extracted from the FSTA (Food Science and Technology Abstracts) database available online. The tree was supplemented and upgraded with some additional topical terms that have not as yet been included in the existing thesaurus. To serve as a comparison a completely new classification tree was designed, based on online full‐text data.

Findings

Comparison of the two trees suggests that the previous existing tree is sufficiently accurate in representing the field of functional foods, provided that it is upgraded with some additional terms. A more accurate classification should improve thesauri and consequently enhance retrieval in international databases.

Originality/value

Presents a methodology of database analysis which may serve to improve database patterns, especially with regard to information retrieval.

Details

Online Information Review, vol. 29 no. 5
Type: Research Article
ISSN: 1468-4527

Keywords

Abstract

Details

Machine Learning and Artificial Intelligence in Marketing and Sales
Type: Book
ISBN: 978-1-80043-881-1

Article
Publication date: 1 December 2020

Lanmin Wang, Hongmin Wang, Huiyan Zhang, Naiseman Akemujiang and Aimin Xiao

Body type classification has a great influence on plate making and garment sizing system, and the accuracy of body type classification method will greatly affect the fit of…

Abstract

Purpose

Body type classification has a great influence on plate making and garment sizing system, and the accuracy of body type classification method will greatly affect the fit of garment production. The purpose of this paper is to use the decision tree algorithm to study body classification rules, develop a decision tree body recognition model and judge the body shape of middle-aged women in Xinjiang.

Design/methodology/approach

First, perform dimensionless processing on the collected data of 256 middle-aged women in Xinjiang, and the dimensionless data were used for K-means body clustering; Then, quantitatively analyze the effectiveness of different classification clusters based on the silhouette coefficients. Second, the decision tree algorithm is used to divide the classified sample data into a training set and a test set at a ratio of 70/30, and select the best node and the best branch based on the Gini coefficient to construct a classification tree. Last, the overall optimal decision tree is generated by means of hyperparameter pruning.

Findings

The body shape of middle-aged women in Xinjiang can be divided into three types: standard body, plump body and obese body. The decision tree model has an excellent effect on body classification of middle-aged women in Xinjiang (precision (macro), 95.46%; precision (micro), 95.95%; recall (macro), 95.46%; recall (micro), 95.95%; F1 (macro), 95.46%; F1 (micro), 95.95%).

Originality/value

For scientific research, this paper is conducive to increasing the regional body type theory and stimulating the establishment of a garment sizing subdivision system in Xinjiang. In terms of production practice, this paper not only establishes a model for judging the shape of middle-aged women in Xinjiang, but also provides reference data for intermediates of various sizes. In addition, to facilitate pattern-making and the establishment of a subdivision system for the size of middle-aged women's garments in Xinjiang, this paper provides the grading values of various body control parts of middle-aged women in Xinjiang.

Details

International Journal of Clothing Science and Technology, vol. 33 no. 3
Type: Research Article
ISSN: 0955-6222

Keywords

Article
Publication date: 4 January 2013

Hanh Tran and David G. Carmichael

A common distraction to contractors is that of cash management, and particularly incoming payments. In the extreme, a failure to manage a project's cash flows may bring about…

1281

Abstract

Purpose

A common distraction to contractors is that of cash management, and particularly incoming payments. In the extreme, a failure to manage a project's cash flows may bring about business failure. A contractor's financial viability rests on how actual payments from an owner are received. The purpose of this paper is to present a method for contractors to evaluate the punctuality and fullness of owner payments based on historical behaviour.

Design/methodology/approach

Owners are classified according to their late and incomplete payment practices. The payment profile of an owner, in the form of aging payments received based on claims, is used as a basis for the method's development. Regression trees are constructed based on three predictor variables, namely, the average time to payment following a claim, the total amount ending up being paid within a certain period and the level of variability in claim response times.

Findings

The method will be of interest to contractors concerned with managing their cash positions, as well as those persons looking at contractor‐owner relationships.

Practical implications

The method is intended to be used internally within a contractor's organisation to assist in decision making. The method can also be used by subcontractors, suppliers, and consultants. Owners may use the method reflectively to improve their own practices, to save time and cost by reducing disputes, and to develop better owner‐contractor relationships.

Originality/value

This paper represents an original approach, and an original contribution to contractor pre‐tendering risk analysis practices, and an extension to contractor claim‐payment analysis.

Details

Engineering, Construction and Architectural Management, vol. 20 no. 1
Type: Research Article
ISSN: 0969-9988

Keywords

1 – 10 of over 7000