Search results

1 – 10 of 65
Open Access
Article
Publication date: 3 February 2020

Wen Li, Wei Wang and Wenjun Huo

Inspired by the basic idea of gradient boosting, this study aims to design a novel multivariate regression ensemble algorithm RegBoost by using multivariate linear regression as a…

4525

Abstract

Purpose

Inspired by the basic idea of gradient boosting, this study aims to design a novel multivariate regression ensemble algorithm RegBoost by using multivariate linear regression as a weak predictor.

Design/methodology/approach

To achieve nonlinearity after combining all linear regression predictors, the training data is divided into two branches according to the prediction results using the current weak predictor. The linear regression modeling is recursively executed in two branches. In the test phase, test data is distributed to a specific branch to continue with the next weak predictor. The final result is the sum of all weak predictors across the entire path.

Findings

Through comparison experiments, it is found that the algorithm RegBoost can achieve similar performance to the gradient boosted decision tree (GBDT). The algorithm is very effective compared to linear regression.

Originality/value

This paper attempts to design a novel regression algorithm RegBoost with reference to GBDT. To the best of the knowledge, for the first time, RegBoost uses linear regression as a weak predictor, and combine with gradient boosting to build an ensemble algorithm.

Details

International Journal of Crowd Science, vol. 4 no. 1
Type: Research Article
ISSN: 2398-7294

Keywords

Open Access
Article
Publication date: 5 June 2023

Elias Shohei Kamimura, Anderson Rogério Faia Pinto and Marcelo Seido Nagano

This paper aims to present a literature review of the most recent optimisation methods applied to Credit Scoring Models (CSMs).

2492

Abstract

Purpose

This paper aims to present a literature review of the most recent optimisation methods applied to Credit Scoring Models (CSMs).

Design/methodology/approach

The research methodology employed technical procedures based on bibliographic and exploratory analyses. A traditional investigation was carried out using the Scopus, ScienceDirect and Web of Science databases. The papers selection and classification took place in three steps considering only studies in English language and published in electronic journals (from 2008 to 2022). The investigation led up to the selection of 46 publications (10 presenting literature reviews and 36 proposing CSMs).

Findings

The findings showed that CSMs are usually formulated using Financial Analysis, Machine Learning, Statistical Techniques, Operational Research and Data Mining Algorithms. The main databases used by the researchers were banks and the University of California, Irvine. The analyses identified 48 methods used by CSMs, the main ones being: Logistic Regression (13%), Naive Bayes (10%) and Artificial Neural Networks (7%). The authors conclude that advances in credit score studies will require new hybrid approaches capable of integrating Big Data and Deep Learning algorithms into CSMs. These algorithms should have practical issues considered consider practical issues for improving the level of adaptation and performance demanded for the CSMs.

Practical implications

The results of this study might provide considerable practical implications for the application of CSMs. As it was aimed to demonstrate the application of optimisation methods, it is highly considerable that legal and ethical issues should be better adapted to CSMs. It is also suggested improvement of studies focused on micro and small companies for sales in instalment plans and commercial credit through the improvement or new CSMs.

Originality/value

The economic reality surrounding credit granting has made risk management a complex decision-making issue increasingly supported by CSMs. Therefore, this paper satisfies an important gap in the literature to present an analysis of recent advances in optimisation methods applied to CSMs. The main contribution of this paper consists of presenting the evolution of the state of the art and future trends in studies aimed at proposing better CSMs.

Details

Journal of Economics, Finance and Administrative Science, vol. 28 no. 56
Type: Research Article
ISSN: 2077-1886

Keywords

Open Access
Article
Publication date: 27 March 2023

Annye Braca and Pierpaolo Dondio

Prediction is a critical task in targeted online advertising, where predictions better than random guessing can translate to real economic return. This study aims to use machine…

2294

Abstract

Purpose

Prediction is a critical task in targeted online advertising, where predictions better than random guessing can translate to real economic return. This study aims to use machine learning (ML) methods to identify individuals who respond well to certain linguistic styles/persuasion techniques based on Aristotle’s means of persuasion, rhetorical devices, cognitive theories and Cialdini’s principles, given their psychometric profile.

Design/methodology/approach

A total of 1,022 individuals took part in the survey; participants were asked to fill out the ten item personality measure questionnaire to capture personality traits and the dysfunctional attitude scale (DAS) to measure dysfunctional beliefs and cognitive vulnerabilities. ML classification models using participant profiling information as input were developed to predict the extent to which an individual was influenced by statements that contained different linguistic styles/persuasion techniques. Several ML algorithms were used including support vector machine, LightGBM and Auto-Sklearn to predict the effect of each technique given each individual’s profile (personality, belief system and demographic data).

Findings

The findings highlight the importance of incorporating emotion-based variables as model input in predicting the influence of textual statements with embedded persuasion techniques. Across all investigated models, the influence effect could be predicted with an accuracy ranging 53%–70%, indicating the importance of testing multiple ML algorithms in the development of a persuasive communication (PC) system. The classification ability of models was highest when predicting the response to statements using rhetorical devices and flattery persuasion techniques. Contrastingly, techniques such as authority or social proof were less predictable. Adding DAS scale features improved model performance, suggesting they may be important in modelling persuasion.

Research limitations/implications

In this study, the survey was limited to English-speaking countries and largely Western society values. More work is needed to ascertain the efficacy of models for other populations, cultures and languages. Most PC efforts are targeted at groups such as users, clients, shoppers and voters with this study in the communication context of education – further research is required to explore the capability of predictive ML models in other contexts. Finally, long self-reported psychological questionnaires may not be suitable for real-world deployment and could be subject to bias, thus a simpler method needs to be devised to gather user profile data such as using a subset of the most predictive features.

Practical implications

The findings of this study indicate that leveraging richer profiling data in conjunction with ML approaches may assist in the development of enhanced persuasive systems. There are many applications such as online apps, digital advertising, recommendation systems, chatbots and e-commerce platforms which can benefit from integrating persuasion communication systems that tailor messaging to the individual – potentially translating into higher economic returns.

Originality/value

This study integrates sets of features that have heretofore not been used together in developing ML-based predictive models of PC. DAS scale data, which relate to dysfunctional beliefs and cognitive vulnerabilities, were assessed for their importance in identifying effective persuasion techniques. Additionally, the work compares a range of persuasion techniques that thus far have only been studied separately. This study also demonstrates the application of various ML methods in predicting the influence of linguistic styles/persuasion techniques within textual statements and show that a robust methodology comparing a range of ML algorithms is important in the discovery of a performant model.

Details

Journal of Systems and Information Technology, vol. 25 no. 2
Type: Research Article
ISSN: 1328-7265

Keywords

Open Access
Article
Publication date: 28 July 2020

Xisto L. Travassos, Sérgio L. Avila and Nathan Ida

Ground Penetrating Radar is a multidisciplinary Nondestructive Evaluation technique that requires knowledge of electromagnetic wave propagation, material properties and antenna…

5933

Abstract

Ground Penetrating Radar is a multidisciplinary Nondestructive Evaluation technique that requires knowledge of electromagnetic wave propagation, material properties and antenna theory. Under some circumstances this tool may require auxiliary algorithms to improve the interpretation of the collected data. Detection, location and definition of target’s geometrical and physical properties with a low false alarm rate are the objectives of these signal post-processing methods. Basic approaches are focused in the first two objectives while more robust and complex techniques deal with all objectives at once. This work reviews the use of Artificial Neural Networks and Machine Learning for data interpretation of Ground Penetrating Radar surveys. We show that these computational techniques have progressed GPR forward from locating and testing to imaging and diagnosis approaches.

Details

Applied Computing and Informatics, vol. 17 no. 2
Type: Research Article
ISSN: 2634-1964

Keywords

Open Access
Article
Publication date: 29 April 2021

Roberto Sala, Marco Bertoni, Fabiana Pirola and Giuditta Pezzotta

This paper aims to present a dual-perspective framework for maintenance service delivery that should be used by manufacturing companies to structure and manage their maintenance…

2107

Abstract

Purpose

This paper aims to present a dual-perspective framework for maintenance service delivery that should be used by manufacturing companies to structure and manage their maintenance service delivery process, using aggregated historical and real-time data to improve operational decision-making. The framework, built for continuous improvement, allows the exploitation of maintenance data to improve the knowledge of service processes and machines.

Design/methodology/approach

The Dual-perspective, data-based decision-making process for maintenance delivery (D3M) framework development and test followed a qualitative approach based on literature reviews and semi-structured interviews. The pool of companies interviewed was expanded from the development to the test stage to increase its applicability and present additional perspectives.

Findings

The interviews confirmed that manufacturing companies are interested in exploiting the data generated in the use phase to improve operational decision-making in maintenance service delivery. Feedback to improve the framework methods and tools was collected, as well as suggestions for the introduction of new ones according to the companies' necessities.

Originality/value

The paper presents a novel framework addressing the data-based decision-making process for maintenance service delivery. The D3M framework can be used by manufacturing companies to structure their maintenance service delivery process and improve their knowledge of machines and service processes.

Details

Journal of Manufacturing Technology Management, vol. 32 no. 9
Type: Research Article
ISSN: 1741-038X

Keywords

Open Access
Article
Publication date: 25 August 2021

Weiwei Zhu, Jinglin Wu, Ting Fu, Junhua Wang, Jie Zhang and Qiangqiang Shangguan

Efficient traffic incident management is needed to alleviate the negative impact of traffic incidents. Accurate and reliable estimation of traffic incident duration is of great…

1511

Abstract

Purpose

Efficient traffic incident management is needed to alleviate the negative impact of traffic incidents. Accurate and reliable estimation of traffic incident duration is of great importance for traffic incident management. Previous studies have proposed models for traffic incident duration prediction; however, most of these studies focus on the total duration and could not update prediction results in real-time. From a traveler’s perspective, the relevant factor is the residual duration of the impact of the traffic incident. Besides, few (if any) studies have used dynamic traffic flow parameters in the prediction models. This paper aims to propose a framework to fill these gaps.

Design/methodology/approach

This paper proposes a framework based on the multi-layer perception (MLP) and long short-term memory (LSTM) model. The proposed methodology integrates traffic incident-related factors and real-time traffic flow parameters to predict the residual traffic incident duration. To validate the effectiveness of the framework, traffic incident data and traffic flow data from Shanghai Zhonghuan Expressway are used for modeling training and testing.

Findings

Results show that the model with 30-min time window and taking both traffic volume and speed as inputs performed best. The area under the curve values exceed 0.85 and the prediction accuracies exceed 0.75. These indicators demonstrated that the model is appropriate for this study context. The model provides new insights into traffic incident duration prediction.

Research limitations/implications

The incident samples applied by this study might not be enough and the variables are not abundant. The number of injuries and casualties, more detailed description of the incident location and other variables are expected to be used to characterize the traffic incident comprehensively. The framework needs to be further validated through a sufficiently large number of variables and locations.

Practical implications

The framework can help reduce the impacts of incidents on the safety of efficiency of road traffic once implemented in intelligent transport system and traffic management systems in future practical applications.

Originality/value

This study uses two artificial neural network methods, MLP and LSTM, to establish a framework aiming at providing accurate and time-efficient information on traffic incident duration in the future for transportation operators and travelers. This study will contribute to the deployment of emergency management and urban traffic navigation planning.

Details

Journal of Intelligent and Connected Vehicles, vol. 4 no. 2
Type: Research Article
ISSN: 2399-9802

Keywords

Content available
Article
Publication date: 4 January 2022

D.M.K.N. Seneviratna and R.M. Kapila Tharanga Rathnayaka

The Coronavirus (COVID-19) is one of the major pandemic diseases caused by a newly discovered virus that has been directly affecting the human respiratory system. Because of the…

Abstract

Purpose

The Coronavirus (COVID-19) is one of the major pandemic diseases caused by a newly discovered virus that has been directly affecting the human respiratory system. Because of the gradually increasing magnitude of the COVID-19 pandemic across the world, it has been sparking emergencies and critical issues in the healthcare systems around the world. However, predicting the exact amount of daily reported new COVID cases is the most serious issue faced by governments around the world today. So, the purpose of this current study is to propose a novel hybrid grey exponential smoothing model (HGESM) to predicting transmission dynamics of the COVID-19 outbreak properly.

Design/methodology/approach

As a result of the complications relates to the traditional time series approaches, the proposed HGESM model is well defined to handle exponential data patterns in multidisciplinary systems. The proposed methodology consists of two parts as double exponential smoothing and grey exponential smoothing modeling approach respectively. The empirical analysis of this study was carried out on the basis of the 3rd outbreak of Covid-19 cases in Sri Lanka, from 1st March 2021 to 15th June 2021. Out of the total 90 daily observations, the first 85% of daily confirmed cases were used during the training, and the remaining 15% of the sample.

Findings

The new proposed HGESM is highly accurate (less than 10%) with the lowest root mean square error values in one head forecasting. Moreover, mean absolute deviation accuracy testing results confirmed that the new proposed model has given more significant results than other time-series predictions with the limited samples.

Originality/value

The findings suggested that the new proposed HGESM is more suitable and effective for forecasting time series with the exponential trend in a short-term manner.

Details

Grey Systems: Theory and Application, vol. 12 no. 4
Type: Research Article
ISSN: 2043-9377

Keywords

Content available
Article
Publication date: 4 March 2024

Jie Yan

The purpose of the study is to examine the use of alternative information in bank lending to small and medium enterprises (SMEs). Understanding alternative information and its use…

Abstract

Purpose

The purpose of the study is to examine the use of alternative information in bank lending to small and medium enterprises (SMEs). Understanding alternative information and its use in bank lending to SMEs is important because it has become a growing part of the future of SME finance. The results and findings of my study not only enrich the finance literature but, more importantly, also address the use of Fintech in the risk management of SME lending, a new and complex problem that is specific to both the information technology and finance field.

Design/methodology/approach

To answer the research question, the author used a case study approach that relies upon qualitative data and analysis. By iterating between the existing literature, theoretical pieces and empirical findings, the author explain and interpret in detail how the use of alternative information impacts loan outcomes and develop insights to guide future research.

Findings

The case is outlined in two time periods including the prepartnership period and the postpartnership period. It highlights the establishment of a partnership between LoanBank and FintechInc (pseudonym), aimed at SME-focused Fintech lending. The findings underscore how the partnership has enabled a mutually beneficial situation where LoanBank and FintechInc leverage each other’s strengths to provide efficient and effective lending services. The adoption of alternative information in the risk management Fintech (RMF) platform of FintechInc has transformed LoanBank’s lending processes, showcasing how technological innovations can enhance SME lending practices.

Originality/value

The study’s originality mainly lies in the three detailed insights regarding alternative information’s impact on SME lending: information, platform properties and financial inclusion. The information part demonstrates that RMF platforms expand the information used for lending decisions, shifting from traditional hard and soft data to incorporating various alternative information sources. The platform properties part suggests that location, openness and technology also play a pivotal role in shaping lending outcomes. Finally, the financial inclusion part proposes that the use of alternative information has the potential to improve financial inclusion and offer better credit terms to previously underserved borrowers.

Details

Qualitative Research in Financial Markets, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1755-4179

Keywords

Open Access
Article
Publication date: 28 August 2019

Dongdong Ge, Luhui Hu, Bo Jiang, Guangjun Su and Xiaole Wu

The purpose of this paper is to achieve intelligent superstore site selection. Yonghui Superstores partnered with Cardinal Operations to incorporate a tremendous amount of…

2202

Abstract

Purpose

The purpose of this paper is to achieve intelligent superstore site selection. Yonghui Superstores partnered with Cardinal Operations to incorporate a tremendous amount of site-related information (e.g. points of interest, population density and features, distribution of competitors, transportation, commercial ecosystem, existing own-store network) into its store site optimization.

Design/methodology/approach

This paper showcases the integration of regression, optimization and machine learning approaches in site selection, which has proven practical and effective.

Findings

The result was the development of the “Yonghui Intelligent Site Selection System” that includes three modules: business district scoring, intelligent site engine and precision sales forecasting. The application of this system helps to significantly reduce the labor force required to visit and investigate all potential sites, circumvent the pitfalls associated with possibly biased experience or intuition-based decision making and achieve the same population coverage as competitors while needing only half the number of stores as its competitors.

Originality/value

To our knowledge, this project is among the first to integrate regression, optimization and machine learning approaches in site selection. There is innovation in optimization techniques.

Details

Modern Supply Chain Research and Applications, vol. 1 no. 1
Type: Research Article
ISSN: 2631-3871

Keywords

Open Access
Article
Publication date: 24 June 2021

Bo Wang, Guanwei Wang, Youwei Wang, Zhengzheng Lou, Shizhe Hu and Yangdong Ye

Vehicle fault diagnosis is a key factor in ensuring the safe and efficient operation of the railway system. Due to the numerous vehicle categories and different fault mechanisms…

Abstract

Purpose

Vehicle fault diagnosis is a key factor in ensuring the safe and efficient operation of the railway system. Due to the numerous vehicle categories and different fault mechanisms, there is an unbalanced fault category problem. Most of the current methods to solve this problem have complex algorithm structures, low efficiency and require prior knowledge. This study aims to propose a new method which has a simple structure and does not require any prior knowledge to achieve a fast diagnosis of unbalanced vehicle faults.

Design/methodology/approach

This study proposes a novel K-means with feature learning based on the feature learning K-means-improved cluster-centers selection (FKM-ICS) method, which includes the ICS and the FKM. Specifically, this study defines cluster centers approximation to select the initialized cluster centers in the ICS. This study uses improved term frequency-inverse document frequency to measure and adjust the feature word weights in each cluster, retaining the top τ feature words with the highest weight in each cluster and perform the clustering process again in the FKM. With the FKM-ICS method, clustering performance for unbalanced vehicle fault diagnosis can be significantly enhanced.

Findings

This study finds that the FKM-ICS can achieve a fast diagnosis of vehicle faults on the vehicle fault text (VFT) data set from a railway station in the 2017 (VFT) data set. The experimental results on VFT indicate the proposed method in this paper, outperforms several state-of-the-art methods.

Originality/value

This is the first effort to address the vehicle fault diagnostic problem and the proposed method performs effectively and efficiently. The ICS enables the FKM-ICS method to exclude the effect of outliers, solves the disadvantages of the fault text data contained a certain amount of noisy data, which effectively enhanced the method stability. The FKM enhances the distribution of feature words that discriminate between different fault categories and reduces the number of feature words to make the FKM-ICS method faster and better cluster for unbalanced vehicle fault diagnostic.

Details

Smart and Resilient Transportation, vol. 3 no. 2
Type: Research Article
ISSN: 2632-0487

Keywords

1 – 10 of 65