Search results

1 – 10 of 423
Article
Publication date: 28 April 2021

Virok Sharma, Mohd Zaki, Kumar Neeraj Jha and N. M. Anoop Krishnan

This paper aims to use a data-driven approach towards optimizing construction operations. To this extent, it presents a machine learning (ML)-aided optimization approach, wherein…

Abstract

Purpose

This paper aims to use a data-driven approach towards optimizing construction operations. To this extent, it presents a machine learning (ML)-aided optimization approach, wherein the construction cost is predicted as a function of time, resources and environmental impact, which is further used as a surrogate model for cost optimization.

Design/methodology/approach

Taking a dataset from literature, the paper has applied various ML algorithms, namely, simple and regularized linear regression, random forest, gradient boosted trees, neural network and Gaussian process regression (GPR) to predict the construction cost as a function of time, resources and environmental impact. Further, the trained models were used to optimize the construction cost applying single-objective (with and without constraints) and multi-objective optimizations, employing Bayesian optimization, particle swarm optimization (PSO) and non-dominated sorted genetic algorithm.

Findings

The results presented in the paper demonstrate that the ensemble methods, such as gradient boosted trees, exhibit the best performance for construction cost prediction. Further, it shows that multi-objective optimization can be used to develop a Pareto front for two competing variables, such as cost and environmental impact, which directly allows a practitioner to make a rational decision.

Research limitations/implications

Note that the sequential nature of events which dictates the scheduling is not considered in the present work. This aspect could be incorporated in the future to develop a robust scheme that can optimize the scheduling dynamically.

Originality/value

The paper demonstrates that a ML approach coupled with optimization could enable the development of an efficient and economic strategy to plan the construction operations.

Details

Engineering, Construction and Architectural Management, vol. 29 no. 3
Type: Research Article
ISSN: 0969-9988

Keywords

Article
Publication date: 25 July 2019

Xia Li, Ruibin Bai, Peer-Olaf Siebers and Christian Wagner

Many transport and logistics companies nowadays use raw vehicle GPS data for travel time prediction. However, they face difficult challenges in terms of the costs of information…

Abstract

Purpose

Many transport and logistics companies nowadays use raw vehicle GPS data for travel time prediction. However, they face difficult challenges in terms of the costs of information storage, as well as the quality of the prediction. This paper aims to systematically investigate various meta-data (features) that require significantly less storage space but provide sufficient information for high-quality travel time predictions.

Design/methodology/approach

The paper systematically studied the combinatorial effects of features and different model fitting strategies with two popular decision tree ensemble methods for travel time prediction, namely, random forests and gradient boosting regression trees. First, the investigation was conducted using pseudo travel time data that were generated using a pseudo travel time sampling algorithm, which allows generating travel time data using different noise processes so that the prediction performance under different travel conditions and noise characteristics can be studied systematically. The results and findings were then further compared and evaluated through a real-life case.

Findings

The paper provides empirical insights and guidelines about how raw GPS data can be reduced into a small-sized feature vector for the purposes of vehicle travel time prediction. It suggests that, add travel time observations from the previous departure time intervals are beneficial to the prediction, particularly when there is no other types of real-time information (e.g. traffic flow, speed) are available. It was also found that modular model fitting does not improve the quality of the prediction in all experimental settings used in this paper.

Research limitations/implications

The findings are primarily based on empirical studies on limited real-life data instances, and the results may lack generalisabilities. Therefore, the researchers are encouraged to test them further in more real-life data instances.

Practical implications

The paper includes implications and guidelines for the development of efficient GPS data storage and high-quality travel time prediction under different types of travel conditions.

Originality/value

This paper systematically studies the combinatorial feature effects for tree-ensemble-based travel time prediction approaches.

Details

VINE Journal of Information and Knowledge Management Systems, vol. 49 no. 3
Type: Research Article
ISSN: 2059-5891

Keywords

Open Access
Article
Publication date: 13 November 2018

Bo Liu, Libin Shen, Huanling You, Yan Dong, Jianqiang Li and Yong Li

The influence of road surface temperature (RST) on vehicles is becoming more and more obvious. Accurate predication of RST is distinctly meaningful. At present, however, the…

1014

Abstract

Purpose

The influence of road surface temperature (RST) on vehicles is becoming more and more obvious. Accurate predication of RST is distinctly meaningful. At present, however, the prediction accuracy of RST is not satisfied with physical methods or statistical learning methods. To find an effective prediction method, this paper selects five representative algorithms to predict the road surface temperature separately.

Design/methodology/approach

Multiple linear regressions, least absolute shrinkage and selection operator, random forest and gradient boosting regression tree (GBRT) and neural network are chosen to be representative predictors.

Findings

The experimental results show that for temperature data set of this experiment, the prediction effect of GBRT in the ensemble algorithm is the best compared with the other four algorithms.

Originality/value

This paper compares different kinds of machine learning algorithms, observes the road surface temperature data from different angles, and finds the most suitable prediction method.

Details

International Journal of Crowd Science, vol. 2 no. 3
Type: Research Article
ISSN: 2398-7294

Keywords

Book part
Publication date: 1 October 2014

Marcelo Brutti Righi, Yi Yang and Paulo Sergio Ceretta

In this chapter, we estimate the Expected Shortfall (ES) in conditional autoregressive expectile models by using a nonparametric multiple expectile regression via gradient tree

Abstract

In this chapter, we estimate the Expected Shortfall (ES) in conditional autoregressive expectile models by using a nonparametric multiple expectile regression via gradient tree boosting. This approach has the advantages generated by the flexibility of not having to rely on data assumptions and avoids the drawbacks and fragilities of a restrictive estimator such as Historical Simulation. We consider distinct specifications for the information sets that produce the ES estimates. The results obtained with simulated and real market data indicate that the proposed approach has good performance, with some distinctions between the specifications.

Details

Risk Management Post Financial Crisis: A Period of Monetary Easing
Type: Book
ISBN: 978-1-78441-027-8

Keywords

Article
Publication date: 28 June 2021

Meseret Getnet Meharie, Wubshet Jekale Mengesha, Zachary Abiero Gariy and Raphael N.N. Mutuku

The purpose of this study to apply stacking ensemble machine learning algorithm for predicting the cost of highway construction projects.

Abstract

Purpose

The purpose of this study to apply stacking ensemble machine learning algorithm for predicting the cost of highway construction projects.

Design/methodology/approach

The proposed stacking ensemble model was developed by combining three distinct base predictive models automatically and optimally: linear regression, support vector machine and artificial neural network models using gradient boosting algorithm as meta-regressor.

Findings

The findings reveal that the proposed model predicted the final project cost with a very small prediction error value. This implies that the difference between predicted and actual cost was quite small. A comparison of the results of the models revealed that in all performance metrics, the stacking ensemble model outperforms the sole ones. The stacking ensemble cost model produces 86.8, 87.8 and 5.6 percent more accurate results than linear regression, vector machine support, and neural network models, respectively, based on the root mean square error values.

Research limitations/implications

The study shows how stacking ensemble machine learning algorithm applies to predict the cost of construction projects. The estimators or practitioners can use the new model as an effectual and reliable tool for predicting the cost of Ethiopian highway construction projects at the preliminary stage.

Originality/value

The study provides insight into the machine learning algorithm application in forecasting the cost of future highway construction projects in Ethiopia.

Details

Engineering, Construction and Architectural Management, vol. 29 no. 7
Type: Research Article
ISSN: 0969-9988

Keywords

Article
Publication date: 9 August 2023

Siyu Su, Youchao Sun, Chong Peng and Yuanyuan Guo

The purpose of this paper is to identify the key influencing factors of aviation accidents and to predict the aviation accidents caused by the factors.

Abstract

Purpose

The purpose of this paper is to identify the key influencing factors of aviation accidents and to predict the aviation accidents caused by the factors.

Design/methodology/approach

This paper proposes an improved gray correlation analysis (IGCA) theory to make the relational analysis of aviation accidents and influencing factors and find out the critical causes of aviation accidents. The optimal varying weight combination model (OVW-CM) is constructed based on gradient boosted regression tree (GBRT), extreme gradient boosting (XGBoost) and support vector regression (SVR) to predict aviation accidents due to critical factors.

Findings

The global aviation accident data from 1919 to 2020 is selected as the experimental data. The airplane, takeoff/landing and unexpected results are the leading causes of the aviation accidents based on IGCA. Then GBRT, XGBoost, SVR, equal-weight combination model (EQ-CM), variance-covariance combination model (VCW-CM) and OVW-CM are used to predict aviation accidents caused by airplane, takeoff/landing and unexpected results, respectively. The experimental results show that OVW-CM has a better prediction effect, and the prediction accuracy and stability are higher than other models.

Originality/value

Unlike the traditional gray correlation analysis (GCA), IGCA weights the sample by distance analysis to more objectively reflect the degree of influence of different factors on aviation accidents. OVW-CM is built by minimizing the combined prediction error at sample points and assigns different weights to different individual models at different moments, which can make full use of the advantages of each model and has higher prediction accuracy. And the model parameters of GBRT, XGBoost and SVR are optimized by the particle swarm algorithm. The study can guide the analysis and prediction of aviation accidents and provide a scientific basis for aviation safety management.

Details

Engineering Computations, vol. 40 no. 7/8
Type: Research Article
ISSN: 0264-4401

Keywords

Article
Publication date: 5 July 2022

Mahesh Babu Mariappan, Kanniga Devi, Yegnanarayanan Venkataraman and Samuel Fosso Wamba

The purpose of this study is to present a large-scale real-world comparative study using pre-COVID lockdown data versus post-COVID lockdown data on predicting shipment times of…

Abstract

Purpose

The purpose of this study is to present a large-scale real-world comparative study using pre-COVID lockdown data versus post-COVID lockdown data on predicting shipment times of therapeutic supplies in e-pharmacy supply chains and show that our proposed methodology is robust to lockdown effects.

Design/methodology/approach

The researchers used organic data of over 5.9 million records of therapeutic shipments, with 2.87 million records collected pre-COVID lockdown and 3.03 million records collected post-COVID lockdown. The researchers built various Machine Learning (ML) classifier models on the two datasets, namely, Random Forest (RF), Extra Trees (XRT), Decision Tree (DT), Multi-Layer Perceptron (MLP), XGBoost (XGB), CatBoost (CB), Linear Stochastic Gradient Descent (SGD) and the Linear Naïve Bayes (NB). Then, the researchers stacked these base models and built meta models on top of them. Further, the researchers performed a detailed comparison of the performances of ML models on pre-COVID lockdown and post-COVID lockdown datasets.

Findings

The proposed approach attains performance of 93.5% on real-world post-COVID lockdown data and 91.35% on real-world pre-COVID lockdown data. In contrast, the turn-around times (TAT) provided by therapeutic supply logistics providers are 62.91% accurate compared to reality in post-COVID lockdown times and 73.68% accurate compared to reality pre-COVID lockdown times. Hence, it is clear that while the TAT provided by logistics providers has deteriorated in the post-pandemic business climate, the proposed method is robust to handle pandemic lockdown effects on e-pharmacy supply chains.

Research limitations/implications

The implication of the study provides a novel ML-based framework for predicting the shipment times of therapeutics, diagnostics and vaccines, and it is robust to COVID-19 lockdown effects.

Practical implications

E-pharmacy companies can readily adopt the proposed approach to enhance their supply chain management (SCM) capabilities and build resilience during COVID lockdown times.

Originality/value

The present study is one of the first to perform a large-scale real-world comparative analysis on predicting therapeutic supply shipment times in the e-pharmacy supply chain with novel ML ensemble stacking, obtaining robust results in these COVID lockdown times.

Details

International Journal of Physical Distribution & Logistics Management, vol. 52 no. 7
Type: Research Article
ISSN: 0960-0035

Keywords

Article
Publication date: 7 July 2021

Amirhessam Tahmassebi, Mehrtash Motamedi, Amir H. Alavi and Amir H. Gandomi

Engineering design and operational decisions depend largely on deep understanding of applications that requires assumptions for simplification of the problems in order to find…

207

Abstract

Purpose

Engineering design and operational decisions depend largely on deep understanding of applications that requires assumptions for simplification of the problems in order to find proper solutions. Cutting-edge machine learning algorithms can be used as one of the emerging tools to simplify this process. In this paper, we propose a novel scalable and interpretable machine learning framework to automate this process and fill the current gap.

Design/methodology/approach

The essential principles of the proposed pipeline are mainly (1) scalability, (2) interpretibility and (3) robust probabilistic performance across engineering problems. The lack of interpretibility of complex machine learning models prevents their use in various problems including engineering computation assessments. Many consumers of machine learning models would not trust the results if they cannot understand the method. Thus, the SHapley Additive exPlanations (SHAP) approach is employed to interpret the developed machine learning models.

Findings

The proposed framework can be applied to a variety of engineering problems including seismic damage assessment of structures. The performance of the proposed framework is investigated using two case studies of failure identification in reinforcement concrete (RC) columns and shear walls. In addition, the reproducibility, reliability and generalizability of the results were validated and the results of the framework were compared to the benchmark studies. The results of the proposed framework outperformed the benchmark results with high statistical significance.

Originality/value

Although, the current study reveals that the geometric input features and reinforcement indices are the most important variables in failure modes detection, better model can be achieved with employing more robust strategies to establish proper database to decrease the errors in some of the failure modes identification.

Details

Engineering Computations, vol. 39 no. 2
Type: Research Article
ISSN: 0264-4401

Keywords

Article
Publication date: 9 July 2020

James Wakiru, Liliane Pintelon, Peter Muchiri and Peter Chemweno

The purpose of this paper is to develop a maintenance decision support system (DSS) framework using in-service lubricant data for fault diagnosis. The DSS reveals embedded…

Abstract

Purpose

The purpose of this paper is to develop a maintenance decision support system (DSS) framework using in-service lubricant data for fault diagnosis. The DSS reveals embedded patterns in the data (knowledge discovery) and automatically quantifies the influence of lubricant parameters on the unhealthy state of the machine using alternative classifiers. The classifiers are compared for robustness from which decision-makers select an appropriate classifier given a specific lubricant data set.

Design/methodology/approach

The DSS embeds a framework integrating cluster and principal component analysis, for feature extraction, and eight classifiers among them extreme gradient boosting (XGB), random forest (RF), decision trees (DT) and logistic regression (LR). A qualitative and quantitative criterion is developed in conjunction with practitioners for comparing the classifier models.

Findings

The results show the importance of embedded knowledge, explored via a knowledge discovery approach. Moreover, the efficacy of the embedded knowledge on maintenance DSS is emphasized. Importantly, the proposed framework is demonstrated as plausible for decision support due to its high accuracy and consideration of practitioners needs.

Practical implications

The proposed framework will potentially assist maintenance managers in accurately exploiting lubricant data for maintenance DSS, while offering insights with reduced time and errors.

Originality/value

Advances in lubricant-based intelligent approach for fault diagnosis is seldom utilized in practice, however, may be incorporated in the information management systems offering high predictive accuracy. The classification models' comparison approach, will inevitably assist the industry in selecting amongst divergent models' for DSS.

Details

Journal of Quality in Maintenance Engineering, vol. 27 no. 2
Type: Research Article
ISSN: 1355-2511

Keywords

Article
Publication date: 1 March 2023

Farouq Sammour, Heba Alkailani, Ghaleb J. Sweis, Rateb J. Sweis, Wasan Maaitah and Abdulla Alashkar

Demand forecasts are a key component of planning efforts and are crucial for managing core operations. This study aims to evaluate the use of several machine learning (ML…

Abstract

Purpose

Demand forecasts are a key component of planning efforts and are crucial for managing core operations. This study aims to evaluate the use of several machine learning (ML) algorithms to forecast demand for residential construction in Jordan.

Design/methodology/approach

The identification and selection of variables and ML algorithms that are related to the demand for residential construction are indicated using a literature review. Feature selection was done by using a stepwise backward elimination. The developed algorithm’s accuracy has been demonstrated by comparing the ML predictions with real residual values and compared based on the coefficient of determination.

Findings

Nine economic indicators were selected to develop the demand models. Elastic-Net showed the highest accuracy of (0.838) versus artificial neural networkwith an accuracy of (0.727), followed by Eureqa with an accuracy of (0.715) and the Extra Trees with an accuracy of (0.703). According to the results of the best-performing model forecast, Jordan’s 2023 first-quarter demand for residential construction is anticipated to rise by 11.5% from the same quarter of the year 2022.

Originality/value

The results of this study extend to the existing body of knowledge through the identification of the most influential variables in the Jordanian residential construction industry. In addition, the models developed will enable users in the fields of construction engineering to make reliable demand forecasts while also assisting in effective financial decision-making.

Details

Construction Innovation , vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1471-4175

Keywords

1 – 10 of 423