Search results

1 – 10 of over 1000
Article
Publication date: 7 February 2022

Muralidhar Vaman Kamath, Shrilaxmi Prashanth, Mithesh Kumar and Adithya Tantri

The compressive strength of concrete depends on many interdependent parameters; its exact prediction is not that simple because of complex processes involved in strength…

Abstract

Purpose

The compressive strength of concrete depends on many interdependent parameters; its exact prediction is not that simple because of complex processes involved in strength development. This study aims to predict the compressive strength of normal concrete and high-performance concrete using four datasets.

Design/methodology/approach

In this paper, five established individual Machine Learning (ML) regression models have been compared: Decision Regression Tree, Random Forest Regression, Lasso Regression, Ridge Regression and Multiple-Linear regression. Four datasets were studied, two of which are previous research datasets, and two datasets are from the sophisticated lab using five established individual ML regression models.

Findings

The five statistical indicators like coefficient of determination (R2), mean absolute error, root mean squared error, Nash–Sutcliffe efficiency and mean absolute percentage error have been used to compare the performance of the models. The models are further compared using statistical indicators with previous studies. Lastly, to understand the variable effect of the predictor, the sensitivity and parametric analysis were carried out to find the performance of the variable.

Originality/value

The findings of this paper will allow readers to understand the factors involved in identifying the machine learning models and concrete datasets. In so doing, we hope that this research advances the toolset needed to predict compressive strength.

Details

Journal of Engineering, Design and Technology , vol. 22 no. 2
Type: Research Article
ISSN: 1726-0531

Keywords

Article
Publication date: 13 April 2023

Ian Lenaers, Kris Boudt and Lieven De Moor

The purpose is twofold. First, this study aims to establish that black box tree-based machine learning (ML) models have better predictive performance than a standard linear

168

Abstract

Purpose

The purpose is twofold. First, this study aims to establish that black box tree-based machine learning (ML) models have better predictive performance than a standard linear regression (LR) hedonic model for rent prediction. Second, it shows the added value of analyzing tree-based ML models with interpretable machine learning (IML) techniques.

Design/methodology/approach

Data on Belgian residential rental properties were collected. Tree-based ML models, random forest regression and eXtreme gradient boosting regression were applied to derive rent prediction models to compare predictive performance with a LR model. Interpretations of the tree-based models regarding important factors in predicting rent were made using SHapley Additive exPlanations (SHAP) feature importance (FI) plots and SHAP summary plots.

Findings

Results indicate that tree-based models perform better than a LR model for Belgian residential rent prediction. The SHAP FI plots agree that asking price, cadastral income, surface livable, number of bedrooms, number of bathrooms and variables measuring the proximity to points of interest are dominant predictors. The direction of relationships between rent and its factors is determined with SHAP summary plots. In addition to linear relationships, it emerges that nonlinear relationships exist.

Originality/value

Rent prediction using ML is relatively less studied than house price prediction. In addition, studying prediction models using IML techniques is relatively new in real estate economics. Moreover, to the best of the authors’ knowledge, this study is the first to derive insights of driving determinants of predicted rents from SHAP FI and SHAP summary plots.

Details

International Journal of Housing Markets and Analysis, vol. 17 no. 1
Type: Research Article
ISSN: 1753-8270

Keywords

Article
Publication date: 9 January 2024

Visar Hoxha

The purpose of this study is to carry out a comparative analysis of four machine learning models such as linear regression, decision trees, k-nearest neighbors and support vector…

Abstract

Purpose

The purpose of this study is to carry out a comparative analysis of four machine learning models such as linear regression, decision trees, k-nearest neighbors and support vector regression in predicting housing prices in Prishtina.

Design/methodology/approach

Using Python, the models were assessed on a data set of 1,512 property transactions with mean squared error, coefficient of determination, mean absolute error and root mean squared error as metrics. The study also conducts variable importance test.

Findings

Upon preprocessing and standardization of the data, the models were trained and tested, with the decision tree model producing the best performance. The variable importance test found the distance from central business district and distance to the road leading to central business district as the most relevant drivers of housing prices across all models, with the exception of support vector machine model, which showed minimal importance for all variables.

Originality/value

To the best of the author’s knowledge, the originality of this research rests in its methodological approach and emphasis on Prishtina's real estate market, which has never been studied in this context, and its findings may be generalizable to comparable transitional economies with booming real estate sector like Kosovo.

Details

International Journal of Housing Markets and Analysis, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1753-8270

Keywords

Article
Publication date: 1 November 2023

Muhammad Asim, Muhammad Yar Khan and Khuram Shafi

The study aims to investigate the presence of herding behavior in the stock market of UK with a special emphasis on news sentiment regarding the economy. The authors focus on the…

Abstract

Purpose

The study aims to investigate the presence of herding behavior in the stock market of UK with a special emphasis on news sentiment regarding the economy. The authors focus on the news sentiment because in the current digital era, investors take their decision making on the basis of current trends projected by news and media platforms.

Design/methodology/approach

For empirical modeling, the authors use machine learning models to investigate the presence of herding behavior in UK stock market for the period starting from 2006 to 2021. The authors use support vector regression, single layer neural network and multilayer neural network models to predict the herding behavior in the stock market of the UK. The authors estimate the herding coefficients using all the models and compare the findings with the linear regression model.

Findings

The results show a strong evidence of herding behavior in the stock market of the UK during different time regimes. Furthermore, when the authors incorporate the economic uncertainty news sentiment in the model, the results show a significant improvement. The results of support vector regression, single layer perceptron and multilayer perceptron model show the evidence of herding behavior in UK stock market during global financial crises of 2007–08 and COVID’19 period. In addition, the authors compare the findings with the linear regression which provides no evidence of herding behavior in all the regimes except COVID’19. The results also provide deep insights for both individual investors and policy makers to construct efficient portfolios and avoid market crashes, respectively.

Originality/value

In the existing literature of herding behavior, news sentiment regarding economic uncertainty has not been used before. However, in the present era this parameter is quite critical in context of market anomalies hence and needs to be investigated. In addition, the literature exhibits varying results about the existence of herding behavior when different methodologies are used. In this context, the use of machine learning models is quite rare in the herding literature. The machine learning models are quite robust and provide accurate results. Therefore, this research study uses three different models, i.e. single layer perceptron model, multilayer perceptron model and support vector regression model to investigate the herding behavior in the stock market of the UK. A comparative analysis is also presented among the results of all the models. The study sheds light on the importance of economic uncertainty news sentiment to predict the herding behavior.

Details

Review of Behavioral Finance, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1940-5979

Keywords

Article
Publication date: 8 September 2021

Odey Alshboul, Ali Shehadeh, Maha Al-Kasasbeh, Rabia Emhamed Al Mamlook, Neda Halalsheh and Muna Alkasasbeh

Heavy equipment residual value forecasting is dynamic as it relies on the age, type, brand and model of the equipment, ranking condition, place of sale, operating hours and other…

Abstract

Purpose

Heavy equipment residual value forecasting is dynamic as it relies on the age, type, brand and model of the equipment, ranking condition, place of sale, operating hours and other macroeconomic gauges. The main objective of this study is to predict the residual value of the main types of heavy construction equipment. The residual value of heavy construction equipment is predicted via deep learning (DL) and machine learning (ML) approaches.

Design/methodology/approach

Based on deep and machine learning regression network integrated with data mining, random forest (RF), decision tree (DT), deep neural network (DNN) and linear regression (LR)-based modeling decision support models are developed. This research aims to forecast the residual value for different types of heavy construction equipment. A comprehensive investigation of publicly accessible auction data related to various types and categories of construction equipment was utilized to generate the model's training and testing datasets. In total, four performance metrics (i.e. the mean absolute error (MAE), mean squared error (MSE), the mean absolute percentage error (MAPE) and coefficient of determination (R2)) were used to measure and compare the developed algorithms' accuracy.

Findings

The developed algorithm's efficiency has been demonstrated by comparing the deep and machine learning predictions with real residual value. The accuracy of the results obtained by different proposed modeling techniques was comparable based on the performance evaluation metrics. DT shows the highest accuracy of 0.9111 versus RF with an accuracy of 0.8123, followed by DNN with an accuracy of 0.7755 and the linear regression with an accuracy of 0.5967.

Originality/value

The proposed novel model is designed as a supportive tool for construction project managers for equipment selling, purchasing, overhauling, repairing, disposing and replacing decisions.

Details

Engineering, Construction and Architectural Management, vol. 29 no. 10
Type: Research Article
ISSN: 0969-9988

Keywords

Article
Publication date: 2 October 2017

Ajay Kumar Dhamija, Surendra S. Yadav and P.K. Jain

The purpose of this paper is to find out the best method for forecasting European Union Allowance (EUA) returns and determine its price determinants. The previous studies in this…

Abstract

Purpose

The purpose of this paper is to find out the best method for forecasting European Union Allowance (EUA) returns and determine its price determinants. The previous studies in this area have focused on a particular subset of EUA data and do not take care of the multicollinearities. The authors take EUA data from all three phases and the continuous series, adopt the principal component analysis (PCA) to eliminate multicollinearities and fit seven different homoscedastic models for a comprehensive analysis.

Design/methodology/approach

PCA is adopted to extract independent factors. Seven different linear regression and auto regressive integrated moving average (ARIMA) models are employed for forecasting EUA returns and isolating their price determinants. The seven models are then compared and the one with minimum (root mean square error is adjudged as the best model.

Findings

The best model for forecasting the EUA returns of all three phases is dynamic linear regression with lagged predictors and that for forecasting EUA continuous series is ARIMA errors. The latent factors such as switch to gas (STG) and clean spread (capturing the effects of the clean dark spread, clean spark spread, switching price and natural gas price), National Allocation Plan announcements events, energy variables, German Stock Exchange index and extreme temperature events have been isolated as the price determinants of EUA returns.

Practical implications

The current study contributes to effective carbon management by providing a quantitative framework for analyzing cap-and-trade schemes.

Originality/value

This study differs from earlier studies mainly in three aspects. First, instead of focusing on a particular subset of EUA data, it comprehensively analyses the data of all the three phases of EUA along with the EUA continuous series. Second, it expressly adopts PCA to eliminate multicollinearities, thereby reducing the error variance. Finally, it evaluates both linear and non-linear homoscedastic models incorporating lags of predictor variables to isolate the price determinants of EUA.

Details

Journal of Advances in Management Research, vol. 14 no. 4
Type: Research Article
ISSN: 0972-7981

Keywords

Article
Publication date: 15 June 2021

Hoyoung Rho, Keunho Choi and Donghee Yoo

This study identifies whether the Internet search index can be used as effective enough data to identify agricultural and livestock product demand and compare the accuracy of the…

Abstract

Purpose

This study identifies whether the Internet search index can be used as effective enough data to identify agricultural and livestock product demand and compare the accuracy of the prediction of major agricultural and livestock products purchases between these prediction models using artificial neural network, linear regression and a decision tree.

Design/methodology/approach

Artificial neural network, linear regression and decision tree algorithms were used in this study to compare the accuracy of the prediction of major agricultural and livestock products purchases. The analysis data were studied using 10-fold cross validation.

Findings

First, the importance of the Internet search index among the 20 explanatory variables was found to be high for most items, so the Internet search index can be used as a variable to explain agricultural and livestock products purchases. Second, as a result of comparing the accuracy of the prediction of six agricultural and livestock purchases using three models, beef was the most predictable, followed by radishes, chicken, Chinese cabbage, garlic and dried peppers, and by model, a decision tree shows the highest accuracy of prediction, followed by linear regression and an artificial neural network.

Originality/value

This study is meaningful in that it analyzes the purchase of agricultural and livestock products using data from actual consumers' purchases of agricultural and livestock products. In addition, the use of data mining techniques and Internet search index in the analysis of agricultural and livestock purchases contributes to improving the accuracy and efficiency of agricultural and livestock purchase predictions.

Details

Data Technologies and Applications, vol. 55 no. 5
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 18 August 2023

Enas Hendawy, David G. McMillan, Zaki M. Sakr and Tamer Mohamed Shahwan

This paper aims to introduce a new perspective on long-term stock return predictability by focusing on the relative (individual and hybrid) informative power of a wide range of…

Abstract

Purpose

This paper aims to introduce a new perspective on long-term stock return predictability by focusing on the relative (individual and hybrid) informative power of a wide range of accounting (firm-related), technical and macroeconomic factors while considering the past performance of the stocks using machine learning algorithms.

Design/methodology/approach

The sample includes a panel data set of 94 non-financial firms listed in Egyptian Exchange 100 index from 2014: Q1 to 2019: Q4. Relativity has been investigated by comparing relevant factors’ individual and combined informative power and differentiating between losers and winners based on historical stock returns. To predict the quarterly stock returns, Gaussian process regression (GPR) has been used. The robustness of the results is examined through the out-of-sample test. This study also uses linear regression (LR) as a benchmark model.

Findings

The past performance and the presence of other predictors influence the informative power of relevant factors and hence their predictive ability. The out-of-sample results show a trade-off between GPR and LR with proven superiority to GPR in limited experiments. The individual informative power outperforms the hybrid power, in which macroeconomic indicators outperform the remaining sets of indicators for losers, while winners show mixed results in terms of various performance evaluation metrics. Prediction accuracy is generally higher for losers than for winners.

Practical implications

This study provides interesting insight into the dynamic nature of the predictor variables in terms of stock return predictability. Hence, this study also deepens the understanding of asset pricing in a way that directly contributes to practitioners’ portfolio diversification strategies.

Originality/value

In concern of the chaos of factors in the literature and its accompanying misleading conclusions, this study takes another look at the approach that studies stock return predictability. To the best of the authors’ knowledge, this is the first study in the Egyptian context that re-examines the predictive power of the previously discovered factors from a different perspective that highlights their relative nature.

Details

Journal of Financial Reporting and Accounting, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1985-2517

Keywords

Article
Publication date: 4 October 2022

Mohammed Hamza Momade, Serdar Durdyev, Saurav Dixit, Shamsuddin Shahid and Abubakar Kori Alkali

Construction projects in Malaysia are often delayed and over budget due to heavy reliance on labor. Linear regression (LR) models have been used in most labor cost (LC) studies…

Abstract

Purpose

Construction projects in Malaysia are often delayed and over budget due to heavy reliance on labor. Linear regression (LR) models have been used in most labor cost (LC) studies, which are less accurate than machine learning (ML) tools. Construction management applications have increasingly used ML tools in recent years and have greatly impacted forecasting. The research aims to identify the most influential LC factors using statistical approaches, collect data and forecast LC models for improved forecasts of LC.

Design/methodology/approach

A thorough literature review was completed to identify LC factors. Experienced project managers were administered to rank the factors based on importance and relevance. Then, data were collected for the six highest ranked factors, and five ML models were created. Finally, five categorical indices were used to analyze and measure the effectiveness of models in determining the performance category.

Findings

Worker age, construction skills, worker origin, worker training/education, type of work and worker experience were identified as the most influencing factors on LC. SVM provided the best in comparison to other models.

Originality/value

The findings support data-driven regulatory and practice improvements aimed at improving labor issues in Malaysia, with the possibility for replication in other countries facing comparable problems.

Details

International Journal of Building Pathology and Adaptation, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2398-4708

Keywords

Article
Publication date: 5 August 2019

Behnam Hamedi and Alireza Mokhtar

The purpose of this study is to investigate and analysis of energy consumption for this industry. The core part of any energy management system (EnMS) in industry is to perfectly…

244

Abstract

Purpose

The purpose of this study is to investigate and analysis of energy consumption for this industry. The core part of any energy management system (EnMS) in industry is to perfectly monitor the energy consumption of significant users and to continuously improve the energy performance. In petrochemical plants, production deals with energy-intensive processes, and measuring energy performance for recognition and assessment of potentials for saving is critical.

Design/methodology/approach

The required data are exploited for the period of March 2011-August 2016 (data set: 2,012 days). Multivariate linear regression (MLR) and multi-layer perceptron artificial neural network (ANN) methods are separately used to anticipate the energy consumption. The baseline will be assumed as a reference to be compared with the actual data to estimate the real saving values. Finally, cumulative summations (CUSUM) are proposed and applied as an effective indicator for measurement of energy performance in an LDPE.

Findings

In this study, two statistical methods of MLR and ANN were used to design and develop a comprehensive energy baseline representing the predicted amounts of energy consumption based on the recognized drivers. Although both models imply robust outcomes, when the relative errors are taken into account, performance of ANN models appears fairly superior compared to the MLR model.

Originality/value

It is highly suggested to the ISO technical committee dealing with energy management standards, to consider the proposed model for baseline development in the future version of the standard ISO 50006 as the supplementary extension for the ISO 50001 for measuring energy performance using EnB and EnPI. As for future studies, the research can be extended to investigate the uncertainty and the model could also become completed applying more advanced ANNs such as recurrent neural networks.

1 – 10 of over 1000