Search results

1 – 10 of 303
Article
Publication date: 10 July 2024

Wiput Tuvayanond, Viroon Kamchoom and Lapyote Prasittisopin

This paper aims to clarify the efficient process of the machine learning algorithms implemented in the ready-mix concrete (RMC) onsite. It proposes innovative machine learning…

73

Abstract

Purpose

This paper aims to clarify the efficient process of the machine learning algorithms implemented in the ready-mix concrete (RMC) onsite. It proposes innovative machine learning algorithms in terms of preciseness and computation time for the RMC strength prediction.

Design/methodology/approach

This paper presents an investigation of five different machine learning algorithms, namely, multilinear regression, support vector regression, k-nearest neighbors, extreme gradient boosting (XGBOOST) and deep neural network (DNN), that can be used to predict the 28- and 56-day compressive strengths of nine mix designs and four mixing conditions. Two algorithms were designated for fitting the actual and predicted 28- and 56-day compressive strength data. Moreover, the 28-day compressive strength data were implemented to predict 56-day compressive strength.

Findings

The efficacy of the compressive strength data was predicted by DNN and XGBOOST algorithms. The computation time of the XGBOOST algorithm was apparently faster than the DNN, offering it to be the most suitable strength prediction tool for RMC.

Research limitations/implications

Since none has been practically adopted the machine learning for strength prediction for RMC, the scope of this work focuses on the commercially available algorithms. The adoption of the modified methods to fit with the RMC data should be determined thereafter.

Practical implications

The selected algorithms offer efficient prediction for promoting sustainability to the RMC industries. The standard adopting such algorithms can be established, excluding the traditional labor testing. The manufacturers can implement research to introduce machine learning in the quality controcl process of their plants.

Originality/value

Regarding literature review, machine learning has been assessed regarding the laboratory concrete mix design and concrete performance. A study conducted based on the on-site production and prolonged mixing parameters is lacking.

Details

Construction Innovation , vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1471-4175

Keywords

Article
Publication date: 31 May 2022

Osamah M. Al-Qershi, Junbum Kwon, Shuning Zhao and Zhaokun Li

For the case of many content features, This paper aims to investigate which content features in video and text ads more contribute to accurately predicting the success of…

1089

Abstract

Purpose

For the case of many content features, This paper aims to investigate which content features in video and text ads more contribute to accurately predicting the success of crowdfunding by comparing prediction models.

Design/methodology/approach

With 1,368 features extracted from 15,195 Kickstarter campaigns in the USA, the authors compare base models such as logistic regression (LR) with tree-based homogeneous ensembles such as eXtreme gradient boosting (XGBoost) and heterogeneous ensembles such as XGBoost + LR.

Findings

XGBoost shows higher prediction accuracy than LR (82% vs 69%), in contrast to the findings of a previous relevant study. Regarding important content features, humans (e.g. founders) are more important than visual objects (e.g. products). In both spoken and written language, words related to experience (e.g. eat) or perception (e.g. hear) are more important than cognitive (e.g. causation) words. In addition, a focus on the future is more important than a present or past time orientation. Speech aids (see and compare) to complement visual content are also effective and positive tone matters in speech.

Research limitations/implications

This research makes theoretical contributions by finding more important visuals (human) and language features (experience, perception and future time). Also, in a multimodal context, complementary cues (e.g. speech aids) across different modalities help. Furthermore, the noncontent parts of speech such as positive “tone” or pace of speech are important.

Practical implications

Founders are encouraged to assess and revise the content of their video or text ads as well as their basic campaign features (e.g. goal, duration and reward) before they launch their campaigns. Next, overly complex ensembles may suffer from overfitting problems. In practice, model validation using unseen data is recommended.

Originality/value

Rather than reducing the number of content feature dimensions (Kaminski and Hopp, 2020), by enabling advanced prediction models to accommodate many contents features, prediction accuracy rises substantially.

Article
Publication date: 3 September 2024

Biplab Bhattacharjee, Kavya Unni and Maheshwar Pratap

Product returns are a major challenge for e-businesses as they involve huge logistical and operational costs. Therefore, it becomes crucial to predict returns in advance. This…

Abstract

Purpose

Product returns are a major challenge for e-businesses as they involve huge logistical and operational costs. Therefore, it becomes crucial to predict returns in advance. This study aims to evaluate different genres of classifiers for product return chance prediction, and further optimizes the best performing model.

Design/methodology/approach

An e-commerce data set having categorical type attributes has been used for this study. Feature selection based on chi-square provides a selective features-set which is used as inputs for model building. Predictive models are attempted using individual classifiers, ensemble models and deep neural networks. For performance evaluation, 75:25 train/test split and 10-fold cross-validation strategies are used. To improve the predictability of the best performing classifier, hyperparameter tuning is performed using different optimization methods such as, random search, grid search, Bayesian approach and evolutionary models (genetic algorithm, differential evolution and particle swarm optimization).

Findings

A comparison of F1-scores revealed that the Bayesian approach outperformed all other optimization approaches in terms of accuracy. The predictability of the Bayesian-optimized model is further compared with that of other classifiers using experimental analysis. The Bayesian-optimized XGBoost model possessed superior performance, with accuracies of 77.80% and 70.35% for holdout and 10-fold cross-validation methods, respectively.

Research limitations/implications

Given the anonymized data, the effects of individual attributes on outcomes could not be investigated in detail. The Bayesian-optimized predictive model may be used in decision support systems, enabling real-time prediction of returns and the implementation of preventive measures.

Originality/value

There are very few reported studies on predicting the chance of order return in e-businesses. To the best of the authors’ knowledge, this study is the first to compare different optimization methods and classifiers, demonstrating the superiority of the Bayesian-optimized XGBoost classification model for returns prediction.

Details

Journal of Systems and Information Technology, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1328-7265

Keywords

Article
Publication date: 27 March 2023

Jinghui Deng, Qiyou Cheng and Xing Lu

Helicopter fuselage vibration prediction is important to keep a safety and comfortable flight process. The helicopter vibration mechanism model is difficult to meet of demand for…

Abstract

Purpose

Helicopter fuselage vibration prediction is important to keep a safety and comfortable flight process. The helicopter vibration mechanism model is difficult to meet of demand for accurate vibration prediction. Thus, the purpose of this paper is to develop an intelligent algorithm for accurate helicopter fuselage vibration analysis.

Design/methodology/approach

In this research, a novel weighted variational mode decomposition (VMD)- extreme gradient boosting (xgboost) helicopter fuselage vibration prediction model is proposed. The vibration data is decomposed and reconstructed by the signal clustering results. The vibration response is predicted by xgboost algorithm based on the reconstructed data. The information transfer order between the controllable flight data and flight attitude are analyzed.

Findings

The mean absolute percentage error (MAPE), root mean square error (RMSE) and mean absolute error (MAE) of the proposed weighted VMD-xgboost model are decreased by 6.8%, 31.5% and 32.8% compared with xgboost model. The established weighted VMD-xgboost model has the highest prediction accuracy with the lowest mean MAPE, RMSE and MAE of 4.54%, 0.0162, and 0.0131, respectively. The attitude of horizontal tail and cycle pitch are the key factors to vibration.

Originality/value

A novel weighted VMD-xgboost intelligent prediction methods is proposed. The prediction effect of xgboost model is highly improved by using the signal-weighted reconstruction technique. In addition, the data set used is collected from actual helicopter flight process.

Details

Aircraft Engineering and Aerospace Technology, vol. 95 no. 7
Type: Research Article
ISSN: 1748-8842

Keywords

Article
Publication date: 17 February 2021

Apostolos Ampountolas and Mark P. Legg

This study aims to predict hotel demand through text analysis by investigating keyword series to increase demand predictions’ precision. To do so, this paper presents a framework…

1158

Abstract

Purpose

This study aims to predict hotel demand through text analysis by investigating keyword series to increase demand predictions’ precision. To do so, this paper presents a framework for modeling hotel demand that incorporates machine learning techniques.

Design/methodology/approach

The empirical forecasting is conducted by introducing a segmented machine learning approach of leveraging hierarchical clustering tied to machine learning and deep learning techniques. These features allow the model to yield more precise estimates. This study evaluates an extensive range of social media–derived words with the most significant probability of gradually establishing an understanding of an optimal outcome. Analyzes were performed on a major hotel chain in an urban market setting within the USA.

Findings

The findings indicate that while traditional methods, being the naïve approach and ARIMA models, struggled with forecasting accuracy, segmented boosting methods (XGBoost) leveraging social media predict hotel occupancy with greater precision for all examined time horizons. Additionally, the segmented learning approach improved the forecasts’ stability and robustness while mitigating common overfitting issues within a highly dimensional data set.

Research limitations/implications

Incorporating social media into a segmented learning framework can augment the current generation of forecasting methods’ accuracy. Moreover, the segmented learning approach mitigates the negative effects of market shifts (e.g. COVID-19) that can reduce in-production forecasts’ life-cycles. The ability to be more robust to market deviations will allow hospitality firms to minimize development time.

Originality/value

The results are expected to generate insights by providing revenue managers with an instrument for predicting demand.

Details

International Journal of Contemporary Hospitality Management, vol. 33 no. 6
Type: Research Article
ISSN: 0959-6119

Keywords

Article
Publication date: 19 May 2021

Song Wang and Yang Yang

The rapid development of e-commerce has brought not only great convenience to people but a great challenge to online stores. Phenomenon such as out of stock and slow sales has…

Abstract

Purpose

The rapid development of e-commerce has brought not only great convenience to people but a great challenge to online stores. Phenomenon such as out of stock and slow sales has been common in recent years. These issues can be managed only when the occurrence of the sales volume is predicted in advance, and sufficient warnings can be executed in time. Thus, keeping in mind the importance of the sales prediction system, the purpose of this paper is to propose an effective sales prediction model and make digital marketing strategies with the machine learning model.

Design/methodology/approach

Based on the consumer purchasing behavior decision theory, we discuss the factors affecting product sales, including external factors, consumer perception, consumer potential purchase behavior and consumer traffic. Then we propose a sales prediction model, M-GNA-XGBOOST, using the time-series prediction that ensures the effective prediction of sales about each product in a short time on online stores based on the sales data in the previous term or month or year. The proposed M-GNA-XGBOOST model serves as an adaptive prediction model, for which the instant factors and the sales data of the previous period are the input, and the optimal computation is based on the proposed methodology. The adaptive prediction using the proposed model is developed based on the LSTM (Long Short-Term Memory), GAN (Generative Adversarial Networks) and XGBOOST (eXtreme Gradient Boosting). The model inherits the advantages among the algorithms with better accuracy and forecasts the sales of each product in the store with instant data characteristics for the first time.

Findings

The analysis using Jingdong dataset proves the effectiveness of the proposed prediction method. The effectiveness of the proposed method is enhanced and the accuracy that instant data as input is found to be better compared with the model that lagged data as input. The root means squared error and mean absolute error of the proposed model are found to be around 11.9 and 8.23. According to the sales prediction of each product, the resource can be arranged in advance, and the marketing strategy of product positioning, product display optimization, inventory management and product promotion is designed for online stores.

Originality/value

The paper proposes and implements a new model, M-GNA-XGBOOST, to predict sales of each product for online stores. Our work provides reference and enlightenment for the establishment of accurate sales-based digital marketing strategies for online stores.

Details

Data Technologies and Applications, vol. 55 no. 5
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 11 October 2019

Yuchuan Wu, Shengfeng Qi, Feng Hu, Shuangbao Ma, Wen Mao and Wei Li

In human action recognition based on wearable sensors, most previous studies have focused on a single type of sensor and single classifier. This study aims to use a wearable…

Abstract

Purpose

In human action recognition based on wearable sensors, most previous studies have focused on a single type of sensor and single classifier. This study aims to use a wearable sensor based on flexible sensors and a tri-axial accelerometer to collect action data of elderly people. It uses a statistical modeling approach based on the ensemble algorithm to classify actions and verify its validity.

Design/methodology/approach

Nine types of daily actions were collected by the wearable sensor device from a group of elderly volunteers, and the time-domain features of the action sequences were extracted. The dimensionality of the feature vectors was reduced by linear discriminant analysis. An ensemble learning method based on XGBoost was used to build a model of elderly action recognition. Its performance was compared with the action recognition rate of other algorithms based on the Boosting algorithm, and with the accuracy of single classifier models.

Findings

The effectiveness of the method was validated by three experiments. The results show that XGBoost is able to classify nine daily actions of the elderly and achieve an average recognition rate of 94.8 per cent, which is superior to single classifiers and to other ensemble algorithms.

Practical implications

The research could have important implications for health care, including the treatment and rehabilitation of the elderly, and the prevention of falls.

Originality/value

Instead of using a single type of sensor, this research used a wearable sensor to obtain daily action data of the elderly. The results show that, by using the appropriate method, the device can obtain detailed data of joint action at a low cost. Comparing differences in performance, it was concluded that XGBoost is the most suitable algorithm for building a model of elderly action recognition. This method, together with a wearable sensor, can provide key data and accurate feedback information to monitor the elderly in their rehabilitation activities.

Details

Sensor Review, vol. 39 no. 6
Type: Research Article
ISSN: 0260-2288

Keywords

Article
Publication date: 30 November 2021

Julián Martínez-Vargas, Pedro Carmona and Pol Torrelles

The purpose of this paper is to study the influence of different quantitative (traditionally used) and qualitative variables, such as the possible negative effect in determined…

Abstract

Purpose

The purpose of this paper is to study the influence of different quantitative (traditionally used) and qualitative variables, such as the possible negative effect in determined periods of certain socio-political factors on share price formation.

Design/methodology/approach

We first analyse descriptively the evolution of the Ibex-35 in recent years and compare it with other international benchmark indices. Bellow, two techniques have been compared: a classic linear regression statistical model (GLM) and a method based on machine learning techniques called Extreme Gradient Boosting (XGBoost).

Findings

XGBoost yields a very accurate market value prediction model that clearly outperforms the other, with a coefficient of determination close to 90%, calculated on validation sets.

Practical implications

According to our analysis, individual accounts are equally or more important than consolidated information in predicting the behaviour of share prices. This would justify Spain maintaining the obligation to present individual interim financial statements, which does not happen in other European Union countries because IAS 34 only stipulates consolidated interim financial statements.

Social implications

The descriptive analysis allows us to see how the Ibex-35 has moved away from international trends, especially in periods in which some relevant socio-political events occurred, such as the independence referendum in Catalonia, the double elections of 2019 or the early handling of the Covid-19 pandemic in 2020.

Originality/value

Compared to other variables, the XGBoost model assigns little importance to socio-political factors when it comes to share price formation; however, this model explains 89.33% of its variance.

Propósito

El propósito de este artículo es estudiar la influencia de diferentes variables cuantitativas (tradicionalmente usadas) y cualitativas, como la posible influencia negativa en determinados períodos de ciertos factores sociopolíticos, sobre la formación del precio de.

Diseño/metodología/enfoque

Primero analizamos de forma descriptiva la evolución del Ibex-35 en los últimos años y la comparamos con la de otros índices internacionales de referencia. A continuación, se han contrastado dos técnicas: un modelo estadístico clásico de regresión lineal (GLM) y un método basado en el aprendizaje automático denominado Extreme Gradient Boosting (XGBoost).

Resultados

XGBoost nos permite obtener un modelo de predicción del valor de mercado muy preciso y claramente superior al otro, con un coeficiente de determinación cercano al 90%, calculado sobre las muestras de validación.

Implicaciones prácticas

De acuerdo con nuestro análisis, la información contable individual es igual o más importante que la consolidada para predecir el comportamiento del precio de las acciones. Esto justificaría que España mantenga la obligación de presentar estados financieros intermedios individuales, lo que no ocurre en otros países de la Unión Europea porque la NIC 34 solo obliga a realizar estados financieros intermedios consolidados.

Implicaciones sociales

El análisis descriptivo permite ver cómo el Ibex-35 se ha alejado de las tendencias internacionales, especialmente en periodos en los que se produjo algún hecho sociopolítico relevante, como el referéndum de autodeterminación de Cataluña, el doble proceso electoral de 2019 o la gestión inicial de la pandemia generada por el Covid-19.

Originalidad/valor

En comparación con otras variables, el modelo XGBoost asigna poca importancia a los factores sociopolíticos cuando se trata de la formación del precio de las acciones; sin embargo, este modelo explica el 89.33% de su varianza.

Details

Academia Revista Latinoamericana de Administración, vol. 35 no. 1
Type: Research Article
ISSN: 1012-8255

Keywords

Article
Publication date: 27 September 2023

Myongjee Yoo, Ashok K. Singh and Noah Loewy

The purpose of this study is to develop a model that accurately forecasts hotel room cancelations and further determines the key cancelation drivers.

Abstract

Purpose

The purpose of this study is to develop a model that accurately forecasts hotel room cancelations and further determines the key cancelation drivers.

Design/methodology/approach

Predictive modeling, specifically the machine learning methods, is used to forecast room cancelations and identify the main cancelation factors.

Findings

By using three different classification algorithms, this study demonstrates that hotel room cancelation can be accurately predicted using XGBoost, as well as the ensemble method involving Support Vector Machine, Random Forest and XGBoost.

Originality/value

This study attempted to forecast hotel room cancelations by applying a relatively new method, machine learning. By implementing predictive modeling, one of the most emerging and innovative research methods, this study ultimately provides prediction suggestions in various aspects and levels for hotel management operations.

研究目的

本研究旨在开发一个能够准确预测酒店客房取消的模型, 并进一步确定主要的取消因素。

研究方法

采用预测建模, 具体来说是机器学习方法, 来预测客房取消, 并识别主要的取消因素。

研究发现

通过使用三种不同的分类算法, 本研究表明使用XGBoost以及涉及支持向量机、随机森林和XGBoost的集成方法可以准确预测酒店客房取消。

研究创新

本研究尝试通过应用相对较新的方法, 即机器学习, 来预测酒店客房取消。通过实施预测建模, 这是目前新兴和创新的研究方法之一, 本研究最终为酒店管理运营在各个方面和层面提供了预测建议。

Details

Journal of Hospitality and Tourism Technology, vol. 15 no. 1
Type: Research Article
ISSN: 1757-9880

Keywords

Article
Publication date: 26 December 2023

Farshad Peiman, Mohammad Khalilzadeh, Nasser Shahsavari-Pour and Mehdi Ravanshadnia

Earned value management (EVM)–based models for estimating project actual duration (AD) and cost at completion using various methods are continuously developed to improve the…

Abstract

Purpose

Earned value management (EVM)–based models for estimating project actual duration (AD) and cost at completion using various methods are continuously developed to improve the accuracy and actualization of predicted values. This study primarily aimed to examine natural gradient boosting (NGBoost-2020) with the classification and regression trees (CART) base model (base learner). To the best of the authors' knowledge, this concept has never been applied to EVM AD forecasting problem. Consequently, the authors compared this method to the single K-nearest neighbor (KNN) method, the ensemble method of extreme gradient boosting (XGBoost-2016) with the CART base model and the optimal equation of EVM, the earned schedule (ES) equation with the performance factor equal to 1 (ES1). The paper also sought to determine the extent to which the World Bank's two legal factors affect countries and how the two legal causes of delay (related to institutional flaws) influence AD prediction models.

Design/methodology/approach

In this paper, data from 30 construction projects of various building types in Iran, Pakistan, India, Turkey, Malaysia and Nigeria (due to the high number of delayed projects and the detrimental effects of these delays in these countries) were used to develop three models. The target variable of the models was a dimensionless output, the ratio of estimated duration to completion (ETC(t)) to planned duration (PD). Furthermore, 426 tracking periods were used to build the three models, with 353 samples and 23 projects in the training set, 73 patterns (17% of the total) and six projects (21% of the total) in the testing set. Furthermore, 17 dimensionless input variables were used, including ten variables based on the main variables and performance indices of EVM and several other variables detailed in the study. The three models were subsequently created using Python and several GitHub-hosted codes.

Findings

For the testing set of the optimal model (NGBoost), the better percentage mean (better%) of the prediction error (based on projects with a lower error percentage) of the NGBoost compared to two KNN and ES1 single models, as well as the total mean absolute percentage error (MAPE) and mean lags (MeLa) (indicating model stability) were 100, 83.33, 5.62 and 3.17%, respectively. Notably, the total MAPE and MeLa for the NGBoost model testing set, which had ten EVM-based input variables, were 6.74 and 5.20%, respectively. The ensemble artificial intelligence (AI) models exhibited a much lower MAPE than ES1. Additionally, ES1 was less stable in prediction than NGBoost. The possibility of excessive and unusual MAPE and MeLa values occurred only in the two single models. However, on some data sets, ES1 outperformed AI models. NGBoost also outperformed other models, especially single models for most developing countries, and was more accurate than previously presented optimized models. In addition, sensitivity analysis was conducted on the NGBoost predicted outputs of 30 projects using the SHapley Additive exPlanations (SHAP) method. All variables demonstrated an effect on ETC(t)/PD. The results revealed that the most influential input variables in order of importance were actual time (AT) to PD, regulatory quality (RQ), earned duration (ED) to PD, schedule cost index (SCI), planned complete percentage, rule of law (RL), actual complete percentage (ACP) and ETC(t) of the ES optimal equation to PD. The probabilistic hybrid model was selected based on the outputs predicted by the NGBoost and XGBoost models and the MAPE values from three AI models. The 95% prediction interval of the NGBoost–XGBoost model revealed that 96.10 and 98.60% of the actual output values of the testing and training sets are within this interval, respectively.

Research limitations/implications

Due to the use of projects performed in different countries, it was not possible to distribute the questionnaire to the managers and stakeholders of 30 projects in six developing countries. Due to the low number of EVM-based projects in various references, it was unfeasible to utilize other types of projects. Future prospects include evaluating the accuracy and stability of NGBoost for timely and non-fluctuating projects (mostly in developed countries), considering a greater number of legal/institutional variables as input, using legal/institutional/internal/inflation inputs for complex projects with extremely high uncertainty (such as bridge and road construction) and integrating these inputs and NGBoost with new technologies (such as blockchain, radio frequency identification (RFID) systems, building information modeling (BIM) and Internet of things (IoT)).

Practical implications

The legal/intuitive recommendations made to governments are strict control of prices, adequate supervision, removal of additional rules, removal of unfair regulations, clarification of the future trend of a law change, strict monitoring of property rights, simplification of the processes for obtaining permits and elimination of unnecessary changes particularly in developing countries and at the onset of irregular projects with limited information and numerous uncertainties. Furthermore, the managers and stakeholders of this group of projects were informed of the significance of seven construction variables (institutional/legal external risks, internal factors and inflation) at an early stage, using time series (dynamic) models to predict AD, accurate calculation of progress percentage variables, the effectiveness of building type in non-residential projects, regular updating inflation during implementation, effectiveness of employer type in the early stage of public projects in addition to the late stage of private projects, and allocating reserve duration (buffer) in order to respond to institutional/legal risks.

Originality/value

Ensemble methods were optimized in 70% of references. To the authors' knowledge, NGBoost from the set of ensemble methods was not used to estimate construction project duration and delays. NGBoost is an effective method for considering uncertainties in irregular projects and is often implemented in developing countries. Furthermore, AD estimation models do fail to incorporate RQ and RL from the World Bank's worldwide governance indicators (WGI) as risk-based inputs. In addition, the various WGI, EVM and inflation variables are not combined with substantial degrees of delay institutional risks as inputs. Consequently, due to the existence of critical and complex risks in different countries, it is vital to consider legal and institutional factors. This is especially recommended if an in-depth, accurate and reality-based method like SHAP is used for analysis.

Details

Engineering, Construction and Architectural Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0969-9988

Keywords

1 – 10 of 303