Search results
1 – 10 of over 2000Christian Nnaemeka Egwim, Hafiz Alaka, Youlu Pan, Habeeb Balogun, Saheed Ajayi, Abdul Hye and Oluwapelumi Oluwaseun Egunjobi
The study aims to develop a multilayer high-effective ensemble of ensembles predictive model (stacking ensemble) using several hyperparameter optimized ensemble machine learning…
Abstract
Purpose
The study aims to develop a multilayer high-effective ensemble of ensembles predictive model (stacking ensemble) using several hyperparameter optimized ensemble machine learning (ML) methods (bagging and boosting ensembles) trained with high-volume data points retrieved from Internet of Things (IoT) emission sensors, time-corresponding meteorology and traffic data.
Design/methodology/approach
For a start, the study experimented big data hypothesis theory by developing sample ensemble predictive models on different data sample sizes and compared their results. Second, it developed a standalone model and several bagging and boosting ensemble models and compared their results. Finally, it used the best performing bagging and boosting predictive models as input estimators to develop a novel multilayer high-effective stacking ensemble predictive model.
Findings
Results proved data size to be one of the main determinants to ensemble ML predictive power. Second, it proved that, as compared to using a single algorithm, the cumulative result from ensemble ML algorithms is usually always better in terms of predicted accuracy. Finally, it proved stacking ensemble to be a better model for predicting PM2.5 concentration level than bagging and boosting ensemble models.
Research limitations/implications
A limitation of this study is the trade-off between performance of this novel model and the computational time required to train it. Whether this gap can be closed remains an open research question. As a result, future research should attempt to close this gap. Also, future studies can integrate this novel model to a personal air quality messaging system to inform public of pollution levels and improve public access to air quality forecast.
Practical implications
The outcome of this study will aid the public to proactively identify highly polluted areas thus potentially reducing pollution-associated/ triggered COVID-19 (and other lung diseases) deaths/ complications/ transmission by encouraging avoidance behavior and support informed decision to lock down by government bodies when integrated into an air pollution monitoring system
Originality/value
This study fills a gap in literature by providing a justification for selecting appropriate ensemble ML algorithms for PM2.5 concentration level predictive modeling. Second, it contributes to the big data hypothesis theory, which suggests that data size is one of the most important factors of ML predictive capability. Third, it supports the premise that when using ensemble ML algorithms, the cumulative output is usually always better in terms of predicted accuracy than using a single algorithm. Finally developing a novel multilayer high-performant hyperparameter optimized ensemble of ensembles predictive model that can accurately predict PM2.5 concentration levels with improved model interpretability and enhanced generalizability, as well as the provision of a novel databank of historic pollution data from IoT emission sensors that can be purchased for research, consultancy and policymaking.
Details
Keywords
Xin Huang, Ting Tang, Yu Ning Luo and Ren Wang
This study aims to examine the impact of board characteristics on firm performance while also exploring the influential mechanisms that help Chinese listed companies establish…
Abstract
Purpose
This study aims to examine the impact of board characteristics on firm performance while also exploring the influential mechanisms that help Chinese listed companies establish effective boards of directors and strengthen their corporate governance mechanisms.
Design/methodology/approach
This paper uses machine learning methods to investigate the predictive ability of the board of directors' characteristics on firm performance based on the data from Chinese A-share listed companies on the Shanghai and Shenzhen stock exchanges in China during 2008–2021. This study further analyzes board characteristics with relatively strong predictive ability and their predictive models on firm performance.
Findings
The results show that nonlinear machine learning methods are more effective than traditional linear models in analyzing the impact of board characteristics on Chinese firm performance. Among the series characteristics of the board of directors, the contribution ratio in prediction from directors compensation, director shareholding ratio, the average age of directors and directors' educational level are significant, and these characteristics have a roughly nonlinear correlation to the prediction of firm performance; the improvement of the predictive ability of board characteristics on firm performance in state-owned enterprises in China performs better than that in private enterprises.
Practical implications
The findings of this study provide valuable suggestions for enriching the theory of board governance, strengthening board construction and optimizing the effectiveness of board governance. Furthermore, these impacts can serve as a valuable reference for board construction and selection, aiding in the rational selection of boards to establish an efficient and high-performing board of directors.
Originality/value
The study findings unequivocally demonstrate the superiority of nonlinear machine learning approaches over traditional linear models in examining the relationship between board characteristics and firm performance in China. Within the suite of board characteristics, director compensation, shareholding ratio, average age and educational level are particularly noteworthy, consistently demonstrating strong, nonlinear associations with firm performance. Within the suite of board characteristics, director compensation, shareholding ratio, average age and educational level are particularly noteworthy, consistently demonstrating strong, nonlinear associations with firm performance. The study reveals that the predictive performance of board attributes is generally more robust for state-owned enterprises in China in comparison to their counterparts in the private sector.
Details
Keywords
Diego Espinosa Gispert, Ibrahim Yitmen, Habib Sadri and Afshin Taheri
The purpose of this research is to develop a framework of an ontology-based Asset Information Model (AIM) for a Digital Twin (DT) platform and enhance predictive maintenance…
Abstract
Purpose
The purpose of this research is to develop a framework of an ontology-based Asset Information Model (AIM) for a Digital Twin (DT) platform and enhance predictive maintenance practices in building facilities that could enable proactive and data-driven decision-making during the Operation and Maintenance (O&M) process.
Design/methodology/approach
A scoping literature review was accomplished to establish the theoretical foundation for the current investigation. A study on developing an ontology-based AIM for predictive maintenance in building facilities was conducted. Semi-structured interviews were conducted with industry professionals to gather qualitative data for ontology-based AIM framework validation and insights.
Findings
The research findings indicate that while the development of ontology faced challenges in defining missing entities and relations in the context of predictive maintenance, insights gained from the interviews enabled the establishment of a comprehensive framework for ontology-based AIM adoption in the Facility Management (FM) sector.
Practical implications
The proposed ontology-based AIM has the potential to enable proactive and data-driven decision-making during the process, optimizing predictive maintenance practices and ultimately enhancing energy efficiency and sustainability in the building industry.
Originality/value
The research contributes to a practical guide for ontology development processes and presents a framework of an Ontology-based AIM for a Digital Twin platform.
Details
Keywords
Indranil Ghosh, Rabin K. Jana and Dinesh K. Sharma
Owing to highly volatile and chaotic external events, predicting future movements of cryptocurrencies is a challenging task. This paper advances a granular hybrid predictive…
Abstract
Purpose
Owing to highly volatile and chaotic external events, predicting future movements of cryptocurrencies is a challenging task. This paper advances a granular hybrid predictive modeling framework for predicting the future figures of Bitcoin (BTC), Litecoin (LTC), Ethereum (ETH), Stellar (XLM) and Tether (USDT) during normal and pandemic regimes.
Design/methodology/approach
Initially, the major temporal characteristics of the price series are examined. In the second stage, ensemble empirical mode decomposition (EEMD) and maximal overlap discrete wavelet transformation (MODWT) are used to decompose the original time series into two distinct sets of granular subseries. In the third stage, long- and short-term memory network (LSTM) and extreme gradient boosting (XGB) are applied to the decomposed subseries to estimate the initial forecasts. Lastly, sequential quadratic programming (SQP) is used to fetch the forecast by combining the initial forecasts.
Findings
Rigorous performance assessment and the outcome of the Diebold-Mariano’s pairwise statistical test demonstrate the efficacy of the suggested predictive framework. The framework yields commendable predictive performance during the COVID-19 pandemic timeline explicitly as well. Future trends of BTC and ETH are found to be relatively easier to predict, while USDT is relatively difficult to predict.
Originality/value
The robustness of the proposed framework can be leveraged for practical trading and managing investment in crypto market. Empirical properties of the temporal dynamics of chosen cryptocurrencies provide deeper insights.
Details
Keywords
Xiao Yao, Dongxiao Wu, Zhiyong Li and Haoxiang Xu
Since stock return and volatility matters to investors, this study proposes to incorporate the textual sentiment of annual reports in stock price crash risk prediction.
Abstract
Purpose
Since stock return and volatility matters to investors, this study proposes to incorporate the textual sentiment of annual reports in stock price crash risk prediction.
Design/methodology/approach
Specific sentences gathered from management discussions and their subsequent analyses are tokenized and transformed into numeric vectors using textual mining techniques, and then the Naïve Bayes method is applied to score the sentiment, which is used as an input variable for crash risk prediction. The results are compared between a collection of predictive models, including linear regression (LR) and machine learning techniques.
Findings
The experimental results find that those predictive models that incorporate textual sentiment significantly outperform the baseline models with only accounting and market variables included. These conclusions hold when crash risk is proxied by either the negative skewness of the return distribution or down-to-up volatility (DUVOL).
Research limitations/implications
It should be noted that the authors' study focuses on examining the predictive power of textual sentiment in crash risk prediction, while other dimensions of textual features such as readability and thematic contents are not considered. More analysis is needed to explore the predictive power of textual features from various dimensions, with the most recent sample data included in future studies.
Originality/value
The authors' study provides implications for the information value of textual data in financial analysis and risk management. It suggests that the soft information contained within annual reports may prove informative in crash risk prediction, and the incorporation of textual sentiment provides an incremental improvement in overall predictive performance.
Details
Keywords
Ali A. Awad, Radhi Al-Hamadeen and Malek Alsharairi
This paper aims to examine and compare the dividend ratios’ statistical and economic ability to predict the equity premium in the UK and US markets and two US sub-indices (S&P 500…
Abstract
Purpose
This paper aims to examine and compare the dividend ratios’ statistical and economic ability to predict the equity premium in the UK and US markets and two US sub-indices (S&P 500 Growth and S&P 500 Value).
Design/methodology/approach
In this paper, the authors use the linear regression models to examine the dividend ratios’ statistical ability to predict the equity premium. The in-sample and out-of-sample approaches, including Diebold and Mariano (1995) statistics, and Goyal and Welch’s (2003) graphical approach, are used. Also, the mean-variance analysis is used to test the economic significance.
Findings
The paper findings indicate that the dividend ratios have in-sample and out-of-sample predictive abilities in both UK and US markets and both US sub-indices. However, the results show that the dividend ratios have a less impressive predictive ability in the US market compared to the UK market and less in the US value index than the US growth index. This could indicate that there is no relation between the number of companies that distribute dividends in each index and the informativeness of dividends ratios. Furthermore, the tests show the dividend ratios’ predictive ability departure during particular periods and in some indices.
Research limitations/implications
Results and implications of this research are exclusively applied to the US and UK markets. These results can also be applied with caution to other markets, taking into consideration the distinctive characteristics of these markets.
Practical implications
Results revealed in this paper imply that the investors in any of the indices may experience economic gain by adopting a dynamic trading strategy using the information content of the dividend ratios prediction models instead of the benchmark model, which is the prevailing simple moving average model.
Originality/value
This paper adds value through testing the prediction models’ economic significance in two well-developed markets, in addition to exploring the relationship between the number of companies distributing cash dividends and the dividends ratio prediction ability. Unlike most of the previous studies in which dividend ratios’ prediction ability is attributed to the number of companies that distribute dividends in the market, this paper denied this interpretation by studying two S&P 500 sub-indices. To the best of the authors’ knowledge, this is the first study to test the prediction models’ ability for these sub-indices.
Details
Keywords
Siqi Wang, Jun-Hwa Cheah, Chee Yew Wong and T. Ramayah
This study aims to evaluate the usage of partial least squares structural equation modeling (PLS-SEM) in journals related to logistics and supply chain management (LSCM).
Abstract
Purpose
This study aims to evaluate the usage of partial least squares structural equation modeling (PLS-SEM) in journals related to logistics and supply chain management (LSCM).
Design/methodology/approach
Based on a structured literature review approach, the authors reviewed 401 articles in the field of LSCM applying PLS-SEM published in 15 major journals between 2014 and 2022. The analysis focused on reasons for using PLS-SEM, measurement model and structural model evaluation criteria, advanced analysis techniques and reporting practices.
Findings
LSCM researchers sometimes did not clarify the reasons for using PLS-SEM, such as sample size, complex models and non-normal distributions. Additionally, most articles exhibit limited use of measurement models and structural model evaluation techniques, leading to inappropriate use of assessment criteria. Furthermore, progress in the practical implementation of advanced analysis techniques is slow, and there is a need for improved transparency in reporting analysis algorithms.
Originality/value
This study contributes to the field of LSCM by providing clear criteria and steps for using PLS-SEM, enriching the understanding and advancement of research methodologies in this field.
Details
Keywords
José Bocoya-Maline, Arturo Calvo-Mora and Manuel Rey Moreno
Drawing on resource and capability theory, this study aimed to analyze the relationship between the dynamic capabilities (DC), the knowledge management (KM) process (KMP) and…
Abstract
Purpose
Drawing on resource and capability theory, this study aimed to analyze the relationship between the dynamic capabilities (DC), the knowledge management (KM) process (KMP) and results in customers and people. More specifically, the study argues that the KM process mediates the relationship between DC and the results outlined above. In addition, a predictive analysis is carried out that demonstrates the relevance of the KM process in the model.
Design/methodology/approach
The study sample is made up of 118 Spanish organizations that have some kind of recognition of excellence awarded by the European Foundation for Quality Management (EFQM). Partial least squares methodology is used to validate the research model, the hypothesis testing and the predictive analysis.
Findings
The results show that organizations which leverage the DC through the KMP improve customer and people outcomes. Moreover, the predictive power is higher when the KMPmediates the relationship between the DC and the results.
Originality/value
There is no consensus in the literature on the relationship between DC, KM and performance. Moreover, there are also not enough papers that study KM or DC through the dimensions that define these constructs or variables. Given this need, this work considers the KMP according to the stages of knowledge creation, storage, transfer and application. Similarly, DC is dimensioned in sensing, learning, integrating and coordinating capabilities. These, as reconfigurators of knowledge assets, influence the KMP. Accordingly, the empirical model connects these knowledge domains and analyses their link to outcomes.
Details
Keywords
Christian Nnaemeka Egwim, Hafiz Alaka, Oluwapelumi Oluwaseun Egunjobi, Alvaro Gomes and Iosif Mporas
This study aims to compare and evaluate the application of commonly used machine learning (ML) algorithms used to develop models for assessing energy efficiency of buildings.
Abstract
Purpose
This study aims to compare and evaluate the application of commonly used machine learning (ML) algorithms used to develop models for assessing energy efficiency of buildings.
Design/methodology/approach
This study foremostly combined building energy efficiency ratings from several data sources and used them to create predictive models using a variety of ML methods. Secondly, to test the hypothesis of ensemble techniques, this study designed a hybrid stacking ensemble approach based on the best performing bagging and boosting ensemble methods generated from its predictive analytics.
Findings
Based on performance evaluation metrics scores, the extra trees model was shown to be the best predictive model. More importantly, this study demonstrated that the cumulative result of ensemble ML algorithms is usually always better in terms of predicted accuracy than a single method. Finally, it was discovered that stacking is a superior ensemble approach for analysing building energy efficiency than bagging and boosting.
Research limitations/implications
While the proposed contemporary method of analysis is assumed to be applicable in assessing energy efficiency of buildings within the sector, the unique data transformation used in this study may not, as typical of any data driven model, be transferable to the data from other regions other than the UK.
Practical implications
This study aids in the initial selection of appropriate and high-performing ML algorithms for future analysis. This study also assists building managers, residents, government agencies and other stakeholders in better understanding contributing factors and making better decisions about building energy performance. Furthermore, this study will assist the general public in proactively identifying buildings with high energy demands, potentially lowering energy costs by promoting avoidance behaviour and assisting government agencies in making informed decisions about energy tariffs when this novel model is integrated into an energy monitoring system.
Originality/value
This study fills a gap in the lack of a reason for selecting appropriate ML algorithms for assessing building energy efficiency. More importantly, this study demonstrated that the cumulative result of ensemble ML algorithms is usually always better in terms of predicted accuracy than a single method.
Details
Keywords
Yoonjae Hwang, Sungwon Jung and Eun Joo Park
Initiator crimes, also known as near-repeat crimes, occur in places with known risk factors and vulnerabilities based on prior crime-related experiences or information…
Abstract
Purpose
Initiator crimes, also known as near-repeat crimes, occur in places with known risk factors and vulnerabilities based on prior crime-related experiences or information. Consequently, the environment in which initiator crimes occur might be different from more general crime environments. This study aimed to analyse the differences between the environments of initiator crimes and general crimes, confirming the need for predicting initiator crimes.
Design/methodology/approach
We compared predictive models using data corresponding to initiator crimes and all residential burglaries without considering repetitive crime patterns as dependent variables. Using random forest and gradient boosting, representative ensemble models and predictive models were compared utilising various environmental factor data. Subsequently, we evaluated the performance of each predictive model to derive feature importance and partial dependence based on a highly predictive model.
Findings
By analysing environmental factors affecting overall residential burglary and initiator crimes, we observed notable differences in high-importance variables. Further analysis of the partial dependence of total residential burglary and initiator crimes based on these variables revealed distinct impacts on each crime. Moreover, initiator crimes took place in environments consistent with well-known theories in the field of environmental criminology.
Originality/value
Our findings indicate the possibility that results that do not appear through the existing theft crime prediction method will be identified in the initiator crime prediction model. Emphasising the importance of investigating the environments in which initiator crimes occur, this study underscores the potential of artificial intelligence (AI)-based approaches in creating a safe urban environment. By effectively preventing potential crimes, AI-driven prediction of initiator crimes can significantly contribute to enhancing urban safety.
Details