Search results
1 – 10 of 33Sharifah Heryati Syed Nor, Shafinar Ismail and Bee Wah Yap
Personal bankruptcy is on the rise in Malaysia. The Insolvency Department of Malaysia reported that personal bankruptcy has increased since 2007, and the total accumulated…
Abstract
Purpose
Personal bankruptcy is on the rise in Malaysia. The Insolvency Department of Malaysia reported that personal bankruptcy has increased since 2007, and the total accumulated personal bankruptcy cases stood at 131,282 in 2014. This is indeed an alarming issue because the increasing number of personal bankruptcy cases will have a negative impact on the Malaysian economy, as well as on the society. From the aspect of individual’s personal economy, bankruptcy minimizes their chances of securing a job. Apart from that, their account will be frozen, lost control on their assets and properties and not allowed to start any business nor be a part of any company’s management. Bankrupts also will be denied from any loan application, restricted from travelling overseas and cannot act as a guarantor. This paper aims to investigate this problem by developing the personal bankruptcy prediction model using the decision tree technique.
Design/methodology/approach
In this paper, bankrupt is defined as terminated members who failed to settle their loans. The sample comprised of 24,546 cases with 17 per cent settled cases and 83 per cent terminated cases. The data included a dependent variable, i.e. bankruptcy status (Y = 1(bankrupt), Y = 0 (non-bankrupt)) and 12 predictors. SAS Enterprise Miner 14.1 software was used to develop the decision tree model.
Findings
Upon completion, this study succeeds to come out with the profiles of bankrupts, reliable personal bankruptcy scoring model and significant variables of personal bankruptcy.
Practical implications
This decision tree model is possible for patent and income generation. Financial institutions are able to use this model for potential borrowers to predict their tendency toward personal bankruptcy.
Social implications
Create awareness to society on significant variables of personal bankruptcy so that they can avoid being a bankrupt.
Originality/value
This decision tree model is able to facilitate and assist financial institutions in evaluating and assessing their potential borrower. It helps to identify potential defaulting borrowers. It also can assist financial institutions in implementing the right strategies to avoid defaulting borrowers.
Details
Keywords
R. Shashikant and P. Chetankumar
Cardiac arrest is a severe heart anomaly that results in billions of annual casualties. Smoking is a specific hazard factor for cardiovascular pathology, including coronary heart…
Abstract
Cardiac arrest is a severe heart anomaly that results in billions of annual casualties. Smoking is a specific hazard factor for cardiovascular pathology, including coronary heart disease, but data on smoking and heart death not earlier reviewed. The Heart Rate Variability (HRV) parameters used to predict cardiac arrest in smokers using machine learning technique in this paper. Machine learning is a method of computing experience based on automatic learning and enhances performances to increase prognosis. This study intends to compare the performance of logistical regression, decision tree, and random forest model to predict cardiac arrest in smokers. In this paper, a machine learning technique implemented on the dataset received from the data science research group MITU Skillogies Pune, India. To know the patient has a chance of cardiac arrest or not, developed three predictive models as 19 input feature of HRV indices and two output classes. These model evaluated based on their accuracy, precision, sensitivity, specificity, F1 score, and Area under the curve (AUC). The model of logistic regression has achieved an accuracy of 88.50%, precision of 83.11%, the sensitivity of 91.79%, the specificity of 86.03%, F1 score of 0.87, and AUC of 0.88. The decision tree model has arrived with an accuracy of 92.59%, precision of 97.29%, the sensitivity of 90.11%, the specificity of 97.38%, F1 score of 0.93, and AUC of 0.94. The model of the random forest has achieved an accuracy of 93.61%, precision of 94.59%, the sensitivity of 92.11%, the specificity of 95.03%, F1 score of 0.93 and AUC of 0.95. The random forest model achieved the best accuracy classification, followed by the decision tree, and logistic regression shows the lowest classification accuracy.
Details
Keywords
Reshmy Krishnan, Shantha Kumari, Ali Al Badi, Shermina Jeba and Menila James
Students pursuing different professional courses at the higher education level during 2021–2022 saw the first-time occurrence of a pandemic in the form of coronavirus disease 2019…
Abstract
Purpose
Students pursuing different professional courses at the higher education level during 2021–2022 saw the first-time occurrence of a pandemic in the form of coronavirus disease 2019 (COVID-19), and their mental health was affected. Many works are available in the literature to assess mental health severity. However, it is necessary to identify the affected students early for effective treatment.
Design/methodology/approach
Predictive analytics, a part of machine learning (ML), helps with early identification based on mental health severity levels to aid clinical psychologists. As a case study, engineering and medical course students were comparatively analysed in this work as they have rich course content and a stricter evaluation process than other streams. The methodology includes an online survey that obtains demographic details, academic qualifications, family details, etc. and anxiety and depression questions using the Hospital Anxiety and Depression Scale (HADS). The responses acquired through social media networks are analysed using ML algorithms – support vector machines (SVMs) (robust handling of health information) and J48 decision tree (DT) (interpretability/comprehensibility). Also, random forest is used to identify the predictors for anxiety and depression.
Findings
The results show that the support vector classifier produces outperforming results with classification accuracy of 100%, 1.0 precision and 1.0 recall, followed by the J48 DT classifier with 96%. It was found that medical students are affected by anxiety and depression marginally more when compared with engineering students.
Research limitations/implications
The entire work is dependent on the social media-displayed online questionnaire, and the participants were not met in person. This indicates that the response rate could not be evaluated appropriately. Due to the medical restrictions imposed by COVID-19, which remain in effect in 2022, this is the only method found to collect primary data from college students. Additionally, students self-selected themselves to participate in this survey, which raises the possibility of selection bias.
Practical implications
The responses acquired through social media networks are analysed using ML algorithms. This will be a big support for understanding the mental issues of the students due to COVID-19 and can taking appropriate actions to rectify them. This will improve the quality of the learning process in higher education in Oman.
Social implications
Furthermore, this study aims to provide recommendations for mental health screening as a regular practice in educational institutions to identify undetected students.
Originality/value
Comparing the mental health issues of two professional course students is the novelty of this work. This is needed because both studies require practical learning, long hours of work, etc.
Details
Keywords
Weifei Hu, Tongzhou Zhang, Xiaoyu Deng, Zhenyu Liu and Jianrong Tan
Digital twin (DT) is an emerging technology that enables sophisticated interaction between physical objects and their virtual replicas. Although DT has recently gained significant…
Abstract
Digital twin (DT) is an emerging technology that enables sophisticated interaction between physical objects and their virtual replicas. Although DT has recently gained significant attraction in both industry and academia, there is no systematic understanding of DT from its development history to its different concepts and applications in disparate disciplines. The majority of DT literature focuses on the conceptual development of DT frameworks for a specific implementation area. Hence, this paper provides a state-of-the-art review of DT history, different definitions and models, and six types of key enabling technologies. The review also provides a comprehensive survey of DT applications from two perspectives: (1) applications in four product-lifecycle phases, i.e. product design, manufacturing, operation and maintenance, and recycling and (2) applications in four categorized engineering fields, including aerospace engineering, tunneling and underground engineering, wind engineering and Internet of things (IoT) applications. DT frameworks, characteristic components, key technologies and specific applications are extracted for each DT category in this paper. A comprehensive survey of the DT references reveals the following findings: (1) The majority of existing DT models only involve one-way data transfer from physical entities to virtual models and (2) There is a lack of consideration of the environmental coupling, which results in the inaccurate representation of the virtual components in existing DT models. Thus, this paper highlights the role of environmental factor in DT enabling technologies and in categorized engineering applications. In addition, the review discusses the key challenges and provides future work for constructing DTs of complex engineering systems.
Details
Keywords
Elisabeth Ilie-Zudor, Anikó Ekárt, Zsolt Kemeny, Christopher Buckingham, Philip Welch and Laszlo Monostori
– The purpose of this paper is to examine challenges and potential of big data in heterogeneous business networks and relate these to an implemented logistics solution.
Abstract
Purpose
The purpose of this paper is to examine challenges and potential of big data in heterogeneous business networks and relate these to an implemented logistics solution.
Design/methodology/approach
The paper establishes an overview of challenges and opportunities of current significance in the area of big data, specifically in the context of transparency and processes in heterogeneous enterprise networks. Within this context, the paper presents how existing components and purpose-driven research were combined for a solution implemented in a nationwide network for less-than-truckload consignments.
Findings
Aside from providing an extended overview of today’s big data situation, the findings have shown that technical means and methods available today can comprise a feasible process transparency solution in a large heterogeneous network where legacy practices, reporting lags and incomplete data exist, yet processes are sensitive to inadequate policy changes.
Practical implications
The means introduced in the paper were found to be of utility value in improving process efficiency, transparency and planning in logistics networks. The particular system design choices in the presented solution allow an incremental introduction or evolution of resource handling practices, incorporating existing fragmentary, unstructured or tacit knowledge of experienced personnel into the theoretically founded overall concept.
Originality/value
The paper extends previous high-level view on the potential of big data, and presents new applied research and development results in a logistics application.
Details
Keywords
Choo Jun Tan, Ting Yee Lim, Chin Wei Bong and Teik Kooi Liew
The purpose of this paper is to propose a soft computing model based on multi-objective evolutionary algorithm (MOEA), namely, modified micro genetic algorithm (MmGA) coupled with…
Abstract
Purpose
The purpose of this paper is to propose a soft computing model based on multi-objective evolutionary algorithm (MOEA), namely, modified micro genetic algorithm (MmGA) coupled with a decision tree (DT)-based classifier, in classifying and optimising the students’ online interaction activities as classifier of student achievement. Subsequently, the results are transformed into useful information that may help educator in designing better learning instructions geared towards higher student achievement.
Design/methodology/approach
A soft computing model based on MOEA is proposed. It is tested on benchmark data pertaining to student activities and achievement obtained from the University of California at Irvine machine learning repository. Additional, a real-world case study in a distance learning institution, namely, Wawasan Open University in Malaysia has been conducted. The case study involves a total of 46 courses collected over 24 consecutive weeks with students across the entire regions in Malaysia and worldwide.
Findings
The proposed model obtains high classification accuracy rates at reduced number of features used. These results are transformed into useful information for the educational institution in our case study in an effort to improve student achievement. Whether benchmark or real-world case study, the proposed model successfully reduced the number features used by at least 48 per cent while achieving higher classification accuracy.
Originality/value
A soft computing model based on MOEA, namely, MmGA coupled with a DT-based classifier, in handling educational data is proposed.
Details
Keywords
Elias Shohei Kamimura, Anderson Rogério Faia Pinto and Marcelo Seido Nagano
This paper aims to present a literature review of the most recent optimisation methods applied to Credit Scoring Models (CSMs).
Abstract
Purpose
This paper aims to present a literature review of the most recent optimisation methods applied to Credit Scoring Models (CSMs).
Design/methodology/approach
The research methodology employed technical procedures based on bibliographic and exploratory analyses. A traditional investigation was carried out using the Scopus, ScienceDirect and Web of Science databases. The papers selection and classification took place in three steps considering only studies in English language and published in electronic journals (from 2008 to 2022). The investigation led up to the selection of 46 publications (10 presenting literature reviews and 36 proposing CSMs).
Findings
The findings showed that CSMs are usually formulated using Financial Analysis, Machine Learning, Statistical Techniques, Operational Research and Data Mining Algorithms. The main databases used by the researchers were banks and the University of California, Irvine. The analyses identified 48 methods used by CSMs, the main ones being: Logistic Regression (13%), Naive Bayes (10%) and Artificial Neural Networks (7%). The authors conclude that advances in credit score studies will require new hybrid approaches capable of integrating Big Data and Deep Learning algorithms into CSMs. These algorithms should have practical issues considered consider practical issues for improving the level of adaptation and performance demanded for the CSMs.
Practical implications
The results of this study might provide considerable practical implications for the application of CSMs. As it was aimed to demonstrate the application of optimisation methods, it is highly considerable that legal and ethical issues should be better adapted to CSMs. It is also suggested improvement of studies focused on micro and small companies for sales in instalment plans and commercial credit through the improvement or new CSMs.
Originality/value
The economic reality surrounding credit granting has made risk management a complex decision-making issue increasingly supported by CSMs. Therefore, this paper satisfies an important gap in the literature to present an analysis of recent advances in optimisation methods applied to CSMs. The main contribution of this paper consists of presenting the evolution of the state of the art and future trends in studies aimed at proposing better CSMs.
Details
Keywords
Khalid Iqbal and Muhammad Shehrayar Khan
In this digital era, email is the most pervasive form of communication between people. Many users become a victim of spam emails and their data have been exposed.
Abstract
Purpose
In this digital era, email is the most pervasive form of communication between people. Many users become a victim of spam emails and their data have been exposed.
Design/methodology/approach
Researchers contribute to solving this problem by a focus on advanced machine learning algorithms and improved models for detecting spam emails but there is still a gap in features. To achieve good results, features also play an important role. To evaluate the performance of applied classifiers, 10-fold cross-validation is used.
Findings
The results approve that the spam emails are correctly classified with the accuracy of 98.00% for the Support Vector Machine and 98.06% for the Artificial Neural Network as compared to other applied machine learning classifiers.
Originality/value
In this paper, Point-Biserial correlation is applied to each feature concerning the class label of the University of California Irvine (UCI) spambase email dataset to select the best features. Extensive experiments are conducted on selected features by training the different classifiers.
Details
Keywords
Tamoor Khan, Jiangtao Qiu, Ameen Banjar, Riad Alharbey, Ahmed Omar Alzahrani and Rashid Mehmood
The purpose of this paper is to assess the impacts on production of five fruit crops from 1961 to 2018 of energy use, CO2 emissions, farming areas and the labor force in China.
Abstract
Purpose
The purpose of this paper is to assess the impacts on production of five fruit crops from 1961 to 2018 of energy use, CO2 emissions, farming areas and the labor force in China.
Design/methodology/approach
This analysis applied the autoregressive distributed lag-bound testing (ARDL) approach, Granger causality method and Johansen co-integration test to predict long-term co-integration and relation between variables. Four machine learning methods are used for prediction of the accuracy of climate effect on fruit production.
Findings
The Johansen test findings have shown that the fruit crop growth, energy use, CO2 emissions, harvested land and labor force have a long-term co-integration relation. The outcome of the long-term use of CO2 emission and rural population has a negative influence on fruit crops. The energy consumption, harvested area, total fruit yield and agriculture labor force have a positive influence on six fruit crops. The long-run relationships reveal that a 1% increase in rural population and CO2 will decrease fruit crop production by −0.59 and −1.97. The energy consumption, fruit harvested area, total fruit yield and agriculture labor force will increase fruit crop production by 0.17%, 1.52%, 1.80% and 4.33%, respectively. Furthermore, uni-directional causality is correlated with the growth of fruit crops and energy consumption. Also, the results indicate that the bi-directional causality impact varies from CO2 emissions to agricultural areas to fruit crops.
Originality/value
This study also fills the literature gap in implementing ARDL for agricultural fruits of China, used machine learning methods to examine the impact of climate change and to explore this important issue.
Details
Keywords
Jeremy Whaley, Jinha Lee and Youn-Kyung Kim
The purpose of this paper is to investigate whether guests’ tipping motivations (i.e. server quality, social norm and food quality) and demographic characteristics (i.e. gender…
Abstract
Purpose
The purpose of this paper is to investigate whether guests’ tipping motivations (i.e. server quality, social norm and food quality) and demographic characteristics (i.e. gender, age and income) influence loyalty to the server in a restaurant.
Design/methodology/approach
Based on a national online survey consumer panel comprised of 468 participants, the authors utilized decision tree using R statistical software. Predictor variables were tipping motivations and demographic characteristics (age, gender and income). Target variable was loyalty to the server.
Findings
The findings suggest that social norm, food quality and income influence customers’ loyalty toward the same server on future visits. Social norm turned out to be the strongest predictor. If consumers did not have high social norm on tipping, their loyalty toward a particular server was affected by the combination of determinants such as server quality, social norm, income and food quality.
Research limitations/implications
Future research can identify or develop scales of tipping motivations with stronger reliabilities in the context of restaurants. Future research can also explore other demographic differences (e.g. ethnicity and sexual orientation) in tipping motivations and server loyalty.
Practical implications
Servers are indeed the primary contact point and they are in the most influential position with consumers. Overall, results of this study provide an interesting insight in that restaurant guests’ experience can be ruined by bad quality of food or can be mitigated by server quality. Thus, this research highlights a step-by-step process as to the actions that a server may perform and manage in order to enhance server loyalty.
Originality/value
Loyalty has been examined in the context of products, brands or service providers. This study focuses on loyalty toward a specific server, because the consideration of server–guest relationship provides both a compelling and timely area of study in that restaurants continue to look for unique ways to drive server–guest rapport and customer loyalty.
Details