Search results

1 – 10 of 331
Open Access
Article
Publication date: 28 July 2020

R. Shashikant and P. Chetankumar

Cardiac arrest is a severe heart anomaly that results in billions of annual casualties. Smoking is a specific hazard factor for cardiovascular pathology, including coronary heart…

2361

Abstract

Cardiac arrest is a severe heart anomaly that results in billions of annual casualties. Smoking is a specific hazard factor for cardiovascular pathology, including coronary heart disease, but data on smoking and heart death not earlier reviewed. The Heart Rate Variability (HRV) parameters used to predict cardiac arrest in smokers using machine learning technique in this paper. Machine learning is a method of computing experience based on automatic learning and enhances performances to increase prognosis. This study intends to compare the performance of logistical regression, decision tree, and random forest model to predict cardiac arrest in smokers. In this paper, a machine learning technique implemented on the dataset received from the data science research group MITU Skillogies Pune, India. To know the patient has a chance of cardiac arrest or not, developed three predictive models as 19 input feature of HRV indices and two output classes. These model evaluated based on their accuracy, precision, sensitivity, specificity, F1 score, and Area under the curve (AUC). The model of logistic regression has achieved an accuracy of 88.50%, precision of 83.11%, the sensitivity of 91.79%, the specificity of 86.03%, F1 score of 0.87, and AUC of 0.88. The decision tree model has arrived with an accuracy of 92.59%, precision of 97.29%, the sensitivity of 90.11%, the specificity of 97.38%, F1 score of 0.93, and AUC of 0.94. The model of the random forest has achieved an accuracy of 93.61%, precision of 94.59%, the sensitivity of 92.11%, the specificity of 95.03%, F1 score of 0.93 and AUC of 0.95. The random forest model achieved the best accuracy classification, followed by the decision tree, and logistic regression shows the lowest classification accuracy.

Details

Applied Computing and Informatics, vol. 19 no. 3/4
Type: Research Article
ISSN: 2634-1964

Keywords

Open Access
Article
Publication date: 27 February 2023

Vasileios Stamatis, Michail Salampasis and Konstantinos Diamantaras

In federated search, a query is sent simultaneously to multiple resources and each one of them returns a list of results. These lists are merged into a single list using the…

Abstract

Purpose

In federated search, a query is sent simultaneously to multiple resources and each one of them returns a list of results. These lists are merged into a single list using the results merging process. In this work, the authors apply machine learning methods for results merging in federated patent search. Even though several methods for results merging have been developed, none of them were tested on patent data nor considered several machine learning models. Thus, the authors experiment with state-of-the-art methods using patent data and they propose two new methods for results merging that use machine learning models.

Design/methodology/approach

The methods are based on a centralized index containing samples of documents from all the remote resources, and they implement machine learning models to estimate comparable scores for the documents retrieved by different resources. The authors examine the new methods in cooperative and uncooperative settings where document scores from the remote search engines are available and not, respectively. In uncooperative environments, they propose two methods for assigning document scores.

Findings

The effectiveness of the new results merging methods was measured against state-of-the-art models and found to be superior to them in many cases with significant improvements. The random forest model achieves the best results in comparison to all other models and presents new insights for the results merging problem.

Originality/value

In this article the authors prove that machine learning models can substitute other standard methods and models that used for results merging for many years. Our methods outperformed state-of-the-art estimation methods for results merging, and they proved that they are more effective for federated patent search.

Details

Data Technologies and Applications, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9288

Keywords

Open Access
Article
Publication date: 8 July 2019

Daniel Abreu Vasconcellos de Paula, Rinaldo Artes, Fabio Ayres and Andrea Maria Accioly Fonseca Minardi

Although credit unions are nonprofit organizations, their objectives depend on the efficient management of their resources and credit risk aligned with the principles of the…

2545

Abstract

Purpose

Although credit unions are nonprofit organizations, their objectives depend on the efficient management of their resources and credit risk aligned with the principles of the cooperative doctrine. This paper aims to propose the combined use of credit scoring and profit scoring to increase the effectiveness of the loan-granting process in credit unions.

Design/methodology/approach

This sample is composed by the data of personal loans transactions of a Brazilian credit union.

Findings

The analysis reveals that the use of statistical methods improves significantly the predictability of default when compared to the use of subjective techniques and the superiority of the random forests model in estimating credit scoring and profit scoring when compared to logit and ordinary least squares method (OLS) regression. The study also illustrates how both analyses can be used jointly for more effective decision-making.

Originality/value

Replacing subjective analysis with objective credit analysis using deterministic models will benefit Brazilian credit unions. The credit decision will be based on the input variables and on clear criteria, turning the decision-making process impartial. The joint use of credit scoring and profit scoring allows granting credit for the clients with the highest potential to pay debt obligation and, at the same time, to certify that the transaction profitability meets the goals of the organization: to be sustainable and to provide loans and investment opportunities at attractive rates to members.

Details

RAUSP Management Journal, vol. 54 no. 3
Type: Research Article
ISSN: 2531-0488

Keywords

Open Access
Article
Publication date: 12 October 2021

Kiran Fahd, Shah Jahan Miah and Khandakar Ahmed

Student attritions in tertiary educational institutes may play a significant role to achieve core values leading towards strategic mission and financial well-being. Analysis of…

3745

Abstract

Purpose

Student attritions in tertiary educational institutes may play a significant role to achieve core values leading towards strategic mission and financial well-being. Analysis of data generated from student interaction with learning management systems (LMSs) in blended learning (BL) environments may assist with the identification of students at risk of failing, but to what extent this may be possible is unknown. However, existing studies are limited to address the issues at a significant scale.

Design/methodology/approach

This study develops a new approach harnessing applications of machine learning (ML) models on a dataset, that is publicly available, relevant to student attrition to identify potential students at risk. The dataset consists of the data generated by the interaction of students with LMS for their BL environment.

Findings

Identifying students at risk through an innovative approach will promote timely intervention in the learning process, such as for improving student academic progress. To evaluate the performance of the proposed approach, the accuracy is compared with other representational ML methods.

Originality/value

The best ML algorithm random forest with 85% is selected to support educators in implementing various pedagogical practices to improve students’ learning.

Open Access
Article
Publication date: 2 November 2021

Showmitra Kumar Sarkar, Swapan Talukdar, Atiqur Rahman, Shahfahad and Sujit Kumar Roy

The present study aims to construct ensemble machine learning (EML) algorithms for groundwater potentiality mapping (GPM) in the Teesta River basin of Bangladesh, including random

2251

Abstract

Purpose

The present study aims to construct ensemble machine learning (EML) algorithms for groundwater potentiality mapping (GPM) in the Teesta River basin of Bangladesh, including random forest (RF) and random subspace (RSS).

Design/methodology/approach

The RF and RSS models have been implemented for integrating 14 selected groundwater condition parametres with groundwater inventories for generating GPMs. The GPM were then validated using the empirical and bionormal receiver operating characteristics (ROC) curve.

Findings

The very high (831–1200 km2) and high groundwater potential areas (521–680 km2) were predicted using EML algorithms. The RSS (AUC-0.892) model outperformed RF model based on ROC's area under curve (AUC).

Originality/value

Two new EML models have been constructed for GPM. These findings will aid in proposing sustainable water resource management plans.

Details

Frontiers in Engineering and Built Environment, vol. 2 no. 1
Type: Research Article
ISSN: 2634-2499

Keywords

Open Access
Article
Publication date: 22 November 2022

Kedong Yin, Yun Cao, Shiwei Zhou and Xinman Lv

The purposes of this research are to study the theory and method of multi-attribute index system design and establish a set of systematic, standardized, scientific index systems…

Abstract

Purpose

The purposes of this research are to study the theory and method of multi-attribute index system design and establish a set of systematic, standardized, scientific index systems for the design optimization and inspection process. The research may form the basis for a rational, comprehensive evaluation and provide the most effective way of improving the quality of management decision-making. It is of practical significance to improve the rationality and reliability of the index system and provide standardized, scientific reference standards and theoretical guidance for the design and construction of the index system.

Design/methodology/approach

Using modern methods such as complex networks and machine learning, a system for the quality diagnosis of index data and the classification and stratification of index systems is designed. This guarantees the quality of the index data, realizes the scientific classification and stratification of the index system, reduces the subjectivity and randomness of the design of the index system, enhances its objectivity and rationality and lays a solid foundation for the optimal design of the index system.

Findings

Based on the ideas of statistics, system theory, machine learning and data mining, the focus in the present research is on “data quality diagnosis” and “index classification and stratification” and clarifying the classification standards and data quality characteristics of index data; a data-quality diagnosis system of “data review – data cleaning – data conversion – data inspection” is established. Using a decision tree, explanatory structural model, cluster analysis, K-means clustering and other methods, classification and hierarchical method system of indicators is designed to reduce the redundancy of indicator data and improve the quality of the data used. Finally, the scientific and standardized classification and hierarchical design of the index system can be realized.

Originality/value

The innovative contributions and research value of the paper are reflected in three aspects. First, a method system for index data quality diagnosis is designed, and multi-source data fusion technology is adopted to ensure the quality of multi-source, heterogeneous and mixed-frequency data of the index system. The second is to design a systematic quality-inspection process for missing data based on the systematic thinking of the whole and the individual. Aiming at the accuracy, reliability, and feasibility of the patched data, a quality-inspection method of patched data based on inversion thought and a unified representation method of data fusion based on a tensor model are proposed. The third is to use the modern method of unsupervised learning to classify and stratify the index system, which reduces the subjectivity and randomness of the design of the index system and enhances its objectivity and rationality.

Details

Marine Economics and Management, vol. 5 no. 2
Type: Research Article
ISSN: 2516-158X

Keywords

Open Access
Article
Publication date: 13 January 2022

Dinda Thalia Andariesta and Meditya Wasesa

This research presents machine learning models for predicting international tourist arrivals in Indonesia during the COVID-19 pandemic using multisource Internet data.

4857

Abstract

Purpose

This research presents machine learning models for predicting international tourist arrivals in Indonesia during the COVID-19 pandemic using multisource Internet data.

Design/methodology/approach

To develop the prediction models, this research utilizes multisource Internet data from TripAdvisor travel forum and Google Trends. Temporal factors, posts and comments, search queries index and previous tourist arrivals records are set as predictors. Four sets of predictors and three distinct data compositions were utilized for training the machine learning models, namely artificial neural networks (ANNs), support vector regression (SVR) and random forest (RF). To evaluate the models, this research uses three accuracy metrics, namely root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE).

Findings

Prediction models trained using multisource Internet data predictors have better accuracy than those trained using single-source Internet data or other predictors. In addition, using more training sets that cover the phenomenon of interest, such as COVID-19, will enhance the prediction model's learning process and accuracy. The experiments show that the RF models have better prediction accuracy than the ANN and SVR models.

Originality/value

First, this study pioneers the practice of a multisource Internet data approach in predicting tourist arrivals amid the unprecedented COVID-19 pandemic. Second, the use of multisource Internet data to improve prediction performance is validated with real empirical data. Finally, this is one of the few papers to provide perspectives on the current dynamics of Indonesia's tourism demand.

Open Access
Article
Publication date: 28 October 2014

Julian Witjaksono, Xiaowen Wei, Suchun Mao, Wankui Gong, Yabing Li and Youlu Yuan

The purpose of this paper is to provide an overview of the current state of knowledge on the economic performance of genetically modified (GM) cotton worldwide based on a wide…

9944

Abstract

Purpose

The purpose of this paper is to provide an overview of the current state of knowledge on the economic performance of genetically modified (GM) cotton worldwide based on a wide range of data and source from available literature, and second to assess yield gain and economic performance.

Design/methodology/approach

A systematic review was captured to provide the evidence of potential benefits of GM cotton. A country-specific analysis was conducted in order to compare economic indicators and employed meta-analysis to find out the significance of the different of GM cotton over its counterpart.

Findings

This paper depicts positive impact of commercialized GM cotton in terms of net revenue, and the benefits, especially in terms of increased yields, are greatest for the mostly farmers in developing countries who have benefitted from the spill over of technology targeted at farmers in industrialized countries.

Research limitations/implications

Due to the variability of the data which came from different methodologies, it is difficult to determine the differences of the performances each individual study.

Practical implications

This, it is believed that results from this study can be useful for operations of all sizes as the authors think about what needs to be focussed on for long-term producers survival.

Originality/value

The paper clearly indicates that China is the highest cotton yield of GM cotton, the lowest cost of GM seed and the lowest cost of chemical spray compare to any other countries. Therefore, this is the fact that the adoption of GM cotton has been widely spread among the farmers across the regions in China.

Details

China Agricultural Economic Review, vol. 6 no. 4
Type: Research Article
ISSN: 1756-137X

Keywords

Open Access
Article
Publication date: 7 June 2021

Tamoor Khan, Jiangtao Qiu, Ameen Banjar, Riad Alharbey, Ahmed Omar Alzahrani and Rashid Mehmood

The purpose of this paper is to assess the impacts on production of five fruit crops from 1961 to 2018 of energy use, CO2 emissions, farming areas and the labor force in China.

1860

Abstract

Purpose

The purpose of this paper is to assess the impacts on production of five fruit crops from 1961 to 2018 of energy use, CO2 emissions, farming areas and the labor force in China.

Design/methodology/approach

This analysis applied the autoregressive distributed lag-bound testing (ARDL) approach, Granger causality method and Johansen co-integration test to predict long-term co-integration and relation between variables. Four machine learning methods are used for prediction of the accuracy of climate effect on fruit production.

Findings

The Johansen test findings have shown that the fruit crop growth, energy use, CO2 emissions, harvested land and labor force have a long-term co-integration relation. The outcome of the long-term use of CO2 emission and rural population has a negative influence on fruit crops. The energy consumption, harvested area, total fruit yield and agriculture labor force have a positive influence on six fruit crops. The long-run relationships reveal that a 1% increase in rural population and CO2 will decrease fruit crop production by −0.59 and −1.97. The energy consumption, fruit harvested area, total fruit yield and agriculture labor force will increase fruit crop production by 0.17%, 1.52%, 1.80% and 4.33%, respectively. Furthermore, uni-directional causality is correlated with the growth of fruit crops and energy consumption. Also, the results indicate that the bi-directional causality impact varies from CO2 emissions to agricultural areas to fruit crops.

Originality/value

This study also fills the literature gap in implementing ARDL for agricultural fruits of China, used machine learning methods to examine the impact of climate change and to explore this important issue.

Details

International Journal of Climate Change Strategies and Management, vol. 13 no. 2
Type: Research Article
ISSN: 1756-8692

Keywords

Open Access
Article
Publication date: 19 December 2022

Baojun Ma, Jingxia He, Hui Yuan, Jian Zhang and Chi Zhang

Corporate social responsibility (CSR) is significant in the financial market. Despite plenty of existing research on CSR, few studies have quantified the fine-grained aspects of…

804

Abstract

Purpose

Corporate social responsibility (CSR) is significant in the financial market. Despite plenty of existing research on CSR, few studies have quantified the fine-grained aspects of CSR and examined how diverse CSR aspects are associated with firms' trade credit. Based on the released CSR reports, this paper strives to measure the CSR fulfillment of firms and examine the relationships between CSR and trade credit in terms of textual features presented in these reports.

Design/methodology/approach

This research proposes a natural language processing-based framework to extract the overall readability and the sentiment of fine-grained aspects from CSR reports, which can signal the performance of firms' CSR in diverse aspects. Furthermore, this paper explores how the textual features are associated with trade credit through partial dependence plots (PDPs), and PDPs can generate both linear and nonlinear relationships.

Findings

The study’s results reveal that the overall readability of the reports is positively associated with trade credit, while the performance of the fine-grained CSR aspects mentioned in the CSR reports matters differently. The performance of the environment has a positive impact on trade credit; the performance of creditors, suppliers and information disclosure, shows a U-shaped influence on trade credit; while the performance of the government and customers is negatively associated with trade credit.

Originality/value

This study expands the scope of research on CSR and trade credit by investigating fine-grained aspects covered in CSR reports. It also offers some managerial implications in the allocation of CSR resources and the presentation of CSR reports.

Details

Journal of Electronic Business & Digital Economics, vol. 2 no. 1
Type: Research Article
ISSN: 2754-4214

Keywords

1 – 10 of 331