Search results

1 – 10 of 216
Book part
Publication date: 30 September 2020

B. G. Deepa and S. Senthil

Breast cancer (BC) is one of the leading cancer in the world, BC risk has been there for women of the middle age also, it is the malignant tumor. However, identifying BC in the…

Abstract

Breast cancer (BC) is one of the leading cancer in the world, BC risk has been there for women of the middle age also, it is the malignant tumor. However, identifying BC in the early stage will save most of the women’s life. As there is an advancement in the technology research used Machine Learning (ML) algorithm Random Forest for ranking the feature, Support Vector Machine (SVM), and Naïve Bayes (NB) supervised classifiers for selection of best optimized features and prediction of BC accuracy. The estimation of prediction accuracy has been done by using the dataset Wisconsin Breast Cancer Data from University of California Irvine (UCI) ML repository. To perform all these operation, Anaconda one of the open source distribution of Python has been used. The proposed work resulted in extemporize improvement in the NB and SVM classifier accuracy. The performance evaluation of the proposed model is estimated by using classification accuracy, confusion matrix, mean, standard deviation, variance, and root mean-squared error.

The experimental results shows that 70-30 data split will result in best accuracy. SVM acts as a feature optimizer of 12 best features with the result of 97.66% accuracy and improvement of 1.17% after feature reduction. NB results with feature optimizer 17 of best features with the result of 96.49% accuracy and improvement of 1.17% after feature reduction.

The study shows that proposal model works very effectively as compare to the existing models with respect to accuracy measures.

Details

Big Data Analytics and Intelligence: A Perspective for Health Care
Type: Book
ISBN: 978-1-83909-099-8

Keywords

Book part
Publication date: 30 September 2020

Hera Khan, Ayush Srivastav and Amit Kumar Mishra

A detailed description will be provided of all the classification algorithms that have been widely used in the domain of medical science. The foundation will be laid by giving a…

Abstract

A detailed description will be provided of all the classification algorithms that have been widely used in the domain of medical science. The foundation will be laid by giving a comprehensive overview pertaining to the background and history of the classification algorithms. This will be followed by an extensive discussion regarding various techniques of classification algorithm in machine learning (ML) hence concluding with their relevant applications in data analysis in medical science and health care. To begin with, the initials of this chapter will deal with the basic fundamentals required for a profound understanding of the classification techniques in ML which will comprise of the underlying differences between Unsupervised and Supervised Learning followed by the basic terminologies of classification and its history. Further, it will include the types of classification algorithms ranging from linear classifiers like Logistic Regression, Naïve Bayes to Nearest Neighbour, Support Vector Machine, Tree-based Classifiers, and Neural Networks, and their respective mathematics. Ensemble algorithms such as Majority Voting, Boosting, Bagging, Stacking will also be discussed at great length along with their relevant applications. Furthermore, this chapter will also incorporate comprehensive elucidation regarding the areas of application of such classification algorithms in the field of biomedicine and health care and their contribution to decision-making systems and predictive analysis. To conclude, this chapter will devote highly in the field of research and development as it will provide a thorough insight to the classification algorithms and their relevant applications used in the cases of the healthcare development sector.

Details

Big Data Analytics and Intelligence: A Perspective for Health Care
Type: Book
ISBN: 978-1-83909-099-8

Keywords

Book part
Publication date: 31 January 2015

Davy Janssens and Geert Wets

Several activity-based transportation models are now becoming operational and are entering the stage of application for the modelling of travel demand. In our application, we will…

Abstract

Several activity-based transportation models are now becoming operational and are entering the stage of application for the modelling of travel demand. In our application, we will use decision rules to support the decision-making of the model instead of principles of utility maximization, which means our work can be interpreted as an application of the concept of bounded rationality in the transportation domain. In this chapter we explored a novel idea of combining decision trees and Bayesian networks to improve decision-making in order to maintain the potential advantages of both techniques. The results of this study suggest that integrated Bayesian networks and decision trees can be used for modelling the different choice facets of a travel demand model with better predictive power than CHAID decision trees. Another conclusion is that there are initial indications that the new way of integrating decision trees and Bayesian networks has produced a decision tree that is structurally more stable.

Details

Bounded Rational Choice Behaviour: Applications in Transport
Type: Book
ISBN: 978-1-78441-071-1

Keywords

Abstract

Details

Machine Learning and Artificial Intelligence in Marketing and Sales
Type: Book
ISBN: 978-1-80043-881-1

Book part
Publication date: 4 December 2020

Gauri Rajendra Virkar and Supriya Sunil Shinde

Predictive analytics is the science of decision-making that eliminates guesswork out of the decision-making process and applies proven scientific procedures to find right…

Abstract

Predictive analytics is the science of decision-making that eliminates guesswork out of the decision-making process and applies proven scientific procedures to find right solutions. Predictive analytics provides ideas on the occurrences of future downtimes and rejections thereby aids in taking preventive actions before abnormalities occur. Considering these advantages, predictive analytics is adopted in various diverse fields such as health care, finance, education, marketing, automotive, etc. Predictive analytics tools can be used to predict various behaviors and patterns, thereby saving the time and money of its users. Many open-source predictive analysis tools namely R, scikit-learn, Konstanz Information Miner (KNIME), Orange, RapidMiner, Waikato Environment for Knowledge Analysis (WEKA), etc. are freely available for the users. This chapter aims to reveal the best accurate tools and techniques for the classification task that aid in decision-making. Our experimental results show that no specific tool provides the best results in all scenarios; rather it depends upon the datasets and the classifier.

Abstract

Details

Developing an Effective Model for Detecting Trade-based Market Manipulation
Type: Book
ISBN: 978-1-80117-397-1

Book part
Publication date: 28 March 2022

Altaf Alam, Anurag Chauhan, Mohd Tauseef Khan and Zainul Abdin Jaffery

In this chapter, drone and vision camera technology have been combined for monitoring the crop product quality. Three vegetable crops such as tomato, cauliflower, and eggplant are…

Abstract

In this chapter, drone and vision camera technology have been combined for monitoring the crop product quality. Three vegetable crops such as tomato, cauliflower, and eggplant are considered for quality monitoring; hence, image datasets are collected for those vegetables only. The proposed method classified the vegetables into two classes as rotten and nonrotten products so the images were collected for rotten and nonrotten products. Three different features information such as chromatic features, contour features, and texture features have been extracted from the dataset and further used to train a Gaussian kernel support vector machine algorithm for identifying the product quality. The system utilized multiple features such as chromatic, contour, and texture features in classifier training which enhances the accuracy and robustness of the system. Chromatic features were utilized for detecting the crop while other features such as contour and texture features were utilized for further classifier building to identify the crop product quality. The performance of the system is evaluated based on the true positive rate, false discovery rate, positive predictive value, and accuracy. The proposed system identified good and bad products with a 97.9% of true positive rate, 2.43 % of false discovery rate, 97.73% positive predictive value, and 95.4% of accuracy. The achieved results concluded that the results are lucrative and the proposed system is efficient in agriculture product quality monitoring.

Book part
Publication date: 10 November 2017

Karen Miller

This chapter explores differences in fringe, distant, and remote rural public library assets for asset-based community development (ABCD) and the relationships of those assets to…

Abstract

This chapter explores differences in fringe, distant, and remote rural public library assets for asset-based community development (ABCD) and the relationships of those assets to geographic regions, governance structures, and demographics.

The author analyzes 2013 data from the Institute of Museum and Library Services (IMLS) and U.S. Department of Agriculture using nonparametric statistics and data mining random forest supervised classification algorithms.

There are statistically significant differences between fringe, distant, and remote library assets. Unexpectedly, median per capita outlets (along with service hours and staff) increase as distances from urban areas increase. The Southeast region ranks high in unemployment and poverty and low in median household income, which aligns with the Southeast’s low median per capita library expenditures, staff, hours, inventory, and programs. However, the Southeast’s relatively high percentage of rural libraries with at least one staff member with a Master of Library and Information Science promises future asset growth in those libraries. State and federal contributions to Alaska libraries propelled the remote Far West to the number one ranking in median per capita staff, inventory, and programs.

This study is based on IMLS library system-wide data and does not include rural library branches operated by nonrural central libraries.

State and federal contributions to rural libraries increase economic, cultural, and social capital creation in the most remote communities. On a per capita basis, economic capital from state and federal agencies assists small, remote rural libraries in providing infrastructure and services that are more closely aligned with libraries in more populated areas and increases library assets available for ABCD initiatives in otherwise underserved communities.

Even the smallest rural library can contribute to ABCD initiatives by connecting their communities to outside resources and creating new economic, cultural, and social assets.

Analyzing rural public library assets within their geographic, political, and demographic contexts highlights their potential contributions to ABCD initiatives.

Details

Rural and Small Public Libraries: Challenges and Opportunities
Type: Book
ISBN: 978-1-78743-112-6

Keywords

Book part
Publication date: 18 July 2022

Yakub Kayode Saheed, Usman Ahmad Baba and Mustafa Ayobami Raji

Purpose: This chapter aims to examine machine learning (ML) models for predicting credit card fraud (CCF).Need for the study: With the advance of technology, the world is…

Abstract

Purpose: This chapter aims to examine machine learning (ML) models for predicting credit card fraud (CCF).

Need for the study: With the advance of technology, the world is increasingly relying on credit cards rather than cash in daily life. This creates a slew of new opportunities for fraudulent individuals to abuse these cards. As of December 2020, global card losses reached $28.65billion, up 2.9% from $27.85 billion in 2018, according to the Nilson 2019 research. To safeguard the safety of credit card users, the credit card issuer should include a service that protects customers from potential risks. CCF has become a severe threat as internet buying has grown. To this goal, various studies in the field of automatic and real-time fraud detection are required. Due to their advantageous properties, the most recent ones employ a variety of ML algorithms and techniques to construct a well-fitting model to detect fraudulent transactions. When it comes to recognising credit card risk is huge and high-dimensional data, feature selection (FS) is critical for improving classification accuracy and fraud detection.

Methodology/design/approach: The objectives of this chapter are to construct a new model for credit card fraud detection (CCFD) based on principal component analysis (PCA) for FS and using supervised ML techniques such as K-nearest neighbour (KNN), ridge classifier, gradient boosting, quadratic discriminant analysis, AdaBoost, and random forest for classification of fraudulent and legitimate transactions. When compared to earlier experiments, the suggested approach demonstrates a high capacity for detecting fraudulent transactions. To be more precise, our model’s resilience is constructed by integrating the power of PCA for determining the most useful predictive features. The experimental analysis was performed on German credit card and Taiwan credit card data sets.

Findings: The experimental findings revealed that the KNN achieved an accuracy of 96.29%, recall of 100%, and precision of 96.29%, which is the best performing model on the German data set. While the ridge classifier was the best performing model on Taiwan Credit data with an accuracy of 81.75%, recall of 34.89, and precision of 66.61%.

Practical implications: The poor performance of the models on the Taiwan data revealed that it is an imbalanced credit card data set. The comparison of our proposed models with state-of-the-art credit card ML models showed that our results were competitive.

Book part
Publication date: 11 May 2017

Golo Henseke and Francis Green

Utilizing work task data drawn from the OECD’s Survey of Adult Skills of 2011–2012 and 2014–2015, we derive a new skills-based indicator of graduate jobs, termed ISCO(HE)2008, for…

Abstract

Utilizing work task data drawn from the OECD’s Survey of Adult Skills of 2011–2012 and 2014–2015, we derive a new skills-based indicator of graduate jobs, termed ISCO(HE)2008, for 31 countries. The indicator generates a plausible distribution of graduate occupations and explains graduates’ wages and job satisfaction better than hitherto existing indicators. Unlike with the traditional classifier, several jobs in major group 3 “Technicians and Associate Professionals” require higher education in many countries. Altogether, almost a third of labor is deployed in graduate jobs in the 31 countries, but with large cross-national differences. Industry and establishment-size composition can account for some of the variation. In addition, two indicators of the relative quality of the higher education system also contribute to the variation in the prevalence of graduate jobs across countries.

Details

Skill Mismatch in Labor Markets
Type: Book
ISBN: 978-1-78714-377-7

Keywords

1 – 10 of 216