Search results
21 – 30 of over 48000This work can be used as a building block in other settings such as GPU, Map-Reduce, Spark or any other. Also, DDPML can be deployed on other distributed systems such as P2P…
Abstract
Purpose
This work can be used as a building block in other settings such as GPU, Map-Reduce, Spark or any other. Also, DDPML can be deployed on other distributed systems such as P2P networks, clusters, clouds computing or other technologies.
Design/methodology/approach
In the age of Big Data, all companies want to benefit from large amounts of data. These data can help them understand their internal and external environment and anticipate associated phenomena, as the data turn into knowledge that can be used for prediction later. Thus, this knowledge becomes a great asset in companies' hands. This is precisely the objective of data mining. But with the production of a large amount of data and knowledge at a faster pace, the authors are now talking about Big Data mining. For this reason, the authors’ proposed works mainly aim at solving the problem of volume, veracity, validity and velocity when classifying Big Data using distributed and parallel processing techniques. So, the problem that the authors are raising in this work is how the authors can make machine learning algorithms work in a distributed and parallel way at the same time without losing the accuracy of classification results. To solve this problem, the authors propose a system called Dynamic Distributed and Parallel Machine Learning (DDPML) algorithms. To build it, the authors divided their work into two parts. In the first, the authors propose a distributed architecture that is controlled by Map-Reduce algorithm which in turn depends on random sampling technique. So, the distributed architecture that the authors designed is specially directed to handle big data processing that operates in a coherent and efficient manner with the sampling strategy proposed in this work. This architecture also helps the authors to actually verify the classification results obtained using the representative learning base (RLB). In the second part, the authors have extracted the representative learning base by sampling at two levels using the stratified random sampling method. This sampling method is also applied to extract the shared learning base (SLB) and the partial learning base for the first level (PLBL1) and the partial learning base for the second level (PLBL2). The experimental results show the efficiency of our solution that the authors provided without significant loss of the classification results. Thus, in practical terms, the system DDPML is generally dedicated to big data mining processing, and works effectively in distributed systems with a simple structure, such as client-server networks.
Findings
The authors got very satisfactory classification results.
Originality/value
DDPML system is specially designed to smoothly handle big data mining classification.
Details
Keywords
The objective of this research work is to design a data-based solution for administering traffic organization in a smart city by using the machine learning algorithm.
Abstract
Purpose
The objective of this research work is to design a data-based solution for administering traffic organization in a smart city by using the machine learning algorithm.
Design/methodology/approach
A machine learning framework for managing traffic infrastructure and air pollution in urban centers relies on a predictive analytics model. The model makes use of transportation data to predict traffic patterns based on the information gathered from numerous sources within the city. It can be promoted for strategic planning determination. The data features volume and calendar variables, including hours of the day, week and month. These variables are leveraged to identify time series-based seasonal patterns in the data. To achieve accurate traffic volume forecasting, the long short-term memory (LSTM) method is recommended.
Findings
The study has produced a model that is appropriate for the transportation sector in the city and other innovative urban applications. The findings indicate that the implementation of smart transportation systems enhances transportation and has a positive impact on air quality. The study's results are explored and connected to practical applications in the areas of air pollution control and smart transportation.
Originality/value
The present paper has created the machine learning framework for the transportation sector of smart cities that achieves a reasonable level of accuracy. Additionally, the paper examines the effects of smart transportation on both the environment and supply chain.
Details
Keywords
Deepak Suresh Asudani, Naresh Kumar Nagwani and Pradeep Singh
Classifying emails as ham or spam based on their content is essential. Determining the semantic and syntactic meaning of words and putting them in a high-dimensional feature…
Abstract
Purpose
Classifying emails as ham or spam based on their content is essential. Determining the semantic and syntactic meaning of words and putting them in a high-dimensional feature vector form for processing is the most difficult challenge in email categorization. The purpose of this paper is to examine the effectiveness of the pre-trained embedding model for the classification of emails using deep learning classifiers such as the long short-term memory (LSTM) model and convolutional neural network (CNN) model.
Design/methodology/approach
In this paper, global vectors (GloVe) and Bidirectional Encoder Representations Transformers (BERT) pre-trained word embedding are used to identify relationships between words, which helps to classify emails into their relevant categories using machine learning and deep learning models. Two benchmark datasets, SpamAssassin and Enron, are used in the experimentation.
Findings
In the first set of experiments, machine learning classifiers, the support vector machine (SVM) model, perform better than other machine learning methodologies. The second set of experiments compares the deep learning model performance without embedding, GloVe and BERT embedding. The experiments show that GloVe embedding can be helpful for faster execution with better performance on large-sized datasets.
Originality/value
The experiment reveals that the CNN model with GloVe embedding gives slightly better accuracy than the model with BERT embedding and traditional machine learning algorithms to classify an email as ham or spam. It is concluded that the word embedding models improve email classifiers accuracy.
Details
Keywords
Xin Huang, Ting Tang, Yu Ning Luo and Ren Wang
This study aims to examine the impact of board characteristics on firm performance while also exploring the influential mechanisms that help Chinese listed companies establish…
Abstract
Purpose
This study aims to examine the impact of board characteristics on firm performance while also exploring the influential mechanisms that help Chinese listed companies establish effective boards of directors and strengthen their corporate governance mechanisms.
Design/methodology/approach
This paper uses machine learning methods to investigate the predictive ability of the board of directors' characteristics on firm performance based on the data from Chinese A-share listed companies on the Shanghai and Shenzhen stock exchanges in China during 2008–2021. This study further analyzes board characteristics with relatively strong predictive ability and their predictive models on firm performance.
Findings
The results show that nonlinear machine learning methods are more effective than traditional linear models in analyzing the impact of board characteristics on Chinese firm performance. Among the series characteristics of the board of directors, the contribution ratio in prediction from directors compensation, director shareholding ratio, the average age of directors and directors' educational level are significant, and these characteristics have a roughly nonlinear correlation to the prediction of firm performance; the improvement of the predictive ability of board characteristics on firm performance in state-owned enterprises in China performs better than that in private enterprises.
Practical implications
The findings of this study provide valuable suggestions for enriching the theory of board governance, strengthening board construction and optimizing the effectiveness of board governance. Furthermore, these impacts can serve as a valuable reference for board construction and selection, aiding in the rational selection of boards to establish an efficient and high-performing board of directors.
Originality/value
The study findings unequivocally demonstrate the superiority of nonlinear machine learning approaches over traditional linear models in examining the relationship between board characteristics and firm performance in China. Within the suite of board characteristics, director compensation, shareholding ratio, average age and educational level are particularly noteworthy, consistently demonstrating strong, nonlinear associations with firm performance. Within the suite of board characteristics, director compensation, shareholding ratio, average age and educational level are particularly noteworthy, consistently demonstrating strong, nonlinear associations with firm performance. The study reveals that the predictive performance of board attributes is generally more robust for state-owned enterprises in China in comparison to their counterparts in the private sector.
Details
Keywords
Samar Ali Shilbayeh and Sunil Vadera
This paper aims to describe the use of a meta-learning framework for recommending cost-sensitive classification methods with the aim of answering an important question that arises…
Abstract
Purpose
This paper aims to describe the use of a meta-learning framework for recommending cost-sensitive classification methods with the aim of answering an important question that arises in machine learning, namely, “Among all the available classification algorithms, and in considering a specific type of data and cost, which is the best algorithm for my problem?”
Design/methodology/approach
This paper describes the use of a meta-learning framework for recommending cost-sensitive classification methods for the aim of answering an important question that arises in machine learning, namely, “Among all the available classification algorithms, and in considering a specific type of data and cost, which is the best algorithm for my problem?” The framework is based on the idea of applying machine learning techniques to discover knowledge about the performance of different machine learning algorithms. It includes components that repeatedly apply different classification methods on data sets and measures their performance. The characteristics of the data sets, combined with the algorithms and the performance provide the training examples. A decision tree algorithm is applied to the training examples to induce the knowledge, which can then be used to recommend algorithms for new data sets. The paper makes a contribution to both meta-learning and cost-sensitive machine learning approaches. Those both fields are not new, however, building a recommender that recommends the optimal case-sensitive approach for a given data problem is the contribution. The proposed solution is implemented in WEKA and evaluated by applying it on different data sets and comparing the results with existing studies available in the literature. The results show that a developed meta-learning solution produces better results than METAL, a well-known meta-learning system. The developed solution takes the misclassification cost into consideration during the learning process, which is not available in the compared project.
Findings
The proposed solution is implemented in WEKA and evaluated by applying it to different data sets and comparing the results with existing studies available in the literature. The results show that a developed meta-learning solution produces better results than METAL, a well-known meta-learning system.
Originality/value
The paper presents a major piece of new information in writing for the first time. Meta-learning work has been done before but this paper presents a new meta-learning framework that is costs sensitive.
Details
Keywords
Madhulika Bhatia, Shubham Sharma, Madhurima Hooda and Narayan C. Debnath
Recent research advances in artificial intelligence, machine learning, and neural networks are becoming essential tools for building a wide range of intelligent applications…
Abstract
Recent research advances in artificial intelligence, machine learning, and neural networks are becoming essential tools for building a wide range of intelligent applications. Moreover, machine learning helps to automate analytical model building. Machine learning based frameworks and approaches allow making well-informed and intelligent choices for improving daily eating habits and extension of healthy lifestyle. This book chapter presents a new machine learning approach for meal classification and assessment of nutrients values based on weather conditions along with new and innovative ideas for further study and research on health care-related applications.
Details
Keywords
Prudence Kadebu, Robert T.R. Shoniwa, Kudakwashe Zvarevashe, Addlight Mukwazvure, Innocent Mapanga, Nyasha Fadzai Thusabantu and Tatenda Trust Gotora
Given how smart today’s malware authors have become through employing highly sophisticated techniques, it is only logical that methods be developed to combat the most potent…
Abstract
Purpose
Given how smart today’s malware authors have become through employing highly sophisticated techniques, it is only logical that methods be developed to combat the most potent threats, particularly where the malware is stealthy and makes indicators of compromise (IOC) difficult to detect. After the analysis is completed, the output can be employed to detect and then counteract the attack. The goal of this work is to propose a machine learning approach to improve malware detection by combining the strengths of both supervised and unsupervised machine learning techniques. This study is essential as malware has certainly become ubiquitous as cyber-criminals use it to attack systems in cyberspace. Malware analysis is required to reveal hidden IOC, to comprehend the attacker’s goal and the severity of the damage and to find vulnerabilities within the system.
Design/methodology/approach
This research proposes a hybrid approach for dynamic and static malware analysis that combines unsupervised and supervised machine learning algorithms and goes on to show how Malware exploiting steganography can be exposed.
Findings
The tactics used by malware developers to circumvent detection are becoming more advanced with steganography becoming a popular technique applied in obfuscation to evade mechanisms for detection. Malware analysis continues to call for continuous improvement of existing techniques. State-of-the-art approaches applying machine learning have become increasingly popular with highly promising results.
Originality/value
Cyber security researchers globally are grappling with devising innovative strategies to identify and defend against the threat of extremely sophisticated malware attacks on key infrastructure containing sensitive data. The process of detecting the presence of malware requires expertise in malware analysis. Applying intelligent methods to this process can aid practitioners in identifying malware’s behaviour and features. This is especially expedient where the malware is stealthy, hiding IOC.
Details
Keywords
Djordje Cica, Branislav Sredanovic, Sasa Tesic and Davorin Kramar
Sustainable manufacturing is one of the most important and most challenging issues in present industrial scenario. With the intention of diminish negative effects associated with…
Abstract
Sustainable manufacturing is one of the most important and most challenging issues in present industrial scenario. With the intention of diminish negative effects associated with cutting fluids, the machining industries are continuously developing technologies and systems for cooling/lubricating of the cutting zone while maintaining machining efficiency. In the present study, three regression based machine learning techniques, namely, polynomial regression (PR), support vector regression (SVR) and Gaussian process regression (GPR) were developed to predict machining force, cutting power and cutting pressure in the turning of AISI 1045. In the development of predictive models, machining parameters of cutting speed, depth of cut and feed rate were considered as control factors. Since cooling/lubricating techniques significantly affects the machining performance, prediction model development of quality characteristics was performed under minimum quantity lubrication (MQL) and high-pressure coolant (HPC) cutting conditions. The prediction accuracy of developed models was evaluated by statistical error analyzing methods. Results of regressions based machine learning techniques were also compared with probably one of the most frequently used machine learning method, namely artificial neural networks (ANN). Finally, a metaheuristic approach based on a neural network algorithm was utilized to perform an efficient multi-objective optimization of process parameters for both cutting environment.
Details
Keywords
Ryan Varghese, Abha Deshpande, Gargi Digholkar and Dileep Kumar
Background: Artificial intelligence (AI) is a booming sector that has profoundly influenced every walk of life, and the education sector is no exception. In education, AI has…
Abstract
Background: Artificial intelligence (AI) is a booming sector that has profoundly influenced every walk of life, and the education sector is no exception. In education, AI has helped to develop novel teaching and learning solutions that are currently being tested in various contexts. Businesses and governments across the globe have been pouring money into a wide array of implementations, and dozens of EdTech start-ups are being funded to capitalise on this technological force. The penetration of AI in classroom teaching is also a profound matter of discussion. These have garnered massive amounts of student big data and have a significant impact on the life of both students and educators alike.
Purpose: The prime focus of this chapter is to extensively review and analyse the vast literature available on the utilities of AI in health care, learning, and development. The specific objective of thematic exploration of the literature is to explicate the principal facets and recent advances in the development and employment of AI in the latter. This chapter also aims to explore how the EdTech and healthcare–education sectors would witness a paradigm shift with the advent and incorporation of AI.
Design/Methodology/Approach: To provide context and evidence, relevant publications were identified on ScienceDirect, PubMed, and Google Scholar using keywords like AI, education, learning, health care, and development. In addition, the latest articles were also thoroughly reviewed to underscore recent advances in the same field.
Results: The implementation of AI in the learning, development, and healthcare sector is rising steeply, with a projected expansion of about 50% by 2022. These algorithms and user interfaces economically facilitate efficient delivery of the latter.
Conclusions: The EdTech and healthcare sector has great potential for a spectrum of AI-based interventions, providing access to learning opportunities and personalised experiences. These interventions are often economic in the long run compared to conventional modalities. However, several ethical and regulatory concerns should be addressed before the complete adoption of AI in these sectors.
Originality/Value: The value in exploring this topic is to present a view on the potential of employing AI in health care, medical education, and learning and development. It also intends to open a discussion of its potential benefits and a remedy to its shortcomings.
Details
Keywords
Hanane Sebbaq and Nour-eddine El Faddouli
The purpose of this study is, First, to leverage the limitation of annotated data and to identify the cognitive level of learning objectives efficiently, this study adopts…
Abstract
Purpose
The purpose of this study is, First, to leverage the limitation of annotated data and to identify the cognitive level of learning objectives efficiently, this study adopts transfer learning by using word2vec and a bidirectional gated recurrent units (GRU) that can fully take into account the context and improves the classification of the model. This study adds a layer based on attention mechanism (AM), which captures the context vector and gives keywords higher weight for text classification. Second, this study explains the authors’ model’s results with local interpretable model-agnostic explanations (LIME).
Design/methodology/approach
Bloom's taxonomy levels of cognition are commonly used as a reference standard for identifying e-learning contents. Many action verbs in Bloom's taxonomy, however, overlap at different levels of the hierarchy, causing uncertainty regarding the cognitive level expected. Some studies have looked into the cognitive classification of e-learning content but none has looked into learning objectives. On the other hand, most of these research papers just adopt classical machine learning algorithms. The main constraint of this study is the availability of annotated learning objectives data sets. This study managed to build a data set of 2,400 learning objectives, but this size remains limited.
Findings
This study’s experiments show that the proposed model achieves highest scores of accuracy: 90.62%, F1-score and loss. The proposed model succeeds in classifying learning objectives, which contain ambiguous verb from the Bloom’s taxonomy action verbs, while the same model without the attention layer fails. This study’s LIME explainer aids in visualizing the most essential features of the text, which contributes to justifying the final classification.
Originality/value
In this study, the main objective is to propose a model that outperforms the baseline models for learning objectives classification based on the six cognitive levels of Bloom's taxonomy. In this sense, this study builds the bidirectional GRU (BiGRU)-attention model based on the combination of the BiGRU algorithm with the AM. This study feeds the architecture with word2vec embeddings. To prove the effectiveness of the proposed model, this study compares it with four classical machine learning algorithms that are widely used for the cognitive classification of text: Bayes naive, logistic regression, support vector machine and K-nearest neighbors and with GRU. The main constraint related to this study is the absence of annotated data; there is no annotated learning objective data set based on Bloom’s taxonomy's cognitive levels. To overcome this problem, this study seemed to have no choice but to build the data set.
Details