Search results

21 – 30 of over 48000
Article
Publication date: 21 December 2021

Laouni Djafri

This work can be used as a building block in other settings such as GPU, Map-Reduce, Spark or any other. Also, DDPML can be deployed on other distributed systems such as P2P…

384

Abstract

Purpose

This work can be used as a building block in other settings such as GPU, Map-Reduce, Spark or any other. Also, DDPML can be deployed on other distributed systems such as P2P networks, clusters, clouds computing or other technologies.

Design/methodology/approach

In the age of Big Data, all companies want to benefit from large amounts of data. These data can help them understand their internal and external environment and anticipate associated phenomena, as the data turn into knowledge that can be used for prediction later. Thus, this knowledge becomes a great asset in companies' hands. This is precisely the objective of data mining. But with the production of a large amount of data and knowledge at a faster pace, the authors are now talking about Big Data mining. For this reason, the authors’ proposed works mainly aim at solving the problem of volume, veracity, validity and velocity when classifying Big Data using distributed and parallel processing techniques. So, the problem that the authors are raising in this work is how the authors can make machine learning algorithms work in a distributed and parallel way at the same time without losing the accuracy of classification results. To solve this problem, the authors propose a system called Dynamic Distributed and Parallel Machine Learning (DDPML) algorithms. To build it, the authors divided their work into two parts. In the first, the authors propose a distributed architecture that is controlled by Map-Reduce algorithm which in turn depends on random sampling technique. So, the distributed architecture that the authors designed is specially directed to handle big data processing that operates in a coherent and efficient manner with the sampling strategy proposed in this work. This architecture also helps the authors to actually verify the classification results obtained using the representative learning base (RLB). In the second part, the authors have extracted the representative learning base by sampling at two levels using the stratified random sampling method. This sampling method is also applied to extract the shared learning base (SLB) and the partial learning base for the first level (PLBL1) and the partial learning base for the second level (PLBL2). The experimental results show the efficiency of our solution that the authors provided without significant loss of the classification results. Thus, in practical terms, the system DDPML is generally dedicated to big data mining processing, and works effectively in distributed systems with a simple structure, such as client-server networks.

Findings

The authors got very satisfactory classification results.

Originality/value

DDPML system is specially designed to smoothly handle big data mining classification.

Details

Data Technologies and Applications, vol. 56 no. 4
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 27 October 2023

Pulkit Tiwari

The objective of this research work is to design a data-based solution for administering traffic organization in a smart city by using the machine learning algorithm.

Abstract

Purpose

The objective of this research work is to design a data-based solution for administering traffic organization in a smart city by using the machine learning algorithm.

Design/methodology/approach

A machine learning framework for managing traffic infrastructure and air pollution in urban centers relies on a predictive analytics model. The model makes use of transportation data to predict traffic patterns based on the information gathered from numerous sources within the city. It can be promoted for strategic planning determination. The data features volume and calendar variables, including hours of the day, week and month. These variables are leveraged to identify time series-based seasonal patterns in the data. To achieve accurate traffic volume forecasting, the long short-term memory (LSTM) method is recommended.

Findings

The study has produced a model that is appropriate for the transportation sector in the city and other innovative urban applications. The findings indicate that the implementation of smart transportation systems enhances transportation and has a positive impact on air quality. The study's results are explored and connected to practical applications in the areas of air pollution control and smart transportation.

Originality/value

The present paper has created the machine learning framework for the transportation sector of smart cities that achieves a reasonable level of accuracy. Additionally, the paper examines the effects of smart transportation on both the environment and supply chain.

Details

Management of Environmental Quality: An International Journal, vol. 35 no. 2
Type: Research Article
ISSN: 1477-7835

Keywords

Article
Publication date: 2 February 2022

Deepak Suresh Asudani, Naresh Kumar Nagwani and Pradeep Singh

Classifying emails as ham or spam based on their content is essential. Determining the semantic and syntactic meaning of words and putting them in a high-dimensional feature…

372

Abstract

Purpose

Classifying emails as ham or spam based on their content is essential. Determining the semantic and syntactic meaning of words and putting them in a high-dimensional feature vector form for processing is the most difficult challenge in email categorization. The purpose of this paper is to examine the effectiveness of the pre-trained embedding model for the classification of emails using deep learning classifiers such as the long short-term memory (LSTM) model and convolutional neural network (CNN) model.

Design/methodology/approach

In this paper, global vectors (GloVe) and Bidirectional Encoder Representations Transformers (BERT) pre-trained word embedding are used to identify relationships between words, which helps to classify emails into their relevant categories using machine learning and deep learning models. Two benchmark datasets, SpamAssassin and Enron, are used in the experimentation.

Findings

In the first set of experiments, machine learning classifiers, the support vector machine (SVM) model, perform better than other machine learning methodologies. The second set of experiments compares the deep learning model performance without embedding, GloVe and BERT embedding. The experiments show that GloVe embedding can be helpful for faster execution with better performance on large-sized datasets.

Originality/value

The experiment reveals that the CNN model with GloVe embedding gives slightly better accuracy than the model with BERT embedding and traditional machine learning algorithms to classify an email as ham or spam. It is concluded that the word embedding models improve email classifiers accuracy.

Details

Data Technologies and Applications, vol. 56 no. 4
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 15 February 2024

Xin Huang, Ting Tang, Yu Ning Luo and Ren Wang

This study aims to examine the impact of board characteristics on firm performance while also exploring the influential mechanisms that help Chinese listed companies establish…

Abstract

Purpose

This study aims to examine the impact of board characteristics on firm performance while also exploring the influential mechanisms that help Chinese listed companies establish effective boards of directors and strengthen their corporate governance mechanisms.

Design/methodology/approach

This paper uses machine learning methods to investigate the predictive ability of the board of directors' characteristics on firm performance based on the data from Chinese A-share listed companies on the Shanghai and Shenzhen stock exchanges in China during 2008–2021. This study further analyzes board characteristics with relatively strong predictive ability and their predictive models on firm performance.

Findings

The results show that nonlinear machine learning methods are more effective than traditional linear models in analyzing the impact of board characteristics on Chinese firm performance. Among the series characteristics of the board of directors, the contribution ratio in prediction from directors compensation, director shareholding ratio, the average age of directors and directors' educational level are significant, and these characteristics have a roughly nonlinear correlation to the prediction of firm performance; the improvement of the predictive ability of board characteristics on firm performance in state-owned enterprises in China performs better than that in private enterprises.

Practical implications

The findings of this study provide valuable suggestions for enriching the theory of board governance, strengthening board construction and optimizing the effectiveness of board governance. Furthermore, these impacts can serve as a valuable reference for board construction and selection, aiding in the rational selection of boards to establish an efficient and high-performing board of directors.

Originality/value

The study findings unequivocally demonstrate the superiority of nonlinear machine learning approaches over traditional linear models in examining the relationship between board characteristics and firm performance in China. Within the suite of board characteristics, director compensation, shareholding ratio, average age and educational level are particularly noteworthy, consistently demonstrating strong, nonlinear associations with firm performance. Within the suite of board characteristics, director compensation, shareholding ratio, average age and educational level are particularly noteworthy, consistently demonstrating strong, nonlinear associations with firm performance. The study reveals that the predictive performance of board attributes is generally more robust for state-owned enterprises in China in comparison to their counterparts in the private sector.

Details

Chinese Management Studies, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1750-614X

Keywords

Article
Publication date: 22 September 2021

Samar Ali Shilbayeh and Sunil Vadera

This paper aims to describe the use of a meta-learning framework for recommending cost-sensitive classification methods with the aim of answering an important question that arises…

Abstract

Purpose

This paper aims to describe the use of a meta-learning framework for recommending cost-sensitive classification methods with the aim of answering an important question that arises in machine learning, namely, “Among all the available classification algorithms, and in considering a specific type of data and cost, which is the best algorithm for my problem?”

Design/methodology/approach

This paper describes the use of a meta-learning framework for recommending cost-sensitive classification methods for the aim of answering an important question that arises in machine learning, namely, “Among all the available classification algorithms, and in considering a specific type of data and cost, which is the best algorithm for my problem?” The framework is based on the idea of applying machine learning techniques to discover knowledge about the performance of different machine learning algorithms. It includes components that repeatedly apply different classification methods on data sets and measures their performance. The characteristics of the data sets, combined with the algorithms and the performance provide the training examples. A decision tree algorithm is applied to the training examples to induce the knowledge, which can then be used to recommend algorithms for new data sets. The paper makes a contribution to both meta-learning and cost-sensitive machine learning approaches. Those both fields are not new, however, building a recommender that recommends the optimal case-sensitive approach for a given data problem is the contribution. The proposed solution is implemented in WEKA and evaluated by applying it on different data sets and comparing the results with existing studies available in the literature. The results show that a developed meta-learning solution produces better results than METAL, a well-known meta-learning system. The developed solution takes the misclassification cost into consideration during the learning process, which is not available in the compared project.

Findings

The proposed solution is implemented in WEKA and evaluated by applying it to different data sets and comparing the results with existing studies available in the literature. The results show that a developed meta-learning solution produces better results than METAL, a well-known meta-learning system.

Originality/value

The paper presents a major piece of new information in writing for the first time. Meta-learning work has been done before but this paper presents a new meta-learning framework that is costs sensitive.

Details

Journal of Modelling in Management, vol. 17 no. 3
Type: Research Article
ISSN: 1746-5664

Keywords

Book part
Publication date: 30 September 2020

Madhulika Bhatia, Shubham Sharma, Madhurima Hooda and Narayan C. Debnath

Recent research advances in artificial intelligence, machine learning, and neural networks are becoming essential tools for building a wide range of intelligent applications…

Abstract

Recent research advances in artificial intelligence, machine learning, and neural networks are becoming essential tools for building a wide range of intelligent applications. Moreover, machine learning helps to automate analytical model building. Machine learning based frameworks and approaches allow making well-informed and intelligent choices for improving daily eating habits and extension of healthy lifestyle. This book chapter presents a new machine learning approach for meal classification and assessment of nutrients values based on weather conditions along with new and innovative ideas for further study and research on health care-related applications.

Details

Big Data Analytics and Intelligence: A Perspective for Health Care
Type: Book
ISBN: 978-1-83909-099-8

Keywords

Open Access
Article
Publication date: 28 April 2023

Prudence Kadebu, Robert T.R. Shoniwa, Kudakwashe Zvarevashe, Addlight Mukwazvure, Innocent Mapanga, Nyasha Fadzai Thusabantu and Tatenda Trust Gotora

Given how smart today’s malware authors have become through employing highly sophisticated techniques, it is only logical that methods be developed to combat the most potent…

1808

Abstract

Purpose

Given how smart today’s malware authors have become through employing highly sophisticated techniques, it is only logical that methods be developed to combat the most potent threats, particularly where the malware is stealthy and makes indicators of compromise (IOC) difficult to detect. After the analysis is completed, the output can be employed to detect and then counteract the attack. The goal of this work is to propose a machine learning approach to improve malware detection by combining the strengths of both supervised and unsupervised machine learning techniques. This study is essential as malware has certainly become ubiquitous as cyber-criminals use it to attack systems in cyberspace. Malware analysis is required to reveal hidden IOC, to comprehend the attacker’s goal and the severity of the damage and to find vulnerabilities within the system.

Design/methodology/approach

This research proposes a hybrid approach for dynamic and static malware analysis that combines unsupervised and supervised machine learning algorithms and goes on to show how Malware exploiting steganography can be exposed.

Findings

The tactics used by malware developers to circumvent detection are becoming more advanced with steganography becoming a popular technique applied in obfuscation to evade mechanisms for detection. Malware analysis continues to call for continuous improvement of existing techniques. State-of-the-art approaches applying machine learning have become increasingly popular with highly promising results.

Originality/value

Cyber security researchers globally are grappling with devising innovative strategies to identify and defend against the threat of extremely sophisticated malware attacks on key infrastructure containing sensitive data. The process of detecting the presence of malware requires expertise in malware analysis. Applying intelligent methods to this process can aid practitioners in identifying malware’s behaviour and features. This is especially expedient where the malware is stealthy, hiding IOC.

Details

International Journal of Industrial Engineering and Operations Management, vol. 5 no. 2
Type: Research Article
ISSN: 2690-6090

Keywords

Open Access
Article
Publication date: 3 August 2020

Djordje Cica, Branislav Sredanovic, Sasa Tesic and Davorin Kramar

Sustainable manufacturing is one of the most important and most challenging issues in present industrial scenario. With the intention of diminish negative effects associated with…

2094

Abstract

Sustainable manufacturing is one of the most important and most challenging issues in present industrial scenario. With the intention of diminish negative effects associated with cutting fluids, the machining industries are continuously developing technologies and systems for cooling/lubricating of the cutting zone while maintaining machining efficiency. In the present study, three regression based machine learning techniques, namely, polynomial regression (PR), support vector regression (SVR) and Gaussian process regression (GPR) were developed to predict machining force, cutting power and cutting pressure in the turning of AISI 1045. In the development of predictive models, machining parameters of cutting speed, depth of cut and feed rate were considered as control factors. Since cooling/lubricating techniques significantly affects the machining performance, prediction model development of quality characteristics was performed under minimum quantity lubrication (MQL) and high-pressure coolant (HPC) cutting conditions. The prediction accuracy of developed models was evaluated by statistical error analyzing methods. Results of regressions based machine learning techniques were also compared with probably one of the most frequently used machine learning method, namely artificial neural networks (ANN). Finally, a metaheuristic approach based on a neural network algorithm was utilized to perform an efficient multi-objective optimization of process parameters for both cutting environment.

Details

Applied Computing and Informatics, vol. 20 no. 1/2
Type: Research Article
ISSN: 2634-1964

Keywords

Book part
Publication date: 10 February 2023

Ryan Varghese, Abha Deshpande, Gargi Digholkar and Dileep Kumar

Background: Artificial intelligence (AI) is a booming sector that has profoundly influenced every walk of life, and the education sector is no exception. In education, AI has…

Abstract

Background: Artificial intelligence (AI) is a booming sector that has profoundly influenced every walk of life, and the education sector is no exception. In education, AI has helped to develop novel teaching and learning solutions that are currently being tested in various contexts. Businesses and governments across the globe have been pouring money into a wide array of implementations, and dozens of EdTech start-ups are being funded to capitalise on this technological force. The penetration of AI in classroom teaching is also a profound matter of discussion. These have garnered massive amounts of student big data and have a significant impact on the life of both students and educators alike.

Purpose: The prime focus of this chapter is to extensively review and analyse the vast literature available on the utilities of AI in health care, learning, and development. The specific objective of thematic exploration of the literature is to explicate the principal facets and recent advances in the development and employment of AI in the latter. This chapter also aims to explore how the EdTech and healthcare–education sectors would witness a paradigm shift with the advent and incorporation of AI.

Design/Methodology/Approach: To provide context and evidence, relevant publications were identified on ScienceDirect, PubMed, and Google Scholar using keywords like AI, education, learning, health care, and development. In addition, the latest articles were also thoroughly reviewed to underscore recent advances in the same field.

Results: The implementation of AI in the learning, development, and healthcare sector is rising steeply, with a projected expansion of about 50% by 2022. These algorithms and user interfaces economically facilitate efficient delivery of the latter.

Conclusions: The EdTech and healthcare sector has great potential for a spectrum of AI-based interventions, providing access to learning opportunities and personalised experiences. These interventions are often economic in the long run compared to conventional modalities. However, several ethical and regulatory concerns should be addressed before the complete adoption of AI in these sectors.

Originality/Value: The value in exploring this topic is to present a view on the potential of employing AI in health care, medical education, and learning and development. It also intends to open a discussion of its potential benefits and a remedy to its shortcomings.

Details

The Adoption and Effect of Artificial Intelligence on Human Resources Management, Part B
Type: Book
ISBN: 978-1-80455-662-7

Keywords

Article
Publication date: 6 September 2022

Hanane Sebbaq and Nour-eddine El Faddouli

The purpose of this study is, First, to leverage the limitation of annotated data and to identify the cognitive level of learning objectives efficiently, this study adopts…

Abstract

Purpose

The purpose of this study is, First, to leverage the limitation of annotated data and to identify the cognitive level of learning objectives efficiently, this study adopts transfer learning by using word2vec and a bidirectional gated recurrent units (GRU) that can fully take into account the context and improves the classification of the model. This study adds a layer based on attention mechanism (AM), which captures the context vector and gives keywords higher weight for text classification. Second, this study explains the authors’ model’s results with local interpretable model-agnostic explanations (LIME).

Design/methodology/approach

Bloom's taxonomy levels of cognition are commonly used as a reference standard for identifying e-learning contents. Many action verbs in Bloom's taxonomy, however, overlap at different levels of the hierarchy, causing uncertainty regarding the cognitive level expected. Some studies have looked into the cognitive classification of e-learning content but none has looked into learning objectives. On the other hand, most of these research papers just adopt classical machine learning algorithms. The main constraint of this study is the availability of annotated learning objectives data sets. This study managed to build a data set of 2,400 learning objectives, but this size remains limited.

Findings

This study’s experiments show that the proposed model achieves highest scores of accuracy: 90.62%, F1-score and loss. The proposed model succeeds in classifying learning objectives, which contain ambiguous verb from the Bloom’s taxonomy action verbs, while the same model without the attention layer fails. This study’s LIME explainer aids in visualizing the most essential features of the text, which contributes to justifying the final classification.

Originality/value

In this study, the main objective is to propose a model that outperforms the baseline models for learning objectives classification based on the six cognitive levels of Bloom's taxonomy. In this sense, this study builds the bidirectional GRU (BiGRU)-attention model based on the combination of the BiGRU algorithm with the AM. This study feeds the architecture with word2vec embeddings. To prove the effectiveness of the proposed model, this study compares it with four classical machine learning algorithms that are widely used for the cognitive classification of text: Bayes naive, logistic regression, support vector machine and K-nearest neighbors and with GRU. The main constraint related to this study is the absence of annotated data; there is no annotated learning objective data set based on Bloom’s taxonomy's cognitive levels. To overcome this problem, this study seemed to have no choice but to build the data set.

21 – 30 of over 48000