Search results

1 – 10 of 359
Article
Publication date: 26 June 2023

Chetna Choudhary, Deepti Mehrotra and Avinash K. Shrivastava

As the number of web applications is increasing day by day web mining acts as an important tool to extract useful information from weblogs and analyse them according to the…

Abstract

Purpose

As the number of web applications is increasing day by day web mining acts as an important tool to extract useful information from weblogs and analyse them according to the attributes and predict the usage of a website. The main aim of this paper is to inspect how process mining can be used to predict the web usability of hotel booking sites based on the number of users on each page, and the time of stay of each user. Through this paper, the authors analyse the web usability of a website through process mining by finding the web usability metrics. This work proposes an approach to finding the usage of a website using the attributes available in the weblog which predicts the actual footfall on a website.

Design/methodology/approach

PROM (Process Mining tool) is used for the analysis of the event log of a hotel booking site. In this work, authors have used a case study to apply the PROM (process mining tool) to pre-process the event log dataset for analysis to discover better-structured process maps than without pre-processing.

Findings

This article first provided an overview of process mining, then focused on web mining and later discussed process mining techniques. It also described different target languages: system nets (i.e. Petri nets with an initial and a final state), inductive miner and heuristic miner, graphs showing the change in behaviour of the dataset and predicting the outcome, that is the webpage having the maximum number of hits.

Originality/value

In this work, a case study has been used to apply the PROM (process mining tool) to pre-process the event log dataset for analysis to discover better-structured process maps than without pre-processing.

Details

International Journal of Quality & Reliability Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0265-671X

Keywords

Open Access
Article
Publication date: 11 October 2023

Bachriah Fatwa Dhini, Abba Suganda Girsang, Unggul Utan Sufandi and Heny Kurniawati

The authors constructed an automatic essay scoring (AES) model in a discussion forum where the result was compared with scores given by human evaluators. This research proposes…

Abstract

Purpose

The authors constructed an automatic essay scoring (AES) model in a discussion forum where the result was compared with scores given by human evaluators. This research proposes essay scoring, which is conducted through two parameters, semantic and keyword similarities, using a SentenceTransformers pre-trained model that can construct the highest vector embedding. Combining these models is used to optimize the model with increasing accuracy.

Design/methodology/approach

The development of the model in the study is divided into seven stages: (1) data collection, (2) pre-processing data, (3) selected pre-trained SentenceTransformers model, (4) semantic similarity (sentence pair), (5) keyword similarity, (6) calculate final score and (7) evaluating model.

Findings

The multilingual paraphrase-multilingual-MiniLM-L12-v2 and distilbert-base-multilingual-cased-v1 models got the highest scores from comparisons of 11 pre-trained multilingual models of SentenceTransformers with Indonesian data (Dhini and Girsang, 2023). Both multilingual models were adopted in this study. A combination of two parameters is obtained by comparing the response of the keyword extraction responses with the rubric keywords. Based on the experimental results, proposing a combination can increase the evaluation results by 0.2.

Originality/value

This study uses discussion forum data from the general biology course in online learning at the open university for the 2020.2 and 2021.2 semesters. Forum discussion ratings are still manual. In this survey, the authors created a model that automatically calculates the value of discussion forums, which are essays based on the lecturer's answers moreover rubrics.

Details

Asian Association of Open Universities Journal, vol. 18 no. 3
Type: Research Article
ISSN: 1858-3431

Keywords

Open Access
Article
Publication date: 31 July 2023

Daniel Šandor and Marina Bagić Babac

Sarcasm is a linguistic expression that usually carries the opposite meaning of what is being said by words, thus making it difficult for machines to discover the actual meaning…

2864

Abstract

Purpose

Sarcasm is a linguistic expression that usually carries the opposite meaning of what is being said by words, thus making it difficult for machines to discover the actual meaning. It is mainly distinguished by the inflection with which it is spoken, with an undercurrent of irony, and is largely dependent on context, which makes it a difficult task for computational analysis. Moreover, sarcasm expresses negative sentiments using positive words, allowing it to easily confuse sentiment analysis models. This paper aims to demonstrate the task of sarcasm detection using the approach of machine and deep learning.

Design/methodology/approach

For the purpose of sarcasm detection, machine and deep learning models were used on a data set consisting of 1.3 million social media comments, including both sarcastic and non-sarcastic comments. The data set was pre-processed using natural language processing methods, and additional features were extracted and analysed. Several machine learning models, including logistic regression, ridge regression, linear support vector and support vector machines, along with two deep learning models based on bidirectional long short-term memory and one bidirectional encoder representations from transformers (BERT)-based model, were implemented, evaluated and compared.

Findings

The performance of machine and deep learning models was compared in the task of sarcasm detection, and possible ways of improvement were discussed. Deep learning models showed more promise, performance-wise, for this type of task. Specifically, a state-of-the-art model in natural language processing, namely, BERT-based model, outperformed other machine and deep learning models.

Originality/value

This study compared the performance of the various machine and deep learning models in the task of sarcasm detection using the data set of 1.3 million comments from social media.

Details

Information Discovery and Delivery, vol. 52 no. 2
Type: Research Article
ISSN: 2398-6247

Keywords

Book part
Publication date: 15 April 2024

Adriana AnaMaria Davidescu, Eduard Mihai Manta and Maria Ruxandra Cojocaru

Purpose: Students’ transition from education to employment is influenced by factors like the length and calibre of their education, demography, labour market conditions, and the…

Abstract

Purpose: Students’ transition from education to employment is influenced by factors like the length and calibre of their education, demography, labour market conditions, and the general state of the economy. Regardless of the economy, education systems should seek to ensure that students have the skills required for the labour market. This will help them better transition from school to work. This study examines the work skills that companies require for entry-level positions in Romania.

Need for Study: Previously, text analysis studies treated the job market only for the IT industry in Romania. To understand the demand-side opportunities and restrictions, assessing the employment opportunities for young people in the Romanian labour market is necessary.

Methodology: A text mining approach from 842 unstructured data of the existing job positions in October 2022 for fresh graduates or students is used in this chapter. The study uses data from LinkedIn job descriptions in the Romanian job market. The methodology involved is focused on text retrieval, text-pre-processing, word cloud analysis, network analysis, and topic modelling.

Findings: The empirical findings revealed that the most common words in job descriptions are experience, team, work, skills, development, knowledge, support, data, business, and software. The correlation network revealed that the most correlated pairs of words are gender–sexual–race–religion–origin–diversity–age–identity–orientation–colour–equal–marital.

Practical Implications: This study looked at the job market and used text analytics to extract a space of skill and qualification dimensions from job announcements relevant to the Romanian employment market instead of depending on subjective knowledge.

Details

Contemporary Challenges in Social Science Management: Skills Gaps and Shortages in the Labour Market
Type: Book
ISBN: 978-1-83753-170-7

Keywords

Article
Publication date: 19 April 2024

Ean Teng Khor and Dave Darshan

This study leverages social network analysis (SNA) to visualise the way students interacted with online resources and uses the data obtained from SNA as features for supervised…

Abstract

Purpose

This study leverages social network analysis (SNA) to visualise the way students interacted with online resources and uses the data obtained from SNA as features for supervised machine learning algorithms to predict whether a student will successfully complete a course.

Design/methodology/approach

The exploration and visualisation of the data were first carried out to gain a better understanding of the students, the course(s) each student was enrolled in and each course’s virtual learning resources. Following this, the construction of the social network graphs was performed to depict how each student behaved online before the degree centralities were computed for each of the nodes in a social network graph. Data pre-processing to assign labels based on the final result a student obtained in a course was then performed before we trained and tested models to predict which students did or did not graduate.

Findings

The study’s findings demonstrate that the constructed predictive model has good performance, as shown by the accuracy, precision, recall and f-measure metrics. The outcomes also showed that students’ use of online resources is a crucial element that influences how well they perform in their academics.

Originality/value

The similarity index is as low as 9%.

Details

The International Journal of Information and Learning Technology, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2056-4880

Keywords

Article
Publication date: 9 October 2023

Xiaoguang Wang, Yue Cheng, Tao Lv and Rongjiang Cai

The authors hope to filter valuable information from online reviews, obtain objective and accurate information about the demands of auto consumers and help auto companies develop…

Abstract

Purpose

The authors hope to filter valuable information from online reviews, obtain objective and accurate information about the demands of auto consumers and help auto companies develop more reasonable production and marketing strategies for healthy and sustainable development. This paper aims to discuss the aforementioned objectives.

Design/methodology/approach

The authors collected review data from online automotive forums and generated a corpus after pre-processing. Then, the authors extracted consumer demands and topics using the LDA model. Finally, the authors used a trained Word2vec tool to extend the consumer demand topics.

Findings

Different types of vehicle consumers have the same demands, such as “Space,” “Power Performance,” and “Brand Comparison,” and distinct demands, such as “Appearance,” “Safety,” “Service,” and “New Energy Features”; consumers who buy new energy vehicles are still accustomed to comparing with the brands or models of fuel vehicles; new energy vehicles consumers pay more attention to services and service quality during the purchasing and using process.

Research limitations/implications

The development time of new energy vehicles is relatively short, with some models being available for only one year or even six months. The smaller amount of available data may impact the applicability of topic models. The sample size, especially for new energy vehicles, needs to be increased to improve the general applicability of topic models further.

Practical implications

First, this measure helps online review websites improve their existing review publication mechanisms, enhance the overall quality of online review content, increase user traffic and promote the healthy development of online review websites. Second, this allows for timely adjustments in future product production and sales plans and further enhances automotive companies' ability to leverage online reviews for Internet marketing.

Originality/value

The authors have improved the accuracy and stability of the fused topic model, providing a scientific and efficient research tool for multi-dimensional topic mining of online reviews. With the help of research results, consumers can more easily understand the discussion topics and thus filter out valuable reference information. As a result, automotive companies may gain information about consumer demands and product quality feedback and thus quickly adjust production and marketing strategies to increase sales and market share.

Details

Marketing Intelligence & Planning, vol. 41 no. 8
Type: Research Article
ISSN: 0263-4503

Keywords

Article
Publication date: 20 September 2023

Shamima Khatoon and Gufran Ahmad

The hygroscopic properties of 3D-printed filaments and moisture absorption itself during the process result in dimensional inaccuracy, particularly for nozzle movement along the…

Abstract

Purpose

The hygroscopic properties of 3D-printed filaments and moisture absorption itself during the process result in dimensional inaccuracy, particularly for nozzle movement along the x-axis and for micro-scale features. In view of that, this study aims to analyze in depth the dimensional errors and deviations of the fused filament fabrication (FFF)/fused deposition modeling (FDM) 3D-printed micropillars (MPs) from the reference values. A detailed analysis into the variability in printed dimensions below 1 mm in width without any deformations in the printed shape of the designed features, for challenging filaments like polymethyl methacrylate (PMMA) has been done. The study also explores whether the printed shape retains the designed structure.

Design/methodology/approach

A reference model for MPs of width 800 µm and height 2,000 µm is selected to generate a g-code model after pre-processing of slicing and meshing parameters for 3D printing of micro-scale structure with defined boundaries. Three SETs, SET-A, SET-B and SET-C, for nozzle diameter of 0.2 mm, 0.25 mm and 0.3 mm, respectively, have been prepared. The SETs containing the MPs were fabricated with the spacing (S) of 2,000 µm, 3,200 µm and 4,000 µm along the print head x-axis. The MPs were measured by taking three consecutive measurements (top, bottom and middle) for the width and one for the height.

Findings

The prominent highlight of this study is the successful FFF/FDM 3D printing of thin features (<1mm) without any deformation. The mathematical analysis of the variance of the optical microscopy measurements concluded that printed dimensions for micropillar widths did not vary significantly, retaining more than 65% of the recording within the first standard deviation (SD) (±1 s). The minimum value of SD is obtained from the samples of SET-B, that is, 31.96 µm and 35.865 µm, for height and width, respectively. The %RE for SET-B samples is 5.09% for S = 2,000µm, 3.86% for S = 3,200µm and 1.09% for S = 4,000µm. The error percentage is so small that it could be easily compensated by redesigning.

Research limitations/implications

The study does not cover other 3D printing techniques of additive manufacturing like stereolithography, digital light processing and material jetting.

Practical implications

The presented study can be potentially implemented for the rapid prototyping of microfluidics mixer, bioseparator and lab-on-chip devices, both for membrane-free bioseparation based on microfiltration, plasma extraction from whole blood, size-selection trapping of unwanted blood cells, and also for membrane-based plasma extraction that requires supporting microstructures. Our developed process may prove to be far more economical than the other existing techniques for such applications.

Originality/value

For the first time, this work presents a comprehensive analysis of the fabrication of micropillars using FDM/FFF 3D printing and PMMA in filament form. The primary focus of the study is to minimize the dimensional inaccuracies in the 3D printed devices containing thin features, especially in the area of biomedical engineering, by delivering benefits from the choice of the parameters. Thus, on the basis of errors and deviations, a thorough comparison of the three SETs of the fabricated micropillars has been done.

Article
Publication date: 3 November 2023

Vimala Balakrishnan, Aainaa Nadia Mohammed Hashim, Voon Chung Lee, Voon Hee Lee and Ying Qiu Lee

This study aims to develop a machine learning model to detect structure fire fatalities using a dataset comprising 11,341 cases from 2011 to 2019.

30

Abstract

Purpose

This study aims to develop a machine learning model to detect structure fire fatalities using a dataset comprising 11,341 cases from 2011 to 2019.

Design/methodology/approach

Exploratory data analysis (EDA) was conducted prior to modelling, in which ten machine learning models were experimented with.

Findings

The main fatal structure fire risk factors were fires originating from bedrooms, living areas and the cooking/dining areas. The highest fatality rate (20.69%) was reported for fires ignited due to bedding (23.43%), despite a low fire incident rate (3.50%). Using 21 structure fire features, Random Forest (RF) yielded the best detection performance with 86% accuracy, followed by Decision Tree (DT) with bagging (accuracy = 84.7%).

Research limitations/practical implications

Limitations of the study are pertaining to data quality and grouping of categories in the data pre-processing stage, which could affect the performance of the models.

Originality/value

The study is the first of its kind to manipulate risk factors to detect fatal structure classification, particularly focussing on structure fire fatalities. Most of the previous studies examined the importance of fire risk factors and their relationship to the fire risk level.

Details

International Journal of Intelligent Computing and Cybernetics, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 2 October 2023

Rahat Gulzar, Sumeer Gul, Manoj Kumar Verma, Mushtaq Ahmad Darzi, Farzana Gulzar and Sheikh Shueb

Sharing and obtaining information over social media has enabled people to express their opinions regarding any event. Since the tweets regarding the Russia-Ukraine war were…

Abstract

Purpose

Sharing and obtaining information over social media has enabled people to express their opinions regarding any event. Since the tweets regarding the Russia-Ukraine war were extensively publicized on social media, this study aims to analyse the temporal sentiments people express through tweets related to the war.

Design/methodology/approach

Relevant hashtag related to the Russia-Ukraine war was identified, and tweets were downloaded using Twitter API, which were later migrated to Orange Data mining software. Pre-processing techniques like transformation, tokenization, and filtering were applied to the extracted tweets. VADER (Valence Aware Dictionary for Sentiment Reasoning) sentiment analysis module of Orange software was used to categorize tweets into positive, negative and neutral ones based on the tweet polarity. For ascertaining the key and co-occurring terms and phrases in tweets and also to visualize the keyword clusters, VOSviewer, a data visualization software, was made use of.

Findings

An increase in the number of tweets is witnessed in the initial days, while a decline is observed over time. Most tweets are negative in nature, followed by positive and neutral ones. It is also ascertained that tweets from verified accounts are more impactful than unverified ones. russiaukrainewar, ukraine, russia, false, war, nato, zelensky and stoprussia are the dominant co-occurring keywords. Ukraine, Russia and Putin are the top hashtags for sentiment representation. India, the USA and the UK contribute the highest tweets.

Originality/value

The study tries to explore the public sentiments expressed over Twitter related to Russia-Ukraine war.

Details

Global Knowledge, Memory and Communication, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9342

Keywords

Open Access
Article
Publication date: 22 September 2023

Nengsheng Bao, Yuchen Fan, Chaoping Li and Alessandro Simeone

Lubricating oil leakage is a common issue in thermal power plant operation sites, requiring prompt equipment maintenance. The real-time detection of leakage occurrences could…

Abstract

Purpose

Lubricating oil leakage is a common issue in thermal power plant operation sites, requiring prompt equipment maintenance. The real-time detection of leakage occurrences could avoid disruptive consequences caused by the lack of timely maintenance. Currently, inspection operations are mostly carried out manually, resulting in time-consuming processes prone to health and safety hazards. To overcome such issues, this paper proposes a machine vision-based inspection system aimed at automating the oil leakage detection for improving the maintenance procedures.

Design/methodology/approach

The approach aims at developing a novel modular-structured automatic inspection system. The image acquisition module collects digital images along a predefined inspection path using a dual-light (i.e. ultraviolet and blue light) illumination system, deploying the fluorescence of the lubricating oil while suppressing unwanted background noise. The image processing module is designed to detect the oil leakage within the digital images minimizing detection errors. A case study is reported to validate the industrial suitability of the proposed inspection system.

Findings

On-site experimental results demonstrate the capabilities to complete the automatic inspection procedures of the tested industrial equipment by achieving an oil leakage detection accuracy up to 99.13%.

Practical implications

The proposed inspection system can be adopted in industrial context to detect lubricant leakage ensuring the equipment and the operators safety.

Originality/value

The proposed inspection system adopts a computer vision approach, which deploys the combination of two separate sources of light, to boost the detection capabilities, enabling the application for a variety of particularly hard-to-inspect industrial contexts.

Details

Journal of Quality in Maintenance Engineering, vol. 29 no. 5
Type: Research Article
ISSN: 1355-2511

Keywords

1 – 10 of 359