Search results

1 – 10 of over 1000
Article
Publication date: 26 June 2023

Chetna Choudhary, Deepti Mehrotra and Avinash K. Shrivastava

As the number of web applications is increasing day by day web mining acts as an important tool to extract useful information from weblogs and analyse them according to the…

Abstract

Purpose

As the number of web applications is increasing day by day web mining acts as an important tool to extract useful information from weblogs and analyse them according to the attributes and predict the usage of a website. The main aim of this paper is to inspect how process mining can be used to predict the web usability of hotel booking sites based on the number of users on each page, and the time of stay of each user. Through this paper, the authors analyse the web usability of a website through process mining by finding the web usability metrics. This work proposes an approach to finding the usage of a website using the attributes available in the weblog which predicts the actual footfall on a website.

Design/methodology/approach

PROM (Process Mining tool) is used for the analysis of the event log of a hotel booking site. In this work, authors have used a case study to apply the PROM (process mining tool) to pre-process the event log dataset for analysis to discover better-structured process maps than without pre-processing.

Findings

This article first provided an overview of process mining, then focused on web mining and later discussed process mining techniques. It also described different target languages: system nets (i.e. Petri nets with an initial and a final state), inductive miner and heuristic miner, graphs showing the change in behaviour of the dataset and predicting the outcome, that is the webpage having the maximum number of hits.

Originality/value

In this work, a case study has been used to apply the PROM (process mining tool) to pre-process the event log dataset for analysis to discover better-structured process maps than without pre-processing.

Details

International Journal of Quality & Reliability Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0265-671X

Keywords

Article
Publication date: 3 November 2023

Salam Abdallah and Ashraf Khalil

This study aims to understand and a lay a foundation of how analytics has been used in depression management, this study conducts a systematic literature review using two…

119

Abstract

Purpose

This study aims to understand and a lay a foundation of how analytics has been used in depression management, this study conducts a systematic literature review using two techniques – text mining and manual review. The proposed methodology would aid researchers in identifying key concepts and research gaps, which in turn, will help them to establish the theoretical background supporting their empirical research objective.

Design/methodology/approach

This paper explores a hybrid methodology for literature review (HMLR), using text mining prior to systematic manual review.

Findings

The proposed rapid methodology is an effective tool to automate and speed up the process required to identify key and emerging concepts and research gaps in any specific research domain while conducting a systematic literature review. It assists in populating a research knowledge graph that does not reach all semantic depths of the examined domain yet provides some science-specific structure.

Originality/value

This study presents a new methodology for conducting a literature review for empirical research articles. This study has explored an “HMLR” that combines text mining and manual systematic literature review. Depending on the purpose of the research, these two techniques can be used in tandem to undertake a comprehensive literature review, by combining pieces of complex textual data together and revealing areas where research might be lacking.

Details

Information Discovery and Delivery, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2398-6247

Keywords

Article
Publication date: 10 November 2023

Yong Gui and Lanxin Zhang

Influenced by the constantly changing manufacturing environment, no single dispatching rule (SDR) can consistently obtain better scheduling results than other rules for the…

Abstract

Purpose

Influenced by the constantly changing manufacturing environment, no single dispatching rule (SDR) can consistently obtain better scheduling results than other rules for the dynamic job-shop scheduling problem (DJSP). Although the dynamic SDR selection classifier (DSSC) mined by traditional data-mining-based scheduling method has shown some improvement in comparison to an SDR, the enhancement is not significant since the rule selected by DSSC is still an SDR.

Design/methodology/approach

This paper presents a novel data-mining-based scheduling method for the DJSP with machine failure aiming at minimizing the makespan. Firstly, a scheduling priority relation model (SPRM) is constructed to determine the appropriate priority relation between two operations based on the production system state and the difference between their priority values calculated using multiple SDRs. Subsequently, a training sample acquisition mechanism based on the optimal scheduling schemes is proposed to acquire training samples for the SPRM. Furthermore, feature selection and machine learning are conducted using the genetic algorithm and extreme learning machine to mine the SPRM.

Findings

Results from numerical experiments demonstrate that the SPRM, mined by the proposed method, not only achieves better scheduling results in most manufacturing environments but also maintains a higher level of stability in diverse manufacturing environments than an SDR and the DSSC.

Originality/value

This paper constructs a SPRM and mines it based on data mining technologies to obtain better results than an SDR and the DSSC in various manufacturing environments.

Details

Kybernetes, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0368-492X

Keywords

Article
Publication date: 18 September 2023

Temitope Egbelakin, Temitope Omotayo, Olabode Emmanuel Ogunmakinde and Damilola Ekundayo

Flood preparedness and response from the perspective of community engagement mechanisms have been studied in scholarly articles. However, the differences in flood mitigation may…

Abstract

Purpose

Flood preparedness and response from the perspective of community engagement mechanisms have been studied in scholarly articles. However, the differences in flood mitigation may expose social and behavioural challenges to learn from. This study aimed to demonstrate how text mining can be applied in prioritising existing contexts in community-based and government flood mitigation and management strategies.

Design/methodology/approach

This investigation mined the semantics researchers ascribed to flood disasters and community responses from 2001 to 2022 peer-reviewed publications. Text mining was used to derive frequently used terms from over 15 publications in the Scopus database and Google Scholar search engine after an initial output of 268 peer-reviewed publications. The text-mining process applied the topic modelling analyses on the 15 publications using the R studio application.

Findings

Topic modelling applied through text mining clustered four (4) themes. The themes that emerged from the topic modelling process were building adaptation to flooding, climate change and resilient communities, urban infrastructure and community preparedness and research output for flood risk and community response. The themes were supported with geographical flood risk and community mitigation contexts from the USA, India and Nigeria to provide a broader perspective.

Originality/value

This study exposed the deficiency of “communication, teamwork, responsibility and lessons” as focal themes of flood disaster management and response research. The divergence in flood mitigation in developing nations as compared with developed nations can be bridged through improved government policies, technologies and community engagement.

Details

International Journal of Building Pathology and Adaptation, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2398-4708

Keywords

Article
Publication date: 23 April 2024

Chen Zhong, Hong Liu and Hwee-Joo Kam

Cybersecurity competitions can effectively develop skills, but engaging a wide learner spectrum is challenging. This study aims to investigate the perceptions of cybersecurity…

Abstract

Purpose

Cybersecurity competitions can effectively develop skills, but engaging a wide learner spectrum is challenging. This study aims to investigate the perceptions of cybersecurity competitions among Reddit users. These users constitute a substantial demographic of young individuals, often participating in communities oriented towards college students or cybersecurity enthusiasts. The authors specifically focus on novice learners who showed an interest in cybersecurity but have not participated in competitions. By understanding their views and concerns, the authors aim to devise strategies to encourage their continuous involvement in cybersecurity learning. The Reddit platform provides unique access to this significant demographic, contributing to enhancing and diversifying the cybersecurity workforce.

Design/methodology/approach

The authors propose to mine Reddit posts for information about learners’ attitudes, interests and experiences with cybersecurity competitions. To mine Reddit posts, the authors developed a text mining approach that integrates computational text mining and qualitative content analysis techniques, and the authors discussed the advantages of the integrated approach.

Findings

The authors' text mining approach was successful in extracting the major themes from the collected posts. The authors found that motivated learners would want to form a strategic way to facilitate their learning. In addition, hope and fear collide, which exposes the learners’ interests and challenges.

Originality/value

The authors discussed the findings to provide education and training experts with a thorough understanding of novice learners, allowing them to engage them in the cybersecurity industry.

Details

Information & Computer Security, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2056-4961

Keywords

Article
Publication date: 26 March 2024

Md. Nurul Islam, Guangwei Hu, Murtaza Ashiq and Shakil Ahmad

This bibliometric study aims to analyze the latest trends and patterns of big data applications in librarianship from 2000 to 2022. By conducting a comprehensive examination of…

Abstract

Purpose

This bibliometric study aims to analyze the latest trends and patterns of big data applications in librarianship from 2000 to 2022. By conducting a comprehensive examination of the existing literature, this study aims to provide valuable insights into the emerging field of big data in librarianship and its potential impact on the future of libraries.

Design/methodology/approach

This study employed a rigorous four-stage process of identification, screening, eligibility and inclusion to filter and select the most relevant documents for analysis. The Scopus database was utilized to retrieve pertinent data related to big data applications in librarianship. The dataset comprised 430 documents, including journal articles, conference papers, book chapters, reviews and books. Through bibliometric analysis, the study examined the effectiveness of different publication types and identified the main topics and themes within the field.

Findings

The study found that the field of big data in librarianship is growing rapidly, with a significant increase in publications and citations over the past few years. China is the leading country in terms of publication output, followed by the United States of America. The most influential journals in the field are Library Hi Tech and the ACM International Conference Proceeding Series. The top authors in the field are Minami T, Wu J, Fox EA and Giles CL. The most common keywords in the literature are big data, librarianship, data mining, information retrieval, machine learning and webometrics.

Originality/value

This bibliometric study contributes to the existing body of literature by comprehensively analyzing the latest trends and patterns in big data applications within librarianship. It offers a systematic approach to understanding the state of the field and highlights the unique contributions made by various types of publications. The study’s findings and insights contribute to the originality of this research, providing a foundation for further exploration and advancement in the field of big data in librarianship.

Details

Library Hi Tech, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0737-8831

Keywords

Article
Publication date: 13 June 2023

G. Deepa, A.J. Niranjana and A.S. Balu

This study aims at proposing a hybrid model for early cost prediction of a construction project. Early cost prediction for a construction project is the basic approach to procure…

Abstract

Purpose

This study aims at proposing a hybrid model for early cost prediction of a construction project. Early cost prediction for a construction project is the basic approach to procure a project within a predefined budget. However, most of the projects routinely face the impact of cost overruns. Furthermore, conventional and manual cost computing techniques are hectic, time-consuming and error-prone. To deal with such challenges, soft computing techniques such as artificial neural networks (ANNs), fuzzy logic and genetic algorithms are applied in construction management. Each technique has its own constraints not only in terms of efficiency but also in terms of feasibility, practicability, reliability and environmental impacts. However, appropriate combination of the techniques improves the model owing to their inherent nature.

Design/methodology/approach

This paper proposes a hybrid model by combining machine learning (ML) techniques with ANN to accurately predict the cost of pile foundations. The parameters contributing toward the cost of pile foundations were collected from five different projects in India. Out of 180 collected data entries, 176 entries were finally used after data cleaning. About 70% of the final data were used for building the model and the remaining 30% were used for validation.

Findings

The proposed model is capable of predicting the pile foundation costs with an accuracy of 97.42%.

Originality/value

Although various cost estimation techniques are available, appropriate use and combination of various ML techniques aid in improving the prediction accuracy. The proposed model will be a value addition to cost estimation of pile foundations.

Details

Journal of Engineering, Design and Technology , vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1726-0531

Keywords

Article
Publication date: 1 January 2024

Shahrzad Yaghtin and Joel Mero

Machine learning (ML) techniques are increasingly important in enabling business-to-business (B2B) companies to offer personalized services to business customers. On the other…

Abstract

Purpose

Machine learning (ML) techniques are increasingly important in enabling business-to-business (B2B) companies to offer personalized services to business customers. On the other hand, humans play a critical role in dealing with uncertain situations and the relationship-building aspects of a B2B business. Most existing studies advocating human-ML augmentation simply posit the concept without providing a detailed view of augmentation. Therefore, the purpose of this paper is to investigate how human involvement can practically augment ML capabilities to develop a personalized information system (PIS) for business customers.

Design/methodology/approach

The authors developed a research framework to create an integrated human-ML PIS for business customers. The PIS was then implemented in the energy sector. Next, the accuracy of the PIS was evaluated using customer feedback. To this end, precision, recall and F1 evaluation metrics were used.

Findings

The computed figures of precision, recall and F1 (respectively, 0.73, 0.72 and 0.72) were all above 0.5; thus, the accuracy of the model was confirmed. Finally, the study presents the research model that illustrates how human involvement can augment ML capabilities in different stages of creating the PIS including the business/market understanding, data understanding, data collection and preparation, model creation and deployment and model evaluation phases.

Originality/value

This paper offers novel insight into the less-known phenomenon of human-ML augmentation for marketing purposes. Furthermore, the study contributes to the B2B personalization literature by elaborating on how human experts can augment ML computing power to create a PIS for business customers.

Details

Journal of Business & Industrial Marketing, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0885-8624

Keywords

Article
Publication date: 17 July 2023

Nghia Nguyen, Thuy-Hien Nguyen, Yen-Nhi Nguyen, Dung Doan, Minh Nguyen and Van-Ho Nguyen

The purpose of this paper is to expand and analyze deeply customer emotions, concretize the levels of positive or negative emotions with the aim of using machine learning methods…

Abstract

Purpose

The purpose of this paper is to expand and analyze deeply customer emotions, concretize the levels of positive or negative emotions with the aim of using machine learning methods, and build a model to identify customer emotions.

Design/methodology/approach

The study proposed a customer emotion detection model and data mining method based on the collected dataset, including 80,593 online reviews on agoda.com and booking.com from 2009 to 2022.

Findings

By discerning specific emotions expressed in customers' comments, emotion detection, which refers to the process of identifying users' emotional states, assumes a crucial role in evaluating the brand value of a product. The research capitalizes on the vast and diverse data sources available on hotel booking websites, which, despite their richness, remain largely unexplored and unanalyzed. The outcomes of the model, pertaining to the detection and classification of customer emotions based on ratings and reviews into four distinct emotional states, offer a means to address the challenge of determining customer satisfaction regarding their actual service experiences. These findings hold substantial value for businesses operating in this domain, as the findings facilitate the evaluation and formulation of improvement strategies within their business models. The experimental study reveals that the proposed model attains an exact match ratio, precision, and recall rates of up to 81%, 90% and 90%, respectively.

Research limitations/implications

The study has yet to mine real-time data. Prediction results may be influenced because the amount of data collected from the web is insufficient and preprocessing is not completely suppressed. Furthermore, the model in the study was not tested using all algorithms and multi-label classifiers. Future research should build databases to mine data in real-time and collect more data and enhance the current model.

Practical implications

The study's results suggest that the emotion detection models can be applied to the real world to quickly analyze customer feedback. The proposed models enable the identification of customers' emotions, the discovery of customer demand, the enhancement of service, and the general customer experience. The established models can be used by many service sectors to learn more about customer satisfaction with the offered goods and services from customer reviews.

Social implications

The research paper helps businesses in the hospitality area analyze customer emotions in each specific aspect to ensure customer satisfaction. In addition, managers can come up with appropriate strategies to bring better products and services to society and people. Subsequently, fostering the growth of the hotel tourism sector within the nation, thereby facilitating sustainable economic development on a national scale.

Originality/value

This study developed a customer emotions detection model for detecting and classifying customer ratings and reviews as 4 specific emotions: happy, angry, depressed and hopeful based on online booking hotel websites agoda.com and booking.com that contains 80,593 reviews in Vietnamese. The research results help businesses check and evaluate the quality of their services, thereby offering appropriate improvement strategies to increase customers' satisfaction and demand more effectively.

Details

Journal of Hospitality and Tourism Insights, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9792

Keywords

Article
Publication date: 28 September 2023

Moh. Riskiyadi

This study aims to compare machine learning models, datasets and splitting training-testing using data mining methods to detect financial statement fraud.

3642

Abstract

Purpose

This study aims to compare machine learning models, datasets and splitting training-testing using data mining methods to detect financial statement fraud.

Design/methodology/approach

This study uses a quantitative approach from secondary data on the financial reports of companies listed on the Indonesia Stock Exchange in the last ten years, from 2010 to 2019. Research variables use financial and non-financial variables. Indicators of financial statement fraud are determined based on notes or sanctions from regulators and financial statement restatements with special supervision.

Findings

The findings show that the Extremely Randomized Trees (ERT) model performs better than other machine learning models. The best original-sampling dataset compared to other dataset treatments. Training testing splitting 80:10 is the best compared to other training-testing splitting treatments. So the ERT model with an original-sampling dataset and 80:10 training-testing splitting are the most appropriate for detecting future financial statement fraud.

Practical implications

This study can be used by regulators, investors, stakeholders and financial crime experts to add insight into better methods of detecting financial statement fraud.

Originality/value

This study proposes a machine learning model that has not been discussed in previous studies and performs comparisons to obtain the best financial statement fraud detection results. Practitioners and academics can use findings for further research development.

Details

Asian Review of Accounting, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1321-7348

Keywords

Access

Year

Last 12 months (1550)

Content type

Earlycite article (1550)
1 – 10 of over 1000