Search results

1 – 10 of 60
Open Access
Article
Publication date: 31 July 2023

Daniel Šandor and Marina Bagić Babac

Sarcasm is a linguistic expression that usually carries the opposite meaning of what is being said by words, thus making it difficult for machines to discover the actual meaning…

Abstract

Purpose

Sarcasm is a linguistic expression that usually carries the opposite meaning of what is literally said, making it difficult for machines to discover the actual meaning. It is mainly distinguished by the inflection with which it is spoken, with an undercurrent of irony, and is largely dependent on context, which makes it a difficult task for computational analysis. Moreover, sarcasm expresses negative sentiments using positive words, allowing it to easily confuse sentiment analysis models. This paper aims to demonstrate the task of sarcasm detection using machine learning and deep learning approaches.

Design/methodology/approach

For the purpose of sarcasm detection, machine learning and deep learning models were applied to a data set of 1.3 million social media comments, comprising both sarcastic and non-sarcastic comments. The data set was pre-processed using natural language processing methods, and additional features were extracted and analysed. Several machine learning models, including logistic regression, ridge regression, a linear support vector classifier and support vector machines, along with two deep learning models based on bidirectional long short-term memory and one bidirectional encoder representations from transformers (BERT)-based model, were implemented, evaluated and compared.
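One of the classical baselines named above can be sketched as follows. This is a minimal illustration only: the inline comments and labels are invented examples, not the authors' 1.3-million-comment corpus, and the pipeline details (TF-IDF features, default regularisation) are assumptions rather than the paper's exact configuration.

```python
# Minimal sketch: TF-IDF features + logistic regression as a sarcasm
# classifier. Data below is hypothetical, for illustration only.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

comments = [
    "Oh great, another Monday. Just what I needed.",      # sarcastic
    "Wow, I love being stuck in traffic for two hours.",  # sarcastic
    "Sure, because that worked so well last time.",       # sarcastic
    "This tutorial was genuinely helpful, thanks.",       # literal
    "The weather is lovely today.",                       # literal
    "I really enjoyed the concert last night.",           # literal
]
labels = [1, 1, 1, 0, 0, 0]  # 1 = sarcastic, 0 = not sarcastic

# Unigrams + bigrams give the model a little context to work with.
model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LogisticRegression())
model.fit(comments, labels)

pred = model.predict(["Oh great, more traffic. I love it."])[0]
```

At the scale of the study's corpus, the same pipeline shape applies; only the vectoriser vocabulary and training time grow.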

Findings

The performance of machine learning and deep learning models was compared on the task of sarcasm detection, and possible ways of improvement were discussed. Deep learning models showed more promise for this type of task. Specifically, a state-of-the-art model in natural language processing, namely a BERT-based model, outperformed the other machine and deep learning models.

Originality/value

This study compared the performance of the various machine and deep learning models in the task of sarcasm detection using the data set of 1.3 million comments from social media.

Details

Information Discovery and Delivery, vol. 52 no. 2
Type: Research Article
ISSN: 2398-6247

Open Access
Article
Publication date: 26 April 2024

Adela Sobotkova, Ross Deans Kristensen-McLachlan, Orla Mallon and Shawn Adrian Ross

This paper provides practical advice for archaeologists and heritage specialists wishing to use ML approaches to identify archaeological features in high-resolution satellite…

Abstract

Purpose

This paper provides practical advice for archaeologists and heritage specialists wishing to use ML approaches to identify archaeological features in high-resolution satellite imagery (or other remotely sensed data sources). We seek to balance the disproportionately optimistic literature related to the application of ML to archaeological prospection through a discussion of limitations, challenges and other difficulties. We further seek to raise awareness among researchers of the time, effort, expertise and resources necessary to implement ML successfully, so that they can make an informed choice between ML and manual inspection approaches.

Design/methodology/approach

Automated object detection has been the holy grail of archaeological remote sensing for the last two decades. Machine learning (ML) models have proven able to detect uniform features across a consistent background, but more variegated imagery remains a challenge. We set out to detect burial mounds in satellite imagery from a diverse landscape in Central Bulgaria using a pre-trained Convolutional Neural Network (CNN) plus additional but low-touch training to improve performance. Training was accomplished using MOUND/NOT MOUND cutouts, and the model assessed arbitrary tiles of the same size from the image. Results were assessed using field data.

Findings

Validation of results against field data showed that self-reported success rates were misleadingly high and that the model was misidentifying most features. With an identification threshold of 60% probability, and with the CNN assessing tiles of a fixed size, tile-based false negative rates were 95–96%, false positive rates were 87–95% of tagged tiles, and true positives were only 5–13%. Counterintuitively, the model provided with training data selected for highly visible mounds (rather than all mounds) performed worse. Development of the model, meanwhile, required approximately 135 person-hours of work.

Research limitations/implications

Our attempt to deploy a pre-trained CNN demonstrates the limitations of this approach when it is used to detect varied features of different sizes within a heterogeneous landscape that contains confounding natural and modern features, such as roads, forests and field boundaries. The model has detected incidental features rather than the mounds themselves, making external validation with field data an essential part of CNN workflows. Correcting the model would require refining the training data as well as adopting different approaches to model choice and execution, raising the computational requirements beyond the level of most cultural heritage practitioners.

Practical implications

Improving the pre-trained model’s performance would require considerable time and resources, on top of the time already invested. The degree of manual intervention required – particularly around the subsetting and annotation of training data – is so significant that it raises the question of whether it would be more efficient to identify all of the mounds manually, either through brute-force inspection by experts or by crowdsourcing the analysis to trained – or even untrained – volunteers. Researchers and heritage specialists seeking efficient methods for extracting features from remotely sensed data should weigh the costs and benefits of ML versus manual approaches carefully.

Social implications

Our literature review indicates that the use of artificial intelligence (AI) and ML approaches to archaeological prospection has grown exponentially in the past decade, approaching adoption levels associated with “crossing the chasm” from innovators and early adopters to the majority of researchers. The literature itself, however, is overwhelmingly positive, reflecting some combination of publication bias and a rhetoric of unconditional success. This paper presents the failure of a good-faith attempt to utilise these approaches as a counterbalance and cautionary tale to potential adopters of the technology. Early-majority adopters may find ML difficult to implement effectively in real-life scenarios.

Originality/value

Unlike many high-profile reports from well-funded projects, our paper represents a serious but modestly resourced attempt to apply an ML approach to archaeological remote sensing, using techniques like transfer learning that are promoted as solutions to time and cost problems associated with, e.g. annotating and manipulating training data. While the majority of articles uncritically promote ML, or only discuss how challenges were overcome, our paper investigates how – despite reasonable self-reported scores – the model failed to locate the target features when compared to field data. We also present time, expertise and resourcing requirements, a rarity in ML-for-archaeology publications.

Details

Journal of Documentation, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0022-0418

Open Access
Article
Publication date: 23 January 2024

Luís Jacques de Sousa, João Poças Martins, Luís Sanhudo and João Santos Baptista

This study aims to review recent advances towards the implementation of ANN and NLP applications during the budgeting phase of the construction process. During this phase…

Abstract

Purpose

This study aims to review recent advances towards the implementation of artificial neural network (ANN) and natural language processing (NLP) applications during the budgeting phase of the construction process. During this phase, construction companies must assess the scope of each task and map the client’s expectations to an internal database of tasks, resources and costs. Quantity surveyors carry out this assessment manually with little to no computer aid, under very tight time constraints, even though the results determine the company’s bid quality and are contractually binding.

Design/methodology/approach

This paper seeks to compile applications of machine learning (ML) and natural language processing in the architecture, engineering and construction sector to identify which methodologies can assist this assessment. The paper carries out a systematic literature review, following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, to survey the main scientific contributions within the topic of text classification (TC) for budgeting in construction.

Findings

This work concludes that it is necessary to develop data sets that represent the variety of tasks in construction, achieve higher accuracy algorithms, widen the scope of their application and reduce the need for expert validation of the results. Although full automation is not within reach in the short term, TC algorithms can provide helpful support tools.

Originality/value

Given the increasing interest in ML for construction and recent developments, the findings disclosed in this paper contribute to the body of knowledge, provide a more automated perspective on budgeting in construction and break ground for further implementation of text-based ML in budgeting for construction.

Details

Construction Innovation, vol. 24 no. 7
Type: Research Article
ISSN: 1471-4175

Open Access
Article
Publication date: 26 April 2024

Xue Xin, Yuepeng Jiao, Yunfeng Zhang, Ming Liang and Zhanyong Yao

This study aims to ensure reliable analysis of dynamic responses in asphalt pavement structures. It investigates noise reduction and data mining techniques for pavement dynamic…

Abstract

Purpose

This study aims to ensure reliable analysis of dynamic responses in asphalt pavement structures. It investigates noise reduction and data mining techniques for pavement dynamic response signals.

Design/methodology/approach

The paper first conducts time-frequency analysis on pavement dynamic response signals. It then applies two common noise reduction methods, namely low-pass filtering and wavelet decomposition and reconstruction, to evaluate their effectiveness in reducing noise in these signals. Because these signals are generated in response to vehicle loading, they contain a substantial amount of data and are prone to environmental interference, potentially resulting in outliers. Hence, it becomes crucial to extract dynamic strain response features (e.g. peaks and peak intervals) efficiently and in real time.

Findings

The study introduces an improved density-based spatial clustering of applications with noise (DBSCAN) algorithm for identifying outliers in denoised data. The results demonstrate that low-pass filtering is highly effective in reducing noise in pavement dynamic response signals within specified frequency ranges. The improved DBSCAN algorithm effectively identifies outliers in these signals through testing. Furthermore, the peak detection process, using the enhanced findpeaks function, consistently achieves excellent performance in identifying peak values, even when complex multi-axle heavy-duty truck strain signals are present.
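The two post-processing steps described above can be sketched on a synthetic strain signal. This is an illustration under stated assumptions: the signal is simulated, a stock DBSCAN stands in for the authors' improved variant, and SciPy's find_peaks stands in for the enhanced MATLAB findpeaks function.

```python
# Sketch: DBSCAN flags isolated extreme samples as outliers, then a
# height-thresholded peak search locates axle-load strain peaks.
# Signal and parameters are illustrative, not the study's field data.
import numpy as np
from scipy.signal import find_peaks
from sklearn.cluster import DBSCAN

rng = np.random.default_rng(0)
signal = rng.normal(0.0, 0.05, 300)  # denoised baseline strain (arbitrary units)
signal[[50, 150, 250]] = 5.0         # three axle-load strain peaks

# DBSCAN on amplitude values: the dense near-zero cluster is "normal";
# sparse extreme values are labelled -1 (candidate outliers/anomalies).
labels = DBSCAN(eps=0.5, min_samples=5).fit_predict(signal.reshape(-1, 1))
outliers = np.where(labels == -1)[0]

# Peak detection with a height threshold well above the noise floor.
peaks, _ = find_peaks(signal, height=2.0)
```

In a real pipeline the peak indices would feed the peak-interval features mentioned in the design section.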

Originality/value

The authors identified a suitable frequency domain range for low-pass filtering of asphalt road dynamic response signals, revealing minimal amplitude loss and effective reflection of strain information between road layers. Furthermore, the authors introduced the DBSCAN-based anomaly detection method and enhancements to the MATLAB findpeaks function, enabling the detection of anomalies in road sensor data and automated peak identification.

Details

Smart and Resilient Transportation, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2632-0487

Content available
Book part
Publication date: 23 April 2024

Details

Technological Innovations for Business, Education and Sustainability
Type: Book
ISBN: 978-1-83753-106-6

Open Access
Article
Publication date: 30 April 2024

Armando Di Meglio, Nicola Massarotti and Perumal Nithiarasu

In this study, the authors propose a novel digital twinning approach specifically designed for controlling transient thermal systems. The purpose of this study is to harness the…

Abstract

Purpose

In this study, the authors propose a novel digital twinning approach specifically designed for controlling transient thermal systems. The purpose of this study is to harness the combined power of deep learning (DL) and physics-based methods (PBM) to create an active virtual replica of the physical system.

Design/methodology/approach

To achieve this goal, we introduce a deep neural network (DNN) as the digital twin and a Finite Element (FE) model as the physical system. This integrated approach is used to address the challenges of controlling an unsteady heat transfer problem with an integrated feedback loop.

Findings

The results of our study demonstrate the effectiveness of the proposed digital twinning approach in regulating the maximum temperature within the system under varying and unsteady heat flux conditions. The DNN, trained on stationary data, plays a crucial role in determining the heat transfer coefficients necessary to maintain temperatures below a defined threshold value, such as the material’s melting point. The system is successfully controlled in 1D, 2D and 3D case studies. However, careful evaluations should be conducted if such a training approach, based on steady-state data, is applied to completely different transient heat transfer problems.

Originality/value

The present work represents one of the first examples of a comprehensive, data-driven digital twinning approach to transient thermal systems. One noteworthy feature of this approach is its robustness: by adopting training based on dimensionless data, the approach can seamlessly accommodate changes in thermal capacity and thermal conductivity without retraining.

Details

International Journal of Numerical Methods for Heat & Fluid Flow, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0961-5539

Open Access
Article
Publication date: 12 December 2023

Laura Lucantoni, Sara Antomarioni, Filippo Emanuele Ciarapica and Maurizio Bevilacqua

The Overall Equipment Effectiveness (OEE) is considered a standard for measuring equipment productivity in terms of efficiency. Still, Artificial Intelligence solutions are rarely…

Abstract

Purpose

The Overall Equipment Effectiveness (OEE) is considered a standard for measuring equipment productivity in terms of efficiency. Still, Artificial Intelligence solutions are rarely used for analyzing OEE results and identifying corrective actions. Therefore, the approach proposed in this paper aims to provide a new rule-based Machine Learning (ML) framework for OEE enhancement and the selection of improvement actions.

Design/methodology/approach

Association Rules (ARs) are used as a rule-based ML method for extracting knowledge from large data sets. First, the dominant loss class is identified, and traditional methodologies are combined with ARs for anomaly classification and prioritization. Once priority anomalies have been selected, a detailed analysis is conducted to investigate their influence on the OEE loss factors using ARs and Network Analysis (NA). A Deming Cycle is then used as a roadmap for applying the proposed methodology, testing and implementing proactive actions while monitoring the OEE variation.
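The association-rule metrics underlying the framework above can be illustrated in a few lines. The loss-event items and "shift" transactions below are hypothetical placeholders, not the authors' production data; the point is only the support and confidence arithmetic.

```python
# Minimal illustration of association-rule metrics (support, confidence)
# over transactions of loss events. Data is hypothetical.

# Each "transaction" lists the loss events observed in one production shift.
shifts = [
    {"speed_loss", "minor_stop"},
    {"speed_loss", "minor_stop", "quality_defect"},
    {"speed_loss", "minor_stop"},
    {"breakdown"},
    {"speed_loss"},
]

def support(itemset, transactions):
    """Fraction of transactions containing every item in the itemset."""
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(antecedent, consequent, transactions):
    """Estimated P(consequent | antecedent): support of the union over support of the antecedent."""
    return support(antecedent | consequent, transactions) / support(antecedent, transactions)

sup = support({"speed_loss", "minor_stop"}, shifts)        # 3/5 = 0.6
conf = confidence({"minor_stop"}, {"speed_loss"}, shifts)  # 3/3 = 1.0
```

A rule such as minor_stop → speed_loss with high confidence is the kind of co-occurrence the framework mines to prioritise corrective actions.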

Findings

The method proposed in this work was tested in an automotive company for framework validation and impact measurement. In particular, results highlighted that the rule-based ML methodology for OEE improvement addressed seven anomalies within a year through appropriate proactive actions: on average, each action ensured an OEE gain of 5.4%.

Originality/value

The originality lies in the dual application of association rules for extracting knowledge from the overall OEE. In particular, the co-occurrences of priority anomalies and their impact on asset Availability, Performance and Quality are investigated.

Details

International Journal of Quality & Reliability Management, vol. 41 no. 5
Type: Research Article
ISSN: 0265-671X

Open Access
Article
Publication date: 29 April 2024

Dada Zhang and Chun-Hsing Ho

The purpose of this paper is to investigate the vehicle-based sensor effect and pavement temperature on road condition assessment, as well as to compute a threshold value for the…

Abstract

Purpose

The purpose of this paper is to investigate the vehicle-based sensor effect and pavement temperature on road condition assessment, as well as to compute a threshold value for the classification of pavement conditions.

Design/methodology/approach

Four sensors were placed on the vehicle’s control arms and one inside the vehicle to collect vibration acceleration data for analysis. Analysis of variance (ANOVA) tests were performed to diagnose the effect of the vehicle-based sensors’ placement in the field. To classify road conditions and identify pavement distress (points of interest), a probability distribution was fitted to the magnitude values of the vibration data.

Findings

Results from ANOVA indicate that pavement sensing patterns from the sensors placed on the front control arms were statistically significant, with no significant difference between sensors placed on the same side of the vehicle (e.g. left or right). A reference threshold (1.7 g) was computed using the distribution fitting method to classify road conditions and identify road distress based on magnitude values combining acceleration along all three axes. In addition, pavement temperature was found to be highly correlated with the sensing patterns, which is noteworthy for future projects.
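The threshold-based classification described above can be sketched directly: the combined magnitude across the three acceleration axes is compared with the reported 1.7 g reference. The readings below are invented for illustration; only the 1.7 g threshold comes from the study.

```python
# Sketch: three-axis acceleration magnitude vs. the study's 1.7 g
# reference threshold. Sample readings are hypothetical.
import numpy as np

THRESHOLD_G = 1.7  # reference threshold reported by the study

def magnitude(ax, ay, az):
    """Combined acceleration magnitude over the three axes, in g."""
    return np.sqrt(ax**2 + ay**2 + az**2)

def classify(ax, ay, az):
    """Flag a reading as candidate pavement distress if it exceeds the threshold."""
    return "distress" if magnitude(ax, ay, az) > THRESHOLD_G else "normal"

smooth = classify(0.5, 0.5, 0.8)  # |a| ≈ 1.07 g, below threshold
bump = classify(1.2, 1.0, 1.1)    # |a| ≈ 1.91 g, above threshold
```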

Originality/value

The paper investigates the effect of pavement sensors’ placement in assessing road conditions, emphasizing the implications for future road condition assessment projects. A threshold value for classifying road conditions was proposed and applied in class assignments (I-17 highway projects).

Details

Built Environment Project and Asset Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2044-124X

Open Access
Article
Publication date: 25 April 2024

Adrián Mendieta-Aragón, Julio Navío-Marco and Teresa Garín-Muñoz

Radical changes in consumer habits induced by the coronavirus disease (COVID-19) pandemic suggest that the usual demand forecasting techniques based on historical series are…

Abstract

Purpose

Radical changes in consumer habits induced by the coronavirus disease (COVID-19) pandemic suggest that the usual demand forecasting techniques based on historical series are questionable. This is particularly true for hospitality demand, which has been dramatically affected by the pandemic. Accordingly, we investigate the suitability of tourists’ activity on Twitter as a predictor of hospitality demand in the Way of Saint James – an important pilgrimage tourism destination.

Design/methodology/approach

This study compares the predictive performance of the seasonal autoregressive integrated moving average (SARIMA) time-series model with that of the SARIMA with exogenous variables (SARIMAX) model in forecasting hotel tourism demand. For this, 110,456 tweets posted on Twitter between January 2018 and September 2022 are used as exogenous variables.

Findings

The results confirm that the predictions of traditional time-series models for tourist demand can be significantly improved by including tourist activity on Twitter. Twitter data could be an effective tool for improving the forecasting accuracy of tourism demand in real-time, which has relevant implications for tourism management. This study also provides a better understanding of tourists’ digital footprints in pilgrimage tourism.

Originality/value

This study contributes to the scarce literature on the digitalisation of pilgrimage tourism and forecasting hotel demand using a new methodological framework based on Twitter user-generated content. This can enable hospitality industry practitioners to convert social media data into relevant information for hospitality management.

Purpose

The COVID-19 pandemic has fundamentally changed consumer habits; these changes suggest that customary demand forecasting techniques based on historical series may no longer be valid. This uncertainty applies especially to hospitality demand, which the pandemic has affected severely. We therefore examine whether tourist activity on Twitter is a suitable predictor of hospitality demand on the Way of Saint James, an important pilgrimage tourism destination.

Design/methodology/approach

This study compares the performance of the SARIMA time-series model with that of the model with exogenous variables (SARIMAX) in forecasting tourism and hotel demand. To this end, posts published on Twitter were collected as exogenous variables; the sample covers 110,456 posts published between January 2018 and September 2022.

Findings

The results confirm that traditional time-series models achieve significantly better forecasts of tourism demand when tourist activity on Twitter is included. Twitter data may be an effective tool for improving the accuracy of real-time tourism demand forecasting, a finding of relevance to tourism management. The study also deepens our understanding of tourists’ digital footprints in pilgrimage tourism.

Originality/value

The existing literature rarely examines the digitalisation of pilgrimage tourism; this study enriches that literature and employs a novel methodological framework based on user-generated content on Twitter. This can help hospitality practitioners convert social media data into information suitable for hospitality management.

Details

European Journal of Management and Business Economics, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2444-8451

Open Access
Article
Publication date: 17 May 2022

Douglas Aghimien, Clinton Aigbavboa, Ayodeji Emmanuel Oke and John Aliu

Digitalisation, which involves the use of digital technologies in transforming an organisation’s activities, transcends just the acquiring of emerging digital tools. Having the…

Abstract

Purpose

Digitalisation, which involves the use of digital technologies in transforming an organisation’s activities, transcends just the acquiring of emerging digital tools. Having the right people to drive the implementation of these technologies and attaining strategic organisational goals is essential. While most studies have focused on the use of emerging technologies in the construction industry, less attention has been given to the ‘people’ dimension. Therefore, this study aims to assess the people-related features needed for construction digitalisation.

Design/methodology/approach

The study adopted pragmatic thinking using a mixed-method approach. A Delphi study was used for the qualitative aspect of the research, while a questionnaire survey of 222 construction professionals was used for the quantitative aspect. The data gathered were analysed using frequency, percentage, mean item score, the Kruskal–Wallis H test, exploratory factor analysis and confirmatory factor analysis.
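One of the group-comparison steps listed above can be sketched with SciPy: a Kruskal–Wallis H test checking whether respondent groups rate an item differently. The three groups and their 1–5 ratings below are hypothetical, not the study's survey data.

```python
# Sketch: Kruskal-Wallis H test across three hypothetical respondent
# groups' item ratings (1-5 scale). Data is illustrative only.
from scipy.stats import kruskal

architects = [4, 5, 4, 4, 5, 3, 4]
engineers = [3, 4, 3, 4, 3, 3, 4]
surveyors = [2, 3, 2, 3, 2, 3, 2]

# H0: all groups draw ratings from the same distribution.
stat, p_value = kruskal(architects, engineers, surveyors)
significant = p_value < 0.05  # reject H0 at the 5% level if True
```

In the study, a test of this shape would flag items on which professional groups disagree before the factor-analysis stage.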

Findings

Based on acceptable reliability, validity and model fit indices, the study found that the people-related factors needed for construction digitalisation can be grouped into technical capability of personnel, attracting and retaining digital talent and organisation’s digital culture.

Practical implications

The findings offer valuable benefits to construction organisations, as understanding these identified people-related features can lead to better deployment of digital tools and the attainment of digital transformation.

Originality/value

This study addresses the shortage of literature exploring the people dimension of construction digitalisation. It offers a theoretical backdrop for future work on digital talent for construction digitalisation, which has received less attention in the current construction digitalisation discourse.

Details

Construction Innovation, vol. 24 no. 7
Type: Research Article
ISSN: 1471-4175
