Search results

1 – 10 of over 2000

View access options

Article

Publication date: 31 October 2023

Intelligent detection on construction project contract missing clauses based on deep learning and NLP

Hong Zhou, Binwei Gao, Shilong Tang, Bing Li and Shuyu Wang

The number of construction dispute cases has maintained a high growth trend in recent years. The effective exploration and management of construction contract risk can directly…

HTML

PDF (3.3 MB)

Downloads

267

Abstract

Purpose

The number of construction dispute cases has maintained a high growth trend in recent years. The effective exploration and management of construction contract risk can directly promote the overall performance of the project life cycle. The miss of clauses may result in a failure to match with standard contracts. If the contract, modified by the owner, omits key clauses, potential disputes may lead to contractors paying substantial compensation. Therefore, the identification of construction project contract missing clauses has heavily relied on the manual review technique, which is inefficient and highly restricted by personnel experience. The existing intelligent means only work for the contract query and storage. It is urgent to raise the level of intelligence for contract clause management. Therefore, this paper aims to propose an intelligent method to detect construction project contract missing clauses based on Natural Language Processing (NLP) and deep learning technology.

Design/methodology/approach

A complete classification scheme of contract clauses is designed based on NLP. First, construction contract texts are pre-processed and converted from unstructured natural language into structured digital vector form. Following the initial categorization, a multi-label classification of long text construction contract clauses is designed to preliminary identify whether the clause labels are missing. After the multi-label clause missing detection, the authors implement a clause similarity algorithm by creatively integrating the image detection thought, MatchPyramid model, with BERT to identify missing substantial content in the contract clauses.

Findings

1,322 construction project contracts were tested. Results showed that the accuracy of multi-label classification could reach 93%, the accuracy of similarity matching can reach 83%, and the recall rate and F1 mean of both can reach more than 0.7. The experimental results verify the feasibility of intelligently detecting contract risk through the NLP-based method to some extent.

Originality/value

NLP is adept at recognizing textual content and has shown promising results in some contract processing applications. However, the mostly used approaches of its utilization for risk detection in construction contract clauses predominantly are rule-based, which encounter challenges when handling intricate and lengthy engineering contracts. This paper introduces an NLP technique based on deep learning which reduces manual intervention and can autonomously identify and tag types of contractual deficiencies, aligning with the evolving complexities anticipated in future construction contracts. Moreover, this method achieves the recognition of extended contract clause texts. Ultimately, this approach boasts versatility; users simply need to adjust parameters such as segmentation based on language categories to detect omissions in contract clauses of diverse languages.

Details

Engineering, Construction and Architectural Management, vol. ahead-of-print no. ahead-of-print

Type: Research Article

DOI:

ISSN: 0969-9988

Keywords

View access options

Article

Publication date: 18 August 2023

Text classification using deep learning techniques: a bibliometric analysis and future research directions

Gaurav Sarin, Pradeep Kumar and M. Mukund

Text classification is a widely accepted and adopted technique in organizations to mine and analyze unstructured and semi-structured data. With advancement of technological…

HTML

PDF (3.6 MB)

Downloads

167

Abstract

Purpose

Text classification is a widely accepted and adopted technique in organizations to mine and analyze unstructured and semi-structured data. With advancement of technological computing, deep learning has become more popular among academicians and professionals to perform mining and analytical operations. In this work, the authors study the research carried out in field of text classification using deep learning techniques to identify gaps and opportunities for doing research.

Design/methodology/approach

The authors adopted bibliometric-based approach in conjunction with visualization techniques to uncover new insights and findings. The authors collected data of two decades from Scopus global database to perform this study. The authors discuss business applications of deep learning techniques for text classification.

Findings

The study provides overview of various publication sources in field of text classification and deep learning together. The study also presents list of prominent authors and their countries working in this field. The authors also presented list of most cited articles based on citations and country of research. Various visualization techniques such as word cloud, network diagram and thematic map were used to identify collaboration network.

Originality/value

The study performed in this paper helped to understand research gaps that is original contribution to body of literature. To best of the authors' knowledge, in-depth study in the field of text classification and deep learning has not been performed in detail. The study provides high value to scholars and professionals by providing them opportunities of research in this area.

Details

Benchmarking: An International Journal, vol. ahead-of-print no. ahead-of-print

Type: Research Article

DOI:

ISSN: 1463-5771

Keywords

View access options

Article

Publication date: 25 April 2024

Artificial intelligence-based pre-conception stage construction budget decision-making model and tool for residential buildings

Abdul-Manan Sadick, Argaw Gurmu and Chathuri Gunarathna

Developing a reliable cost estimate at the early stage of construction projects is challenging due to inadequate project information. Most of the information during this stage is…

HTML

PDF (2 MB)

Downloads

Abstract

Purpose

Developing a reliable cost estimate at the early stage of construction projects is challenging due to inadequate project information. Most of the information during this stage is qualitative, posing additional challenges to achieving accurate cost estimates. Additionally, there is a lack of tools that use qualitative project information and forecast the budgets required for project completion. This research, therefore, aims to develop a model for setting project budgets (excluding land) during the pre-conceptual stage of residential buildings, where project information is mainly qualitative.

Design/methodology/approach

Due to the qualitative nature of project information at the pre-conception stage, a natural language processing model, DistilBERT (Distilled Bidirectional Encoder Representations from Transformers), was trained to predict the cost range of residential buildings at the pre-conception stage. The training and evaluation data included 63,899 building permit activity records (2021–2022) from the Victorian State Building Authority, Australia. The input data comprised the project description of each record, which included project location and basic material types (floor, frame, roofing, and external wall).

Findings

This research designed a novel tool for predicting the project budget based on preliminary project information. The model achieved 79% accuracy in classifying residential buildings into three cost_classes ($100,000-$300,000, $300,000-$500,000, $500,000-$1,200,000) and F1-scores of 0.85, 0.73, and 0.74, respectively. Additionally, the results show that the model learnt the contextual relationship between qualitative data like project location and cost.

Research limitations/implications

The current model was developed using data from Victoria state in Australia; hence, it would not return relevant outcomes for other contexts. However, future studies can adopt the methods to develop similar models for their context.

Originality/value

This research is the first to leverage a deep learning model, DistilBERT, for cost estimation at the pre-conception stage using basic project information like location and material types. Therefore, the model would contribute to overcoming data limitations for cost estimation at the pre-conception stage. Residential building stakeholders, like clients, designers, and estimators, can use the model to forecast the project budget at the pre-conception stage to facilitate decision-making.

Details

Engineering, Construction and Architectural Management, vol. ahead-of-print no. ahead-of-print

Type: Research Article

DOI:

ISSN: 0969-9988

Keywords

View access options

Article

Publication date: 16 February 2023

Latent topics identification from the articles of Sri Lankan authors using LDA

S. Ravikumar, Bidyut Bikash Boruah and Fullstar Lamin Gayang

The purpose of the study is to identify the latent topics from 9102 Web of Science (WoS) indexed research articles published in 2645 journals of the Sri Lankan authors from 1989…

HTML

PDF (1.9 MB)

Downloads

103

Abstract

Purpose

The purpose of the study is to identify the latent topics from 9102 Web of Science (WoS) indexed research articles published in 2645 journals of the Sri Lankan authors from 1989 to 2021 by applying Latent Dirichlet Allocation to the abstracts. Dominant topics in the corpus of text, the posterior probability of different terms in the topics and the publication proportions of the topics were discussed in the article.

Design/methodology/approach

Abstracts and other details of the studied articles are collected from WoS database by the authors. Data preprocessing is performed before the analysis. “ldatuning” from the R package is applied after preprocessing of text for deciding subjects in light of factual elements. Twenty topics are decided to extract as latent topics through four metrics methods.

Findings

It is observed that medical science, agriculture, research and development and chemistry-related topics dominate the subject categories as a whole. “Irrigation” and “mortality and health care” have a significant growth in the publication proportion from 2019 to 2021. For the most occurring latent topics, it is seen that terms like “activity” and “acid” carry higher posterior probability.

Practical implications

Topic models permit us to rapidly and efficiently address higher perspective inquiries without human mediation and are also helpful in information retrieval and document clustering. The unique feature of this study has highlighted how the growth of the universe of knowledge for a specific country can be studied using the LDA topic model.

Originality/value

This study will create an incentive for text analysis and information retrieval areas of research. The results of this paper gave an understanding of the writing development of the Sri Lankan authors in different subject spaces and over the period. Trends and intensity of publications from the Sri Lankan authors on different latent topics help to trace the interests and mostly practiced areas in different domains.

Details

Global Knowledge, Memory and Communication, vol. ahead-of-print no. ahead-of-print

Type: Research Article

DOI:

ISSN: 2514-9342

Keywords

View access options

Article

Publication date: 22 March 2024

Decoding mood of the Twitterverse on ESG investing: opinion mining and key themes using machine learning

Rachana Jaiswal, Shashank Gupta and Aviral Kumar Tiwari

Grounded in the stakeholder theory and signaling theory, this study aims to broaden the research agenda on environmental, social and governance (ESG) investing by uncovering…

HTML

PDF (745 KB)

Downloads

Abstract

Purpose

Grounded in the stakeholder theory and signaling theory, this study aims to broaden the research agenda on environmental, social and governance (ESG) investing by uncovering public sentiments and key themes using Twitter data spanning from 2009 to 2022.

Design/methodology/approach

Using various machine learning models for text tonality analysis and topic modeling, this research scrutinizes 1,842,985 Twitter texts to extract prevalent ESG investing trends and gauge their sentiment.

Findings

Gibbs Sampling Dirichlet Multinomial Mixture emerges as the optimal topic modeling method, unveiling significant topics such as “Physical risk of climate change,” “Employee Health, Safety and well-being” and “Water management and Scarcity.” RoBERTa, an attention-based model, outperforms other machine learning models in sentiment analysis, revealing a predominantly positive shift in public sentiment toward ESG investing over the past five years.

Research limitations/implications

This study establishes a framework for sentiment analysis and topic modeling on alternative data, offering a foundation for future research. Prospective studies can enhance insights by incorporating data from additional social media platforms like LinkedIn and Facebook.

Practical implications

Leveraging unstructured data on ESG from platforms like Twitter provides a novel avenue to capture company-related information, supplementing traditional self-reported sustainability disclosures. This approach opens new possibilities for understanding a company’s ESG standing.

Social implications

By shedding light on public perceptions of ESG investing, this research uncovers influential factors that often elude traditional corporate reporting. The findings empower both investors and the general public, aiding managers in refining ESG and management strategies.

Originality/value

This study marks a groundbreaking contribution to scholarly exploration, to the best of the authors’ knowledge, by being the first to analyze unstructured Twitter data in the context of ESG investing, offering unique insights and advancing the understanding of this emerging field.

Details

Management Research Review, vol. ahead-of-print no. ahead-of-print

Type: Research Article

DOI:

ISSN: 2040-8269

Keywords

View access options

Article

Publication date: 9 February 2024

How do reward personalization options influence the public’s willingness to participate in innovation projects? Insights from crowdfunding in Industry 5.0

Wei Wang, Haiwang Liu and Yenchun Jim Wu

This study aims to examine the influence of reward personalization on financing outcomes in the Industry 5.0 era, where reward-based crowdfunding meets the personalized needs of…

HTML

PDF (1.4 MB)

Downloads

Abstract

Purpose

This study aims to examine the influence of reward personalization on financing outcomes in the Industry 5.0 era, where reward-based crowdfunding meets the personalized needs of individuals.

Design/methodology/approach

The study utilizes a corpus of 218,822 crowdfunding projects and 1,276,786 reward options on Kickstarter to investigate the effect of reward personalization on investors’ willingness to participate in crowdfunding. The research draws on expectancy theory and employs quantitative and qualitative approaches to measure reward personalization. Quantitatively, the number of reward options is calculated by frequency; whereas text-mining techniques are implemented qualitatively to extract novelty, which serves as a proxy for innovation.

Findings

Findings indicate that reward personalization has an inverted U-shaped effect on investors’ willingness to participate, with investors in life-related projects having a stronger need for reward personalization than those interested in art-related projects. The pledge goal and reward text readability have an inverted U-shaped moderating effect on reward personalization from the perspective of reward expectations and reward instrumentality.

Originality/value

This study refines the application of expectancy theory to online financing, providing theoretical insight and practical guidance for crowdfunding platforms and financiers seeking to promote sustainable development through personalized innovation.

Details

European Journal of Innovation Management, vol. ahead-of-print no. ahead-of-print

Type: Research Article

DOI:

ISSN: 1460-1060

Keywords

View access options

Article

Publication date: 30 August 2023

Automatic retrieval of health case reports for public needs using deep learning techniques

Yi-Hung Liu, Sheng-Fong Chen and Dan-Wei (Marian) Wen

Online medical repositories provide a platform for users to share information and dynamically access abundant electronic health data. It is important to determine whether case…

HTML

PDF (937 KB)

Downloads

109

Abstract

Purpose

Online medical repositories provide a platform for users to share information and dynamically access abundant electronic health data. It is important to determine whether case report information can assist the general public in appropriately managing their diseases. Therefore, this paper aims to introduce a novel deep learning-based method that allows non-professionals to make inquiries using ordinary vocabulary, retrieving the most relevant case reports for accurate and effective health information.

Design/methodology/approach

The dataset of case reports was collected from both the patient-generated research network and the digital medical journal repository. To enhance the accuracy of obtaining relevant case reports, the authors propose a retrieval approach that combines BERT and BiLSTM methods. The authors identified representative health-related case reports and analyzed the retrieval performance, as well as user judgments.

Findings

This study aims to provide the necessary functionalities to deliver relevant health case reports based on input from ordinary terms. The proposed framework includes features for health management, user feedback acquisition and ranking by weights to obtain the most pertinent case reports.

Originality/value

This study contributes to health information systems by analyzing patients' experiences and treatments with the case report retrieval model. The results of this study can provide immense benefit to the general public who intend to find treatment decisions and experiences from relevant case reports.

Details

Aslib Journal of Information Management, vol. ahead-of-print no. ahead-of-print

Type: Research Article

DOI:

ISSN: 2050-3806

Keywords

View access options

Article

Publication date: 5 December 2023

Recognizing emotions in restaurant online reviews: a hybrid model integrating deep learning and a sentiment lexicon

Jun Liu, Sike Hu, Fuad Mehraliyev, Haiyue Zhou, Yunyun Yu and Luyu Yang

This study aims to establish a model for rapid and accurate emotion recognition in restaurant online reviews, thus advancing the literature and providing practical insights into…

HTML

PDF (2.1 MB)

Downloads

126

Abstract

Purpose

This study aims to establish a model for rapid and accurate emotion recognition in restaurant online reviews, thus advancing the literature and providing practical insights into electronic word-of-mouth management for the industry.

Design/methodology/approach

This study elaborates a hybrid model that integrates deep learning (DL) and a sentiment lexicon (SL) and compares it to five other models, including SL, random forest (RF), naïve Bayes, support vector machine (SVM) and a DL model, for the task of emotion recognition in restaurant online reviews. These models are trained and tested using 652,348 online reviews from 548 restaurants.

Findings

The hybrid approach performs well for valence-based emotion and discrete emotion recognition and is highly applicable for mining online reviews in a restaurant setting. The performances of SL and RF are inferior when it comes to recognizing discrete emotions. The DL method and SVM can perform satisfactorily in the valence-based emotion recognition.

Research limitations/implications

These findings provide methodological and theoretical implications; thus, they advance the current state of knowledge on emotion recognition in restaurant online reviews. The results also provide practical insights into intelligent service quality monitoring and electronic word-of-mouth management for the industry.

Originality/value

This study proposes a superior model for emotion recognition in restaurant online reviews. The methodological framework and steps are elucidated in detail for future research and practical application. This study also details the performances of other commonly used models to support the selection of methods in research and practical applications.

Details

International Journal of Contemporary Hospitality Management, vol. ahead-of-print no. ahead-of-print

Type: Research Article

DOI:

ISSN: 0959-6119

Keywords

View access options

Article

Publication date: 8 December 2022

Conceptual model of knowledge management system for scholarly publication cycle in academic institution

Deden Sumirat Hidayat, Dana Indra Sensuse, Damayanti Elisabeth and Lintang Matahari Hasani

Study on knowledge-based systems for scientific publications is growing very broadly. However, most of these studies do not explicitly discuss the knowledge management (KM…

HTML

PDF (1.1 MB)

Downloads

328

Abstract

Purpose

Study on knowledge-based systems for scientific publications is growing very broadly. However, most of these studies do not explicitly discuss the knowledge management (KM) component as knowledge management system (KMS) implementation. This background causes academic institutions to face challenges in developing KMS to support scholarly publication cycle (SPC). Therefore, this study aims to develop a new KMS conceptual model, Identify critical components and provide research gap opportunities for future KM studies on SPC.

Design/methodology/approach

This study used a systematic literature review (SLR) method with the procedure from Kitchenham et al. Then, the SLR results are compiled into a conceptual model design based on a framework on KM foundations and KM solutions. Finally, the model design was validated through interviews with related field experts.

Findings

The KMS for SPC focuses on the discovery, sharing and application of knowledge. The majority of KMS use recommendation systems technology with content-based filtering and collaborative filtering personalization approaches. The characteristics data used in KMS for SPC are structured and unstructured. Metadata and article abstracts are considered sufficiently representative of the entire article content to be used as a search tool and can provide recommendations. The KMS model for SPC has layers of KM infrastructure, processes, systems, strategies, outputs and outcomes.

Research limitations/implications

This study has limitations in discussing tacit knowledge. In contrast, tacit knowledge for SPC is essential for scientific publication performance. The tacit knowledge includes experience in searching, writing, submitting, publishing and disseminating scientific publications. Tacit knowledge plays a vital role in the development of knowledge sharing system (KSS) and KCS. Therefore, KSS and KCS for SPC are still very challenging to be researched in the future. KMS opportunities that might be developed further are lessons learned databases and interactive forums that capture tacit knowledge about SPC. Future work potential could identify other types of KMS in academia and focus more on SPC.

Originality/value

This study proposes a novel comprehensive KMS model to support scientific publication performance. This model has a critical path as a KMS implementation solution for SPC. This model proposes and recommends appropriate components for SPC requirements (KM processes, technology, methods/techniques and data). This study also proposes novel research gaps as KMS research opportunities for SPC in the future.

Details

VINE Journal of Information and Knowledge Management Systems, vol. ahead-of-print no. ahead-of-print

Type: Research Article

DOI:

ISSN: 2059-5891

Keywords

View access options

Article

Publication date: 27 January 2023

The impact of innovation news coverage on illiquid stocks: the case of US market

Elena Fedorova and Valentin Stepanov

The purpose of this study is to determine stock market reactions to the news about innovations and other types of publications for illiquid stocks.

HTML

PDF (2.9 MB)

Downloads

136

Abstract

Purpose

The purpose of this study is to determine stock market reactions to the news about innovations and other types of publications for illiquid stocks.

Design/methodology/approach

(1) The authors opt for machine learning techniques and expert analysis and propose their own lexicon of innovations based on the news articles published on the professional website; (2) the dataset consists of the data on 2,000 US companies for 6 years; (3) the text analysis including BERT and Top2 Vec models which are superior to Latent Dirichlet allocation (LDA) in information criteria allows for more accurate evaluation of news sentiment and idea; and (4) furthermore, random forest and gradient boosting were applied to increase validity of results and demonstrate factor importance.

Findings

(1) The paper presents theoretical findings adding to signalling theory and efficient market hypothesis for US illiquid stocks; (2) this study suggests that information on product innovations (unlike other types of innovations) has a direct and significant effect on the return of illiquid stocks; (3) the results also give evidence that under uncertainty innovation-related publications do not affect the return of illiquid stocks; and (4) the analysis of the news topics (narratives) demonstrates that only the narrative related to important corporate announcements has a positive impact on the return of illiquid stocks.

Originality/value

(1) The authors are the first to conduct a large-scale study of the impact of various information on the return of illiquid stocks; (2) the paper focuses on information on several types of innovations with regard to the return of illiquid stocks; (3) based on Top2 Vec model, this study identifies the key topics-narratives discussed by investors and assesses their impact on the return of illiquid stocks; and (4) as an information source, the authors use the sample comprising a total of 1.4m news articles released on the professional website for investors “Benzinga”.

Details

European Journal of Innovation Management, vol. ahead-of-print no. ahead-of-print

Type: Research Article

DOI:

ISSN: 1460-1060

Keywords

Access

Year

Content type

Earlycite article (2670)

1 – 10 of over 2000