Search results

1 – 10 of over 1000
Article
Publication date: 31 October 2023

Hong Zhou, Binwei Gao, Shilong Tang, Bing Li and Shuyu Wang

The number of construction dispute cases has maintained a high growth trend in recent years. The effective exploration and management of construction contract risk can directly…

Abstract

Purpose

The number of construction dispute cases has maintained a high growth trend in recent years. The effective exploration and management of construction contract risk can directly promote the overall performance of the project life cycle. The miss of clauses may result in a failure to match with standard contracts. If the contract, modified by the owner, omits key clauses, potential disputes may lead to contractors paying substantial compensation. Therefore, the identification of construction project contract missing clauses has heavily relied on the manual review technique, which is inefficient and highly restricted by personnel experience. The existing intelligent means only work for the contract query and storage. It is urgent to raise the level of intelligence for contract clause management. Therefore, this paper aims to propose an intelligent method to detect construction project contract missing clauses based on Natural Language Processing (NLP) and deep learning technology.

Design/methodology/approach

A complete classification scheme of contract clauses is designed based on NLP. First, construction contract texts are pre-processed and converted from unstructured natural language into structured digital vector form. Following the initial categorization, a multi-label classification of long text construction contract clauses is designed to preliminary identify whether the clause labels are missing. After the multi-label clause missing detection, the authors implement a clause similarity algorithm by creatively integrating the image detection thought, MatchPyramid model, with BERT to identify missing substantial content in the contract clauses.

Findings

1,322 construction project contracts were tested. Results showed that the accuracy of multi-label classification could reach 93%, the accuracy of similarity matching can reach 83%, and the recall rate and F1 mean of both can reach more than 0.7. The experimental results verify the feasibility of intelligently detecting contract risk through the NLP-based method to some extent.

Originality/value

NLP is adept at recognizing textual content and has shown promising results in some contract processing applications. However, the mostly used approaches of its utilization for risk detection in construction contract clauses predominantly are rule-based, which encounter challenges when handling intricate and lengthy engineering contracts. This paper introduces an NLP technique based on deep learning which reduces manual intervention and can autonomously identify and tag types of contractual deficiencies, aligning with the evolving complexities anticipated in future construction contracts. Moreover, this method achieves the recognition of extended contract clause texts. Ultimately, this approach boasts versatility; users simply need to adjust parameters such as segmentation based on language categories to detect omissions in contract clauses of diverse languages.

Details

Engineering, Construction and Architectural Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0969-9988

Keywords

Article
Publication date: 4 January 2024

Zicheng Zhang

Advanced big data analysis and machine learning methods are concurrently used to unleash the value of the data generated by government hotline and help devise intelligent…

Abstract

Purpose

Advanced big data analysis and machine learning methods are concurrently used to unleash the value of the data generated by government hotline and help devise intelligent applications including automated process management, standard construction and more accurate dispatched orders to build high-quality government service platforms as more widely data-driven methods are in the process.

Design/methodology/approach

In this study, based on the influence of the record specifications of texts related to work orders generated by the government hotline, machine learning tools are implemented and compared to optimize classify dispatching tasks by performing exploratory studies on the hotline work order text, including linguistics analysis of text feature processing, new word discovery, text clustering and text classification.

Findings

The complexity of the content of the work order is reduced by applying more standardized writing specifications based on combining text grammar numerical features. So, order dispatch success prediction accuracy rate reaches 89.6 per cent after running the LSTM model.

Originality/value

The proposed method can help improve the current dispatching processes run by the government hotline, better guide staff to standardize the writing format of work orders, improve the accuracy of order dispatching and provide innovative support to the current mechanism.

Details

Data Technologies and Applications, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 8 March 2024

Bing Xue, Rui Yao, Zengyu Ye, Cheuk Ting Chan, Dickson K.W. Chiu and Zeyu Zhong

With the rapid development of social media, many organizations have begun to attach importance to social media platforms. This research studies the management and the use of…

Abstract

Purpose

With the rapid development of social media, many organizations have begun to attach importance to social media platforms. This research studies the management and the use of social media in academic music libraries, taking the Center for Chinese Music Studies of the Chinese University of Hong Kong (CCMS) as a case study.

Design/methodology/approach

We conducted a sentiment analysis of posts on Facebook’s public page to analyze the reaction to the posts with some exploratory analysis, including the communication trend and relevant factors that affect user interaction.

Findings

Our results show that the Facebook channel for the library has a good publicity effect and active interaction, but the number of posts and interactions has a downward trend. Therefore, the library needs to pay more attention to the management of the Facebook channel and take adequate measures to improve the quality of posts to increase interaction.

Originality/value

Few studies have analyzed existing data directly collected from social media by programming based on sentiment analysis and natural language processing technology to explore potential methods to promote music libraries, especially in East Asia, and about traditional music.

Details

Library Hi Tech, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0737-8831

Keywords

Article
Publication date: 29 May 2023

Xiang Zheng, Mingjie Li, Ze Wan and Yan Zhang

This study aims to extract knowledge of ancient Chinese scientific and technological documents bibliographic summaries (STDBS) and provide the knowledge graph (KG) comprehensively…

Abstract

Purpose

This study aims to extract knowledge of ancient Chinese scientific and technological documents bibliographic summaries (STDBS) and provide the knowledge graph (KG) comprehensively and systematically. By presenting the relationship among content, discipline, and author, this study focuses on providing services for knowledge discovery of ancient Chinese scientific and technological documents.

Design/methodology/approach

This study compiles ancient Chinese STDBS and designs a knowledge mining and graph visualization framework. The authors define the summaries' entities, attributes, and relationships for knowledge representation, use deep learning techniques such as BERT-BiLSTM-CRF models and rules for knowledge extraction, unify the representation of entities for knowledge fusion, and use Neo4j and other visualization techniques for KG construction and application. This study presents the generation, distribution, and evolution of ancient Chinese agricultural scientific and technological knowledge in visualization graphs.

Findings

The knowledge mining and graph visualization framework is feasible and effective. The BERT-BiLSTM-CRF model has domain adaptability and accuracy. The knowledge generation of ancient Chinese agricultural scientific and technological documents has distinctive time features. The knowledge distribution is uneven and concentrated, mainly concentrated on C1-Planting and cultivation, C2-Silkworm, and C3-Mulberry and water conservancy. The knowledge evolution is apparent, and differentiation and integration coexist.

Originality/value

This study is the first to visually present the knowledge connotation and association of ancient Chinese STDBS. It solves the problems of the lack of in-depth knowledge mining and connotation visualization of ancient Chinese STDBS.

Details

Library Hi Tech, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0737-8831

Keywords

Article
Publication date: 29 January 2024

Kai Wang

The identification of network user relationship in Fancircle contributes to quantifying the violence index of user text, mining the internal correlation of network behaviors among…

Abstract

Purpose

The identification of network user relationship in Fancircle contributes to quantifying the violence index of user text, mining the internal correlation of network behaviors among users, which provides necessary data support for the construction of knowledge graph.

Design/methodology/approach

A correlation identification method based on sentiment analysis (CRDM-SA) is put forward by extracting user semantic information, as well as introducing violent sentiment membership. To be specific, the topic of the implementation of topology mapping in the community can be obtained based on self-built field of violent sentiment dictionary (VSD) by extracting user text information. Afterward, the violence index of the user text is calculated to quantify the fuzzy sentiment representation between the user and the topic. Finally, the multi-granularity violence association rules mining of user text is realized by constructing violence fuzzy concept lattice.

Findings

It is helpful to reveal the internal relationship of online violence under complex network environment. In that case, the sentiment dependence of users can be characterized from a granular perspective.

Originality/value

The membership degree of violent sentiment into user relationship recognition in Fancircle community is introduced, and a text sentiment association recognition method based on VSD is proposed. By calculating the value of violent sentiment in the user text, the annotation of violent sentiment in the topic dimension of the text is achieved, and the partial order relation between fuzzy concepts of violence under the effective confidence threshold is utilized to obtain the association relation.

Details

Data Technologies and Applications, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 5 January 2024

Zizhong Zhang

Hair loss is often overlooked but psychologically challenging. However, the emergence of online health communities provides opportunities for hair loss patients to seek social…

Abstract

Purpose

Hair loss is often overlooked but psychologically challenging. However, the emergence of online health communities provides opportunities for hair loss patients to seek social support through self-disclosure. Nevertheless, not all disclosures receive the desired support. This research explores what patients disclose within the community and how their health narrative (content, form and linguistic style) regarding self-disclosure influences the social support they receive.

Design/methodology/approach

This study investigated a 13-year-old online support group for Chinese hair loss patients with nearly 240,000 members. Using structural topic modeling, Linguistic Inquiry and Word Count, and a negative binomial model, the research analyzed the content of self-disclosure and the interrelationships between social support and three narrative dimensions of self-disclosure.

Findings

Self-disclosures are classified into 14 topics, grouped under analytical, informative and emotional categories. Emotion-related self-disclosures, whether in content or effective word use, receive deeper social support. Longer and image-rich posts attract more support in quantity, but not necessarily in quality, while cognitive words have a limited impact.

Originality/value

This study addresses the previously overlooked population of hair loss patients within online health communities. It employs a more comprehensive health narrative framework to explore the relationship between self-disclosure and social support, utilizing unsupervised structural topic modeling methods to mine text. The research offers practical implications for how patients seek support and for healthcare professionals in developing doctor-patient communication strategies.

Details

Online Information Review, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1468-4527

Keywords

Article
Publication date: 14 November 2023

Jihye Park, Min Zhang, Seunghyun Yoo and Hannah Gloria Kwon

This study investigates the effects of vertical direction and rotation of English loan brand names in East Asian languages (Chinese and Korean) on processing fluency, perceived…

Abstract

Purpose

This study investigates the effects of vertical direction and rotation of English loan brand names in East Asian languages (Chinese and Korean) on processing fluency, perceived product quality and purchase intention.

Design/methodology/approach

Four experiments were conducted in China and Korea, employing a 2 (vertical direction: downward vs upward) X 3 (rotation: 0°/marquee vs 90° clockwise vs 90° counterclockwise) between-subjects factorial design.

Findings

The findings showed that when the English loan Chinese brand name was displayed downward, the marquee format was preferred, while counterclockwise rotation was favored when displayed upward. In Korean, clockwise rotation was preferred for downward presentation, while counterclockwise rotation was favored for upward presentation. The effects on purchase intention were mediated by processing fluency and perceived product quality.

Practical implications

This research provides practical implications for global manufacturers and retailers, offering guidance on presenting brand names in East Asian languages and optimizing product packaging designs. For Chinese consumers, the marquee format is recommended for downward-oriented brand names, while counterclockwise rotation is effective for upward orientation. For Korean consumers, clockwise rotation is favored for downward presentation and counterclockwise rotation is preferred for upward presentation. Understanding linguistic habits allows the tailoring of brand presentations, enhancing brand perception and consumer responses.

Originality/value

This study contributes to understanding the role of cultural and linguistic influences on consumer information processing and product perception in vertical presentations of brand names.

Details

Asia Pacific Journal of Marketing and Logistics, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1355-5855

Keywords

Article
Publication date: 5 October 2023

Sheng Yuan

The purpose of this study is to compare the communication practices of Chinese and US companies on YouTube and explores the effectiveness of different communication strategies at…

Abstract

Purpose

The purpose of this study is to compare the communication practices of Chinese and US companies on YouTube and explores the effectiveness of different communication strategies at the topic level.

Design/methodology/approach

The author selected 22 Chinese companies and 22 US firms and compared the content of their English language corporate YouTube channels through content analysis, sentiment analysis and cluster analysis.

Findings

The results revealed that the three communication strategies (information, response and involvement) in general were not significantly different regarding their engagement rates, but they generated different comment scores when communicating topics of corporate social responsibility. The results also showed that Chinese companies were more likely than American firms to display the speeches of corporate leaders, use collectivistic references and present human interest messages in YouTube videos.

Research limitations/implications

This study sheds light on how national institutional environment shapes corporate communication on YouTube.

Practical implications

This study challenges the infatuation with the involvement strategy and offers some advice for practitioners on topic selection and user comment function management.

Originality/value

This study makes a novel contribution to the literature of corporate communication on YouTube by adopting a cross-national comparative approach. A conceptual framework of major factors influencing stakeholder responses on YouTube was presented.

Peer review

The peer review history for this article is available at: https://publons.com/publon/10.1108/OIR-02-2023-0061

Details

Online Information Review, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1468-4527

Keywords

Article
Publication date: 2 August 2022

Zhongbao Liu and Wenjuan Zhao

The research on structure function recognition mainly concentrates on identifying a specific part of academic literature and its applicability in the multidiscipline perspective…

Abstract

Purpose

The research on structure function recognition mainly concentrates on identifying a specific part of academic literature and its applicability in the multidiscipline perspective. A specific part of academic literature, such as sentences, paragraphs and chapter contents are also called a level of academic literature in this paper. There are a few comparative research works on the relationship between models, disciplines and levels in the process of structure function recognition. In view of this, comparative research on structure function recognition based on deep learning has been conducted in this paper.

Design/methodology/approach

An experimental corpus, including the academic literature of traditional Chinese medicine, library and information science, computer science, environmental science and phytology, was constructed. Meanwhile, deep learning models such as convolutional neural networks (CNN), long and short-term memory (LSTM) and bidirectional encoder representation from transformers (BERT) were used. The comparative experiments of structure function recognition were conducted with the help of the deep learning models from the multilevel perspective.

Findings

The experimental results showed that (1) the BERT model performed best, with F1 values of 78.02, 89.41 and 94.88%, respectively at the level of sentence, paragraph and chapter content. (2) The deep learning models performed better on the academic literature of traditional Chinese medicine than on other disciplines in most cases, e.g. F1 values of CNN, LSTM and BERT, respectively arrived at 71.14, 69.96 and 78.02% at the level of sentence. (3) The deep learning models performed better at the level of chapter content than other levels, the maximum F1 values of CNN, LSTM and BERT at 91.92, 74.90 and 94.88%, respectively. Furthermore, the confusion matrix of recognition results on the academic literature was introduced to find out the reason for misrecognition.

Originality/value

This paper may inspire other research on structure function recognition, and provide a valuable reference for the analysis of influencing factors.

Details

Library Hi Tech, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0737-8831

Keywords

Article
Publication date: 29 August 2023

Hei-Chia Wang, Martinus Maslim and Hung-Yu Liu

A clickbait is a deceptive headline designed to boost ad revenue without presenting closely relevant content. There are numerous negative repercussions of clickbait, such as…

Abstract

Purpose

A clickbait is a deceptive headline designed to boost ad revenue without presenting closely relevant content. There are numerous negative repercussions of clickbait, such as causing viewers to feel tricked and unhappy, causing long-term confusion, and even attracting cyber criminals. Automatic detection algorithms for clickbait have been developed to address this issue. The fact that there is only one semantic representation for the same term and a limited dataset in Chinese is a need for the existing technologies for detecting clickbait. This study aims to solve the limitations of automated clickbait detection in the Chinese dataset.

Design/methodology/approach

This study combines both to train the model to capture the probable relationship between clickbait news headlines and news content. In addition, part-of-speech elements are used to generate the most appropriate semantic representation for clickbait detection, improving clickbait detection performance.

Findings

This research successfully compiled a dataset containing up to 20,896 Chinese clickbait news articles. This collection contains news headlines, articles, categories and supplementary metadata. The suggested context-aware clickbait detection (CA-CD) model outperforms existing clickbait detection approaches on many criteria, demonstrating the proposed strategy's efficacy.

Originality/value

The originality of this study resides in the newly compiled Chinese clickbait dataset and contextual semantic representation-based clickbait detection approach employing transfer learning. This method can modify the semantic representation of each word based on context and assist the model in more precisely interpreting the original meaning of news articles.

Details

Data Technologies and Applications, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9288

Keywords

1 – 10 of over 1000