Search results
1 – 10 of 32The tender documents, an essential data source for internet-based logistics tendering platforms, incorporate massive fine-grained data, ranging from information on tenderee…
Abstract
Purpose
The tender documents, an essential data source for internet-based logistics tendering platforms, incorporate massive fine-grained data, ranging from information on tenderee, shipping location and shipping items. Automated information extraction in this area is, however, under-researched, making the extraction process a time- and effort-consuming one. For Chinese logistics tender entities, in particular, existing named entity recognition (NER) solutions are mostly unsuitable as they involve domain-specific terminologies and possess different semantic features.
Design/methodology/approach
To tackle this problem, a novel lattice long short-term memory (LSTM) model, combining a variant contextual feature representation and a conditional random field (CRF) layer, is proposed in this paper for identifying valuable entities from logistic tender documents. Instead of traditional word embedding, the proposed model uses the pretrained Bidirectional Encoder Representations from Transformers (BERT) model as input to augment the contextual feature representation. Subsequently, with the Lattice-LSTM model, the information of characters and words is effectively utilized to avoid error segmentation.
Findings
The proposed model is then verified by the Chinese logistic tender named entity corpus. Moreover, the results suggest that the proposed model excels in the logistics tender corpus over other mainstream NER models. The proposed model underpins the automatic extraction of logistics tender information, enabling logistic companies to perceive the ever-changing market trends and make far-sighted logistic decisions.
Originality/value
(1) A practical model for logistic tender NER is proposed in the manuscript. By employing and fine-tuning BERT into the downstream task with a small amount of data, the experiment results show that the model has a better performance than other existing models. This is the first study, to the best of the authors' knowledge, to extract named entities from Chinese logistic tender documents. (2) A real logistic tender corpus for practical use is constructed and a program of the model for online-processing real logistic tender documents is developed in this work. The authors believe that the model will facilitate logistic companies in converting unstructured documents to structured data and further perceive the ever-changing market trends to make far-sighted logistic decisions.
Details
Keywords
Qinxu Ding, Ding Ding, Yue Wang, Chong Guan and Bosheng Ding
The rapid rise of large language models (LLMs) has propelled them to the forefront of applications in natural language processing (NLP). This paper aims to present a comprehensive…
Abstract
Purpose
The rapid rise of large language models (LLMs) has propelled them to the forefront of applications in natural language processing (NLP). This paper aims to present a comprehensive examination of the research landscape in LLMs, providing an overview of the prevailing themes and topics within this dynamic domain.
Design/methodology/approach
Drawing from an extensive corpus of 198 records published between 1996 to 2023 from the relevant academic database encompassing journal articles, books, book chapters, conference papers and selected working papers, this study delves deep into the multifaceted world of LLM research. In this study, the authors employed the BERTopic algorithm, a recent advancement in topic modeling, to conduct a comprehensive analysis of the data after it had been meticulously cleaned and preprocessed. BERTopic leverages the power of transformer-based language models like bidirectional encoder representations from transformers (BERT) to generate more meaningful and coherent topics. This approach facilitates the identification of hidden patterns within the data, enabling authors to uncover valuable insights that might otherwise have remained obscure. The analysis revealed four distinct clusters of topics in LLM research: “language and NLP”, “education and teaching”, “clinical and medical applications” and “speech and recognition techniques”. Each cluster embodies a unique aspect of LLM application and showcases the breadth of possibilities that LLM technology has to offer. In addition to presenting the research findings, this paper identifies key challenges and opportunities in the realm of LLMs. It underscores the necessity for further investigation in specific areas, including the paramount importance of addressing potential biases, transparency and explainability, data privacy and security, and responsible deployment of LLM technology.
Findings
The analysis revealed four distinct clusters of topics in LLM research: “language and NLP”, “education and teaching”, “clinical and medical applications” and “speech and recognition techniques”. Each cluster embodies a unique aspect of LLM application and showcases the breadth of possibilities that LLM technology has to offer. In addition to presenting the research findings, this paper identifies key challenges and opportunities in the realm of LLMs. It underscores the necessity for further investigation in specific areas, including the paramount importance of addressing potential biases, transparency and explainability, data privacy and security, and responsible deployment of LLM technology.
Practical implications
This classification offers practical guidance for researchers, developers, educators, and policymakers to focus efforts and resources. The study underscores the importance of addressing challenges in LLMs, including potential biases, transparency, data privacy, and responsible deployment. Policymakers can utilize this information to shape regulations, while developers can tailor technology development based on the diverse applications identified. The findings also emphasize the need for interdisciplinary collaboration and highlight ethical considerations, providing a roadmap for navigating the complex landscape of LLM research and applications.
Originality/value
This study stands out as the first to examine the evolution of LLMs across such a long time frame and across such diversified disciplines. It provides a unique perspective on the key areas of LLM research, highlighting the breadth and depth of LLM’s evolution.
Details
Keywords
Xinyu Liu, Kun Ma, Ke Ji, Zhenxiang Chen and Bo Yang
Propaganda is a prevalent technique used in social media to intentionally express opinions or actions with the aim of manipulating or deceiving users. Existing methods for…
Abstract
Purpose
Propaganda is a prevalent technique used in social media to intentionally express opinions or actions with the aim of manipulating or deceiving users. Existing methods for propaganda detection primarily focus on capturing language features within its content. However, these methods tend to overlook the information presented within the external news environment from which propaganda news originated and spread. This news environment reflects recent mainstream media opinions and public attention and contains language characteristics of non-propaganda news. Therefore, the authors have proposed a graph-based multi-information integration network with an external news environment (abbreviated as G-MINE) for propaganda detection.
Design/methodology/approach
G-MINE is proposed to comprise four parts: textual information extraction module, external news environment perception module, multi-information integration module and classifier. Specifically, the external news environment perception module and multi-information integration module extract and integrate the popularity and novelty into the textual information and capture the high-order complementary information between them.
Findings
G-MINE achieves state-of-the-art performance on both the TSHP-17, Qprop and the PTC data sets, with an accuracy of 98.24%, 90.59% and 97.44%, respectively.
Originality/value
An external news environment perception module is proposed to capture the popularity and novelty information, and a multi-information integration module is proposed to effectively fuse them with the textual information.
Details
Keywords
Tong Yang, Jie Wu and Junming Zhang
This study aims to establish a comprehensive satisfaction analysis framework by mining online restaurant reviews, which can not only accurately reveal consumer satisfaction but…
Abstract
Purpose
This study aims to establish a comprehensive satisfaction analysis framework by mining online restaurant reviews, which can not only accurately reveal consumer satisfaction but also identify factors leading to dissatisfaction and further quantify improvement opportunity levels.
Design/methodology/approach
Adopting deep learning, Cross-Bidirectional Encoder Representations Transformers (BERT) model is developed to measure customer satisfaction. Furthermore, opinion mining technique is used to extract consumers’ opinions and obtain dissatisfaction factors. Furthermore, the opportunity algorithm is introduced to quantify attributes’ improvement opportunity levels. A total of 19,133 online reviews of 31 restaurants in Universal Beijing Resort are crawled to validate the framework.
Findings
Results demonstrate the superiority of Cross-BERT model compared to existing models such as sentiment lexicon-based model and Naïve Bayes. More importantly, after effectively unveiling customer dissatisfaction factors (e.g. long queuing time and taste salty), “Dish taste,” “Waiters’ attitude” and “Decoration” are identified as the three secondary attributes with the greatest improvement opportunities.
Practical implications
The proposed framework helps managers, especially in the restaurant industry, accurately understand customer satisfaction and reasons behind dissatisfaction, thereby generating efficient countermeasures. Especially, the improvement opportunity levels also benefit practitioners in efficiently allocating limited business resources.
Originality/value
This work contributes to hospitality and tourism literature by developing a comprehensive customer satisfaction analysis framework in the big data era. Moreover, to the best of the authors’ knowledge, this work is among the first to introduce opportunity algorithm to quantify service improvement benefits. The proposed Cross-BERT model also advances the methodological literature on measuring customer satisfaction.
Details
Keywords
Tingting Tian, Hongjian Shi, Ruhui Ma and Yuan Liu
For privacy protection, federated learning based on data separation allows machine learning models to be trained on remote devices or in isolated data devices. However, due to the…
Abstract
Purpose
For privacy protection, federated learning based on data separation allows machine learning models to be trained on remote devices or in isolated data devices. However, due to the limited resources such as bandwidth and power of local devices, communication in federated learning can be much slower than in local computing. This study aims to improve communication efficiency by reducing the number of communication rounds and the size of information transmitted in each round.
Design/methodology/approach
This paper allows each user node to perform multiple local trainings, then upload the local model parameters to a central server. The central server updates the global model parameters by weighted averaging the parameter information. Based on this aggregation, user nodes first cluster the parameter information to be uploaded and then replace each value with the mean value of its cluster. Considering the asymmetry of the federated learning framework, adaptively select the optimal number of clusters required to compress the model information.
Findings
While maintaining the loss convergence rate similar to that of federated averaging, the test accuracy did not decrease significantly.
Originality/value
By compressing uplink traffic, the work can improve communication efficiency on dynamic networks with limited resources.
Details
Keywords
Junyi Chen, Buqing Cao, Zhenlian Peng, Ziming Xie, Shanpeng Liu and Qian Peng
With the increasing number of mobile applications, efficiently recommending mobile applications to users has become a challenging problem. Although existing mobile application…
Abstract
Purpose
With the increasing number of mobile applications, efficiently recommending mobile applications to users has become a challenging problem. Although existing mobile application recommendation approaches based on user attributes and behaviors have achieved notable effectiveness, they overlook the diffusion patterns and interdependencies of topic-specific mobile applications among user groups. mobile applications among user groups. This paper aims to capture the diffusion patterns and interdependencies of mobile applications among user groups. To achieve this, a topic-aware neural network-based mobile application recommendation method, referred to as TN-MR, is proposed.
Design/methodology/approach
In this method, first, the user representations are enhanced by introducing a topic-aware attention layer, which captures both the topic context and the diffusion history context. Second, it exploits a time-decay mechanism to simulate changes in user interest. Multitopic user representations are aggregated by the time decay module to output the user representations of cascading representations under multiple topics. Finally, user scores that are likely to download the mobile application are predicted and ranked.
Findings
Experimental comparisons and analyses were conducted on the actual 360App data set, and the results demonstrate that the effectiveness of mobile application recommendations can be significantly improved by using TN-MR.
Originality/value
In this paper, the authors propose a mobile application recommendation method based on topic-aware attention networks. By capturing the diffusion patterns and dependencies of mobile applications, it effectively assists users in selecting their applications of interest from thousands of options, significantly improving the accuracy of mobile application recommendations.
Details
Keywords
Khameel B. Mustapha, Eng Hwa Yap and Yousif Abdalla Abakr
Following the recent rise in generative artificial intelligence (GenAI) tools, fundamental questions about their wider impacts have started to reverberate around various…
Abstract
Purpose
Following the recent rise in generative artificial intelligence (GenAI) tools, fundamental questions about their wider impacts have started to reverberate around various disciplines. This study aims to track the unfolding landscape of general issues surrounding GenAI tools and to elucidate the specific opportunities and limitations of these tools as part of the technology-assisted enhancement of mechanical engineering education and professional practices.
Design/methodology/approach
As part of the investigation, the authors conduct and present a brief scientometric analysis of recently published studies to unravel the emerging trend on the subject matter. Furthermore, experimentation was done with selected GenAI tools (Bard, ChatGPT, DALL.E and 3DGPT) for mechanical engineering-related tasks.
Findings
The study identified several pedagogical and professional opportunities and guidelines for deploying GenAI tools in mechanical engineering. Besides, the study highlights some pitfalls of GenAI tools for analytical reasoning tasks (e.g., subtle errors in computation involving unit conversions) and sketching/image generation tasks (e.g., poor demonstration of symmetry).
Originality/value
To the best of the authors’ knowledge, this study presents the first thorough assessment of the potential of GenAI from the lens of the mechanical engineering field. Combining scientometric analysis, experimentation and pedagogical insights, the study provides a unique focus on the implications of GenAI tools for material selection/discovery in product design, manufacturing troubleshooting, technical documentation and product positioning, among others.
Details
Keywords
Xiaoxian Yang, Zhifeng Wang, Qi Wang, Ke Wei, Kaiqi Zhang and Jiangang Shi
This study aims to adopt a systematic review approach to examine the existing literature on law and LLMs.It involves analyzing and synthesizing relevant research papers, reports…
Abstract
Purpose
This study aims to adopt a systematic review approach to examine the existing literature on law and LLMs.It involves analyzing and synthesizing relevant research papers, reports and scholarly articles that discuss the use of LLMs in the legal domain. The review encompasses various aspects, including an analysis of LLMs, legal natural language processing (NLP), model tuning techniques, data processing strategies and frameworks for addressing the challenges associated with legal question-and-answer (Q&A) systems. Additionally, the study explores potential applications and services that can benefit from the integration of LLMs in the field of intelligent justice.
Design/methodology/approach
This paper surveys the state-of-the-art research on law LLMs and their application in the field of intelligent justice. The study aims to identify the challenges associated with developing Q&A systems based on LLMs and explores potential directions for future research and development. The ultimate goal is to contribute to the advancement of intelligent justice by effectively leveraging LLMs.
Findings
To effectively apply a law LLM, systematic research on LLM, legal NLP and model adjustment technology is required.
Originality/value
This study contributes to the field of intelligent justice by providing a comprehensive review of the current state of research on law LLMs.
Details
Keywords
Rongen Yan, Depeng Dang, Hu Gao, Yan Wu and Wenhui Yu
Question answering (QA) answers the questions asked by people in the form of natural language. In the QA, due to the subjectivity of users, the questions they query have different…
Abstract
Purpose
Question answering (QA) answers the questions asked by people in the form of natural language. In the QA, due to the subjectivity of users, the questions they query have different expressions, which increases the difficulty of text retrieval. Therefore, the purpose of this paper is to explore new query rewriting method for QA that integrates multiple related questions (RQs) to form an optimal question. Moreover, it is important to generate a new dataset of the original query (OQ) with multiple RQs.
Design/methodology/approach
This study collects a new dataset SQuAD_extend by crawling the QA community and uses word-graph to model the collected OQs. Next, Beam search finds the best path to get the best question. To deeply represent the features of the question, pretrained model BERT is used to model sentences.
Findings
The experimental results show three outstanding findings. (1) The quality of the answers is better after adding the RQs of the OQs. (2) The word-graph that is used to model the problem and choose the optimal path is conducive to finding the best question. (3) Finally, BERT can deeply characterize the semantics of the exact problem.
Originality/value
The proposed method can use word-graph to construct multiple questions and select the optimal path for rewriting the question, and the quality of answers is better than the baseline. In practice, the research results can help guide users to clarify their query intentions and finally achieve the best answer.
Details
Keywords
Rahul Govind, Nitika Garg and Lemuria Carter
This study aims to examine the role of hope and hate in political leaders’ messages in influencing liberals versus conservatives’ social-distancing behavior during the COVID-19…
Abstract
Purpose
This study aims to examine the role of hope and hate in political leaders’ messages in influencing liberals versus conservatives’ social-distancing behavior during the COVID-19 pandemic. Given the increasing political partisanship across the world today, using the appropriate message framing has important implications for social and public policy.
Design/methodology/approach
The authors use two Natural Language Processing (NLP) methods – a pretrained package (HateSonar) and a classifier built to implement our supervised neural network-based model architecture using RoBERTa – to analyze 61,466 tweets by each US state’s governor and two senators with the goal of examining the association between message factors invoking hate and hope and increased or decreased social distancing from March to May 2020. The authors examine individuals’ social-distancing behaviors (the amount of nonessential driving undertaken) using data from 3,047 US counties between March 13 and May 31, 2020, as reported by Google COVID-19 Community Mobility Reports and the New York Times repository of COVID-19 data.
Findings
The results show that for conservative state leaders, the use of hate increases nonessential driving of state residents. However, when these leaders use hope in their speech, nonessential driving of state residents decreases. For liberal state leaders, the use of hate displays a directionally different result as compared to their conservative counterparts.
Research limitations/implications
Amid the emergence of new analytic techniques and novel data sources, the findings demonstrate that the use of global positioning systems data and social media analysis can provide valuable and precise insights into individual behavior. They also contribute to the literature on political ideology and emotion by demonstrating the use of specific emotion appeals in targeting specific consumer segments based on their political ideology.
Practical implications
The findings have significant implications for policymakers and public health officials regarding the importance of considering partisanship when developing and implementing public health policies. As partisanship continues to increase, applying the appropriate emotion appeal in messages will become increasingly crucial. The findings can help marketers and policymakers develop more effective social marketing campaigns by tailoring specific appeals given the political identity of the consumer.
Originality/value
Using Neural NLP methods, this study identifies the specific factors linking social media messaging from political leaders and increased compliance with health directives in a partisan population.
Details