Search results
1 – 10 of over 111000Xiaoming Zhang, Mingming Meng, Xiaoling Sun and Yu Bai
With the advent of the era of Big Data, the scale of knowledge graph (KG) in various domains is growing rapidly, which holds huge amount of knowledge surely benefiting the question…
Abstract
Purpose
With the advent of the era of Big Data, the scale of knowledge graph (KG) in various domains is growing rapidly, which holds huge amount of knowledge surely benefiting the question answering (QA) research. However, the KG, which is always constituted of entities and relations, is structurally inconsistent with the natural language query. Thus, the QA system based on KG is still faced with difficulties. The purpose of this paper is to propose a method to answer the domain-specific questions based on KG, providing conveniences for the information query over domain KG.
Design/methodology/approach
The authors propose a method FactQA to answer the factual questions about specific domain. A series of logical rules are designed to transform the factual questions into the triples, in order to solve the structural inconsistency between the user’s question and the domain knowledge. Then, the query expansion strategies and filtering strategies are proposed from two levels (i.e. words and triples in the question). For matching the question with domain knowledge, not only the similarity values between the words in the question and the resources in the domain knowledge but also the tag information of these words is considered. And the tag information is obtained by parsing the question using Stanford CoreNLP. In this paper, the KG in metallic materials domain is used to illustrate the FactQA method.
Findings
The designed logical rules have time stability for transforming the factual questions into the triples. Additionally, after filtering the synonym expansion results of the words in the question, the expansion quality of the triple representation of the question is improved. The tag information of the words in the question is considered in the process of data matching, which could help to filter out the wrong matches.
Originality/value
Although the FactQA is proposed for domain-specific QA, it can also be applied to any other domain besides metallic materials domain. For a question that cannot be answered, FactQA would generate a new related question to answer, providing as much as possible the user with the information they probably need. The FactQA could facilitate the user’s information query based on the emerging KG.
Details
Keywords
Previous knowledge base question answering (KBQA) models only consider the monolingual scenario and cannot be directly extended to the cross-lingual scenario, in which the…
Abstract
Purpose
Previous knowledge base question answering (KBQA) models only consider the monolingual scenario and cannot be directly extended to the cross-lingual scenario, in which the language of questions and that of knowledge base (KB) are different. Although a machine translation (MT) model can bridge the gap through translating questions to the language of KB, the noises of translated questions could accumulate and further sharply impair the final performance. Therefore, the authors propose a method to improve the robustness of KBQA models in the cross-lingual scenario.
Design/methodology/approach
The authors propose a knowledge distillation-based robustness enhancement (KDRE) method. Specifically, first a monolingual model (teacher) is trained by ground truth (GT) data. Then to imitate the practical noises, a noise-generating model is designed to inject two types of noise into questions: general noise and translation-aware noise. Finally, the noisy questions are input into the student model. Meanwhile, the student model is jointly trained by GT data and distilled data, which are derived from the teacher when feeding GT questions.
Findings
The experimental results demonstrate that KDRE can improve the performance of models in the cross-lingual scenario. The performance of each module in KBQA model is improved by KDRE. The knowledge distillation (KD) and noise-generating model in the method can complementarily boost the robustness of models.
Originality/value
The authors first extend KBQA models from monolingual to cross-lingual scenario. Also, the authors first implement KD for KBQA to develop robust cross-lingual models.
Details
Keywords
Tianxing Wu, Guilin Qi and Cheng Li
With the continuous development of intelligent technologies, knowledge graph, the backbone of artificial intelligence, has attracted much attention from both academic and…
Abstract
With the continuous development of intelligent technologies, knowledge graph, the backbone of artificial intelligence, has attracted much attention from both academic and industrial communities due to its powerful capability of knowledge representation and reasoning. Besides, knowledge graph has been widely applied in different kinds of applications, such as semantic search, question answering, knowledge management, and so on. In recent years, knowledge graph techniques in China are also developing rapidly and different Chinese knowledge graphs have been built to support various applications. Under the background of “One Belt One Road (OBOR)” initiative, cooperating with the countries along OBOR on studying knowledge graph techniques and applications will greatly promote the development of artificial intelligence. At the same time, the accumulated experience of China on developing knowledge graph is also a good reference. Thus, in this chapter, the authors mainly introduce the development of Chinese knowledge graphs and their applications. The authors first describe the background of OBOR, and then introduce the concept of knowledge graph and three typical Chinese knowledge graphs, including Zhishi.me, CN-DBpedia, and XLORE. Finally, the authors demonstrate several applications of Chinese knowledge graphs.
Details
Keywords
Sanam Ebrahimzadeh, Saeed Rezaei Sharifabadi, Masoumeh Karbala Aghaie Kamran and Kimiz Dalkir
The purpose of this paper is to identify the triggers, strategies and outcomes of collaborative information-seeking behaviours of researchers on the ResearchGate social networking…
Abstract
Purpose
The purpose of this paper is to identify the triggers, strategies and outcomes of collaborative information-seeking behaviours of researchers on the ResearchGate social networking site.
Design/methodology/approach
Data were collected from the population of researchers who use ResearchGate. The sample was limited to the Ph.D. students and assistant professors in the library and information science domain. Qualitative interviews were used for data collection.
Findings
Based on the findings of the study, informal communications and complex information needs lead to a decision to use collaborative information-seeking behaviour. Also, easy access to sources of information and finding relevant information were the major positive factors contributing to collaborative information-seeking behaviour of the ResearchGate users. Users moved from collaborative Q&A strategies to sharing information, synthesising information and networking strategies based on their needs. Analysis of information-seeking behaviour showed that ResearchGate users bridged the information gap by internalizing new knowledge, making collaborative decisions and increasing their work's visibility.
Originality/value
As one of the initial studies on the collaborative information-seeking behaviour of ResearchGate users, this study provides a holistic picture of different triggers that affect researchers' information-seeking on ResearchGate.
Details
Keywords
Na Xu, Yanxiang Liang, Chaoran Guo, Bo Meng, Xueqing Zhou, Yuting Hu and Bo Zhang
Safety management plays an important part in coal mine construction. Due to complex data, the implementation of the construction safety knowledge scattered in standards poses a…
Abstract
Purpose
Safety management plays an important part in coal mine construction. Due to complex data, the implementation of the construction safety knowledge scattered in standards poses a challenge. This paper aims to develop a knowledge extraction model to automatically and efficiently extract domain knowledge from unstructured texts.
Design/methodology/approach
Bidirectional encoder representations from transformers (BERT)-bidirectional long short-term memory (BiLSTM)-conditional random field (CRF) method based on a pre-training language model was applied to carry out knowledge entity recognition in the field of coal mine construction safety in this paper. Firstly, 80 safety standards for coal mine construction were collected, sorted out and marked as a descriptive corpus. Then, the BERT pre-training language model was used to obtain dynamic word vectors. Finally, the BiLSTM-CRF model concluded the entity’s optimal tag sequence.
Findings
Accordingly, 11,933 entities and 2,051 relationships in the standard specifications texts of this paper were identified and a language model suitable for coal mine construction safety management was proposed. The experiments showed that F1 values were all above 60% in nine types of entities such as security management. F1 value of this model was more than 60% for entity extraction. The model identified and extracted entities more accurately than conventional methods.
Originality/value
This work completed the domain knowledge query and built a Q&A platform via entities and relationships identified by the standard specifications suitable for coal mines. This paper proposed a systematic framework for texts in coal mine construction safety to improve efficiency and accuracy of domain-specific entity extraction. In addition, the pretraining language model was also introduced into the coal mine construction safety to realize dynamic entity recognition, which provides technical support and theoretical reference for the optimization of safety management platforms.
Details
Keywords
Recommending suitable content for users of online health communities (OHCs) is critical for overcoming information overload problem and facilitate medical decision making, but…
Abstract
Purpose
Recommending suitable content for users of online health communities (OHCs) is critical for overcoming information overload problem and facilitate medical decision making, but remains not fully investigated. This study aims to provide a content recommendation approach to automatically match valuable health-related information for OHC members.
Design/methodology/approach
A framework of health-related content recommendation was proposed by leveraging rich social information in online communities. The authors constructed user influence relationship (UIR) utilizing users' interaction records, user profiles and user-generated content. The initial user rating matrix and the user post matching matrix were then created by analyzing text content of posts. Finally, the user rating matrix and the recommended content were generated for community members. Datasets were collected from an OHC to evaluate the effectiveness of the proposed approach.
Findings
The experimental results revealed that the proposed method statistically outperformed baseline models in content recommendation for users of OHCs.
Research limitations/implications
The incorporation of social information can significantly enhance the performance of content recommendation in OHCs. The user post matching degree based on text analysis can improve the effectiveness of recommendation.
Practical implications
This study potentially contributes to the social support exchange and medical decision making of community members and the sustainable prosperity of OHCs.
Originality/value
This study proposes a novel social content recommendation method for online health consumers based on UIRs by leveraging social information in OHCs. The results indicate the significance of social information in content recommendation of healthcare social media.
Details
Keywords
Social media (SM) is amongst the latest techniques employed by organizations in their knowledge management endeavors. The paper provides a brief overview of how SM platforms are…
Abstract
Purpose
Social media (SM) is amongst the latest techniques employed by organizations in their knowledge management endeavors. The paper provides a brief overview of how SM platforms are utilized for the creation, dissemination, and retention of knowledge. The various stages of knowledge management as supported by social media, is mapped through a framework.
Design/methodology/approach
A list of research articles on “social media and knowledge management” have been read and reviewed. The insights are summarized and a framework is proposed.
Findings
The framework demonstrates how SM tools aid in the creation of new knowledge, knowledge exchange and storage of the knowledge footprint. They help to generate varied forms of knowledge from different stakeholders. The freely available information acts as a knowledge source for the third party. The virtual nature of digital platforms motivates employees to share knowledge more openly, leaving a digital trace that can be accessed anytime, thereby building on to the knowledge base.
Originality/value
The article highlights how SM can be essential in the knowledge management processes in the organization. It showcases the prominence of everyday organizational interactions and experiences which together build a knowledge-rich culture.
Details
Keywords
Mostafa Jafari, Roozbeh Hesamamiri, Jafar Sadjadi and Atieh Bourouni
The objective of this paper is to propose a holistic dynamic model for understanding the behavior of a complex and internet‐based kind of knowledge market by considering both…
Abstract
Purpose
The objective of this paper is to propose a holistic dynamic model for understanding the behavior of a complex and internet‐based kind of knowledge market by considering both social and economic interactions.
Design/methodology/approach
A system dynamics (SD) model is formulated in this study to investigate the dynamic characteristics of complex interactions in a fee‐based online question & answer (Q&A) knowledge market. The proposed model considers the dynamic, non‐linear, asymmetric, and reciprocal relationships between its components, and allows the study of the evolution of the market under assumed conditions.
Findings
Some illustrative results show that: this market is very sensitive to the prices that the customers choose; low‐priced questions are as important as high‐priced ones; gradually increasing experts' proportion of a question's price reduces customer satisfaction and experts' reputation; and training programs for experts result in higher customer satisfaction and researchers' reputation. Furthermore, three types of customers are identified and discussed.
Practical implications
This model can be used to change, manage, and control this market and also helps to design new similar markets. In addition, the proposed model helps to observe the behavior of a market under one or more policies before applying to the real world.
Social implications
Since GA was shut down in 2006, the implications of this research serve as a strategic tool (strategic evaluation software) for understanding and examining the effects of policies for many existing similar Q&A business models. Furthermore, the SD approach can provide new insights into the field of online Q&A knowledge markets and overcome traditional econometric treatment of data for understanding the dynamic behavior of these markets.
Originality/value
Understanding the complex social and economic behavior of Q&A markets is one of the most important concerns for academics and practitioners in the areas of online markets' management. The paper shows how SD can provide attractive insights into the field of online fee‐based knowledge markets based on a qualitative and quantitative modeling. However, the background literature lacks a holistic view of these kinds of markets.
Details
Keywords
Anuraj Mohan, Karthika P.V., Parvathi Sankar, K. Maya Manohar and Amala Peter
Money laundering is the process of concealing unlawfully obtained funds by presenting them as coming from a legitimate source. Criminals use crypto money laundering to hide the…
Abstract
Purpose
Money laundering is the process of concealing unlawfully obtained funds by presenting them as coming from a legitimate source. Criminals use crypto money laundering to hide the illicit origin of funds using a variety of methods. The most simplified form of bitcoin money laundering leans hard on the fact that transactions made in cryptocurrencies are pseudonymous, but open data gives more power to investigators and enables the crowdsourcing of forensic analysis. With the motive to curb these illegal activities, there exist various rules, policies and technologies collectively known as anti-money laundering (AML) tools. When properly implemented, AML restrictions reduce the negative effects of illegal economic activity while also promoting financial market integrity and stability, but these bear high costs for institutions. The purpose of this work is to motivate the opportunity to reconcile the cause of safety with that of financial inclusion, bearing in mind the limitations of the available data. The authors use the Elliptic dataset; to the best of the authors' knowledge, this is the largest labelled transaction dataset publicly available in any cryptocurrency.
Design/methodology/approach
AML in bitcoin can be modelled as a node classification task in dynamic networks. In this work, graph convolutional decision forest will be introduced, which combines the potentialities of evolving graph convolutional network and deep neural decision forest (DNDF). This model will be used to classify the unknown transactions in the Elliptic dataset. Additionally, the application of knowledge distillation (KD) over the proposed approach gives finest results compared to all the other experimented techniques.
Findings
The importance of utilising a concatenation between dynamic graph learning and ensemble feature learning is demonstrated in this work. The results show the superiority of the proposed model to classify the illicit transactions in the Elliptic dataset. Experiments also show that the results can be further improved when the system is fine-tuned using a KD framework.
Originality/value
Existing works used either ensemble learning or dynamic graph learning to tackle the problem of AML in bitcoin. The proposed model provides a novel view to combine the power of random forest with dynamic graph learning methods. Furthermore, the work also demonstrates the advantage of KD in improving the performance of the whole system.
Details
Keywords
Arshad Ahmad, Chong Feng, Shi Ge and Abdallah Yousif
Software developers extensively use stack overflow (SO) for knowledge sharing on software development. Thus, software engineering researchers have started mining the…
Abstract
Purpose
Software developers extensively use stack overflow (SO) for knowledge sharing on software development. Thus, software engineering researchers have started mining the structured/unstructured data present in certain software repositories including the Q&A software developer community SO, with the aim to improve software development. The purpose of this paper is show that how academics/practitioners can get benefit from the valuable user-generated content shared on various online social networks, specifically from Q&A community SO for software development.
Design/methodology/approach
A comprehensive literature review was conducted and 166 research papers on SO were categorized about software development from the inception of SO till June 2016.
Findings
Most of the studies revolve around a limited number of software development tasks; approximately 70 percent of the papers used millions of posts data, applied basic machine learning methods, and conducted investigations semi-automatically and quantitative studies. Thus, future research should focus on the overcoming existing identified challenges and gaps.
Practical implications
The work on SO is classified into two main categories; “SO design and usage” and “SO content applications.” These categories not only give insights to Q&A forum providers about the shortcomings in design and usage of such forums but also provide ways to overcome them in future. It also enables software developers to exploit such forums for the identified under-utilized tasks of software development.
Originality/value
The study is the first of its kind to explore the work on SO about software development and makes an original contribution by presenting a comprehensive review, design/usage shortcomings of Q&A sites, and future research challenges.
Details