Search results

1 – 10 of 303
Open Access
Article
Publication date: 11 October 2023

Bachriah Fatwa Dhini, Abba Suganda Girsang, Unggul Utan Sufandi and Heny Kurniawati

The authors constructed an automatic essay scoring (AES) model in a discussion forum where the result was compared with scores given by human evaluators. This research proposes…

Abstract

Purpose

The authors constructed an automatic essay scoring (AES) model in a discussion forum where the result was compared with scores given by human evaluators. This research proposes essay scoring, which is conducted through two parameters, semantic and keyword similarities, using a SentenceTransformers pre-trained model that can construct the highest vector embedding. Combining these models is used to optimize the model with increasing accuracy.

Design/methodology/approach

The development of the model in the study is divided into seven stages: (1) data collection, (2) pre-processing data, (3) selected pre-trained SentenceTransformers model, (4) semantic similarity (sentence pair), (5) keyword similarity, (6) calculate final score and (7) evaluating model.

Findings

The multilingual paraphrase-multilingual-MiniLM-L12-v2 and distilbert-base-multilingual-cased-v1 models got the highest scores from comparisons of 11 pre-trained multilingual models of SentenceTransformers with Indonesian data (Dhini and Girsang, 2023). Both multilingual models were adopted in this study. A combination of two parameters is obtained by comparing the response of the keyword extraction responses with the rubric keywords. Based on the experimental results, proposing a combination can increase the evaluation results by 0.2.

Originality/value

This study uses discussion forum data from the general biology course in online learning at the open university for the 2020.2 and 2021.2 semesters. Forum discussion ratings are still manual. In this survey, the authors created a model that automatically calculates the value of discussion forums, which are essays based on the lecturer's answers moreover rubrics.

Details

Asian Association of Open Universities Journal, vol. 18 no. 3
Type: Research Article
ISSN: 1858-3431

Keywords

Open Access
Article
Publication date: 2 February 2023

Xian Wang, Yijian Zhao, Qingyi Wang, Huang Yixing and Gabedava George

This paper focuses on the orientation of the economy expressed in the communication of the Central Economic Work Conference (CEWC) of China and its relation with the stock market…

Abstract

Purpose

This paper focuses on the orientation of the economy expressed in the communication of the Central Economic Work Conference (CEWC) of China and its relation with the stock market. This study seeks to explore which orientation of the economy may have a stronger impact on the rise of the stock market. It proposes words connoting orientation of the economy (WOE) that is closely related to the stock market, and different WOE has different impacts on the stock market in terms of intensity. The study aims to provide investors with better investment strategies by identifying the stronger developmental WOE.

Design/methodology/approach

The paper opted for an exploratory study using the textual analysis approach, based on a corpus of 28 CEWC communications spanning from 1994 to 2021. The raw corpus amounted to 50,754 words in total that are treated with noise reduction method and record an effective corpus of 39,591.

Findings

The paper provides empirical insights into the close relationship of the WOE of the CEWC to the stock market, and different WOE has different impacts on the stock market in terms of intensity. It suggests that WOE connoting development may forecast a rising stock market if it is nearly 40% higher than the other two WOEs by impact index.

Research limitations/implications

As WOE is only proven in the CEWC, this paper has its limitations in the scope of samples. It is necessary to apply WOE to more Central Bank communication (CBC) and countries. It is desirable to apply the Gunning–Fog index.

Practical implications

The paper includes implications for investors to read out the orientation of the economy and the degree of different WOEs. Investors are keener to know “what” degree of the CEWC leads to the rise/fall of the stock market. The impact index can be an indicator of a tendency of the stock market, which upgrades the rationality of investment decisions.

Social implications

This paper fulfills words connoting the orientation of economy as an identified linguistic feature, which the impact of CEWC on stockmarket can be measured.

Originality/value

Previous academic research studies mostly focus on the impact on stock market from the language features of CBC, rather than that from the more influential body, CEWC communication. This study seeks to provide the relationship of CEWC communication and the time length of the impact on the stock prices.

Details

Journal of Capital Markets Studies, vol. 7 no. 1
Type: Research Article
ISSN: 2514-4774

Keywords

Open Access
Article
Publication date: 21 June 2021

Bufei Xing, Haonan Yin, Zhijun Yan and Jiachen Wang

The purpose of this paper is to propose a new approach to retrieve similar questions in online health communities to improve the efficiency of health information retrieval and…

Abstract

Purpose

The purpose of this paper is to propose a new approach to retrieve similar questions in online health communities to improve the efficiency of health information retrieval and sharing.

Design/methodology/approach

This paper proposes a hybrid approach to combining domain knowledge similarity and topic similarity to retrieve similar questions in online health communities. The domain knowledge similarity can evaluate the domain distance between different questions. And the topic similarity measures questions’ relationship base on the extracted latent topics.

Findings

The experiment results show that the proposed method outperforms the baseline methods.

Originality/value

This method conquers the problem of word mismatch and considers the named entities included in questions, which most of existing studies did not.

Details

International Journal of Crowd Science, vol. 5 no. 2
Type: Research Article
ISSN: 2398-7294

Keywords

Open Access
Article
Publication date: 15 February 2022

Martin Nečaský, Petr Škoda, David Bernhauer, Jakub Klímek and Tomáš Skopal

Semantic retrieval and discovery of datasets published as open data remains a challenging task. The datasets inherently originate in the globally distributed web jungle, lacking…

1210

Abstract

Purpose

Semantic retrieval and discovery of datasets published as open data remains a challenging task. The datasets inherently originate in the globally distributed web jungle, lacking the luxury of centralized database administration, database schemes, shared attributes, vocabulary, structure and semantics. The existing dataset catalogs provide basic search functionality relying on keyword search in brief, incomplete or misleading textual metadata attached to the datasets. The search results are thus often insufficient. However, there exist many ways of improving the dataset discovery by employing content-based retrieval, machine learning tools, third-party (external) knowledge bases, countless feature extraction methods and description models and so forth.

Design/methodology/approach

In this paper, the authors propose a modular framework for rapid experimentation with methods for similarity-based dataset discovery. The framework consists of an extensible catalog of components prepared to form custom pipelines for dataset representation and discovery.

Findings

The study proposes several proof-of-concept pipelines including experimental evaluation, which showcase the usage of the framework.

Originality/value

To the best of authors’ knowledge, there is no similar formal framework for experimentation with various similarity methods in the context of dataset discovery. The framework has the ambition to establish a platform for reproducible and comparable research in the area of dataset discovery. The prototype implementation of the framework is available on GitHub.

Details

Data Technologies and Applications, vol. 56 no. 4
Type: Research Article
ISSN: 2514-9288

Keywords

Open Access
Article
Publication date: 24 June 2021

Haosen Liu, Youwei Wang, Xiabing Zhou, Zhengzheng Lou and Yangdong Ye

The railway signal equipment failure diagnosis is a vital element to keep the railway system operating safely. One of the most difficulties in signal equipment failure diagnosis…

Abstract

Purpose

The railway signal equipment failure diagnosis is a vital element to keep the railway system operating safely. One of the most difficulties in signal equipment failure diagnosis is the uncertainty of causality between the consequence and cause for the accident. The traditional method to solve this problem is based on Bayesian Network, which needs a rigid and independent assumption basis and prior probability knowledge but ignoring the semantic relationship in causality analysis. This paper aims to perform the uncertainty of causality in signal equipment failure diagnosis through a new way that emphasis on mining semantic relationships.

Design/methodology/approach

This study proposes a deterministic failure diagnosis (DFD) model based on the question answering system to implement railway signal equipment failure diagnosis. It includes the failure diagnosis module and deterministic diagnosis module. In the failure diagnosis module, this paper exploits the question answering system to recognise the cause of failure consequences. The question answering is composed of multi-layer neural networks, which extracts the position and part of speech features of text data from lower layers and acquires contextual features and interactive features of text data by Bi-LSTM and Match-LSTM, respectively, from high layers, subsequently generates the candidate failure cause set by proposed the enhanced boundary unit. In the second module, this study ranks the candidate failure cause set in the semantic matching mechanism (SMM), choosing the top 1st semantic matching degree as the deterministic failure causative factor.

Findings

Experiments on real data set railway maintenance signal equipment show that the proposed DFD model can implement the deterministic diagnosis of railway signal equipment failure. Comparing massive existing methods, the model achieves the state of art in the natural understanding semantic of railway signal equipment diagnosis domain.

Originality/value

It is the first time to use a question answering system executing signal equipment failure diagnoses, which makes failure diagnosis more intelligent than before. The EMU enables the DFD model to understand the natural semantic in long sequence contexture. Then, the SMM makes the DFD model acquire the certainty failure cause in the failure diagnosis of railway signal equipment.

Details

Smart and Resilient Transportation, vol. 3 no. 2
Type: Research Article
ISSN: 2632-0487

Keywords

Open Access
Article
Publication date: 12 June 2017

Lichao Zhu, Hangzhou Yang and Zhijun Yan

The purpose of this paper is to develop a new method to extract medical temporal information from online health communities.

Abstract

Purpose

The purpose of this paper is to develop a new method to extract medical temporal information from online health communities.

Design/methodology/approach

The authors trained a conditional random-filed model for the extraction of temporal expressions. The temporal relation identification is considered as a classification task and several support vector machine classifiers are built in the proposed method. For the model training, the authors extracted some high-level semantic features including co-reference relationship of medical concepts and the semantic similarity among words.

Findings

For the extraction of TIMEX, the authors find that well-formatted expressions are easy to recognize, and the main challenge is the relative TIMEX such as “three days after onset”. It also shows the same difficulty for normalization of absolute date or well-formatted duration, whereas frequency is easier to be normalized. For the identification of DocTimeRel, the result is fairly well, and the relation is difficult to identify when it involves a relative TIMEX or a hypothetical concept.

Originality/value

The authors proposed a new method to extract temporal information from the online clinical data and evaluated the usefulness of different level of syntactic features in this task.

Details

International Journal of Crowd Science, vol. 1 no. 2
Type: Research Article
ISSN: 2398-7294

Keywords

Open Access
Article
Publication date: 6 March 2017

Zhuoxuan Jiang, Chunyan Miao and Xiaoming Li

Recent years have witnessed the rapid development of massive open online courses (MOOCs). With more and more courses being produced by instructors and being participated by…

2121

Abstract

Purpose

Recent years have witnessed the rapid development of massive open online courses (MOOCs). With more and more courses being produced by instructors and being participated by learners all over the world, unprecedented massive educational resources are aggregated. The educational resources include videos, subtitles, lecture notes, quizzes, etc., on the teaching side, and forum contents, Wiki, log of learning behavior, log of homework, etc., on the learning side. However, the data are both unstructured and diverse. To facilitate knowledge management and mining on MOOCs, extracting keywords from the resources is important. This paper aims to adapt the state-of-the-art techniques to MOOC settings and evaluate the effectiveness on real data. In terms of practice, this paper also tries to answer the questions for the first time that to what extend can the MOOC resources support keyword extraction models, and how many human efforts are required to make the models work well.

Design/methodology/approach

Based on which side generates the data, i.e instructors or learners, the data are classified to teaching resources and learning resources, respectively. The approach used on teaching resources is based on machine learning models with labels, while the approach used on learning resources is based on graph model without labels.

Findings

From the teaching resources, the methods used by the authors can accurately extract keywords with only 10 per cent labeled data. The authors find a characteristic of the data that the resources of various forms, e.g. subtitles and PPTs, should be separately considered because they have the different model ability. From the learning resources, the keywords extracted from MOOC forums are not as domain-specific as those extracted from teaching resources, but they can reflect the topics which are lively discussed in forums. Then instructors can get feedback from the indication. The authors implement two applications with the extracted keywords: generating concept map and generating learning path. The visual demos show they have the potential to improve learning efficiency when they are integrated into a real MOOC platform.

Research limitations/implications

Conducting keyword extraction on MOOC resources is quite difficult because teaching resources are hard to be obtained due to copyrights. Also, getting labeled data is tough because usually expertise of the corresponding domain is required.

Practical implications

The experiment results support that MOOC resources are good enough for building models of keyword extraction, and an acceptable balance between human efforts and model accuracy can be achieved.

Originality/value

This paper presents a pioneer study on keyword extraction on MOOC resources and obtains some new findings.

Details

International Journal of Crowd Science, vol. 1 no. 1
Type: Research Article
ISSN: 2398-7294

Keywords

Open Access
Article
Publication date: 14 August 2017

Manuel Mühlburger, Stefan Oppl and Christian Stary

Deployment of knowledge management systems (KMSs) suffers from low adoption in organizational reality that is attributed to a lack of perceivable added value for people in actual…

1430

Abstract

Purpose

Deployment of knowledge management systems (KMSs) suffers from low adoption in organizational reality that is attributed to a lack of perceivable added value for people in actual work situations. Poor task/technology fit in the process of knowledge retrieval appears to be a major factor influencing this issue. Existing research indicates a lack of re-contextualizing stored information provided by KMSs in a particular situation. Existing research in the area of organizational memory information systems (OMISs) has thoroughly examined and widely discussed the topic of re-contextualization. The purpose of this paper, thus, is to examine how KMS design can benefit from OMIS research on approaches for re-contextualization in knowledge retrieval.

Design/methodology/approach

This paper examines OMIS literature and inductively derives a categorization scheme for KMS according to their strategy of re-contextualizing knowledge. The authors have validated the scheme validated in a multiple case study that examines the differentiatory value of the scheme for approaches with various re-contextualization strategies.

Findings

The classification scheme allows a step-by-step selection of approaches for re-contextualization of information in KMS design and development derived from OMIS research. The case study has demonstrated the applicability of the developed scheme and shows that the differentiation criteria can be applied unambiguously.

Research limitations/implications

Because of the chosen case study approach for validation, the validation results may lack generalizability.

Practical implications

The scheme enables an informed selection of KMSs appropriate for a particular OMIS use case, as the scheme’s attributes serve as design rationale for a certain architecture or constellation of components. Developers can not only select from various approaches when designing re-contextualizaton but also come up with rationales for each candidate because of structured representation. Hence, stakeholders can be supported in a more informed way and design KMSs more effectively along organizational change processes.

Originality/value

The paper addresses an identified need for systematic characterization of KMS approaches and systems intending to meet the objectives of OMISs. As such, it allows streamlining further research in this field, as approaches can be judged according to their originality and positioned relative to each other.

Details

VINE Journal of Information and Knowledge Management Systems, vol. 47 no. 3
Type: Research Article
ISSN: 2059-5891

Keywords

Open Access
Article
Publication date: 11 April 2018

Tingwei Gao, Yueting Chai and Yi Liu

The main purpose of this paper is to conduct an in-depth theoretical review and analysis for the fields of knowledge management (KM) and investigate the future research trend…

23640

Abstract

Purpose

The main purpose of this paper is to conduct an in-depth theoretical review and analysis for the fields of knowledge management (KM) and investigate the future research trend about KM.

Design/methodology/approach

At first, few theoretical basis about KM which include definitions and stages about KM have been summarized and analyzed. Then a comprehensive review about the major approaches for designing the KM system from different perspectives including knowledge representation and organization, knowledge sharing and performance measure for KM has been conducted.

Findings

The contributions of this paper will be useful for both academics and practitioners for the study of KM.

Originality/value

For this research, the focus is on conducting an in-depth theoretical review and analysis of KM.

Details

International Journal of Crowd Science, vol. 2 no. 1
Type: Research Article
ISSN: 2398-7294

Keywords

Open Access
Article
Publication date: 17 March 2022

Federico P. Zasa, Roberto Verganti and Paola Bellis

Having a shared vision is crucial for innovation. The purpose of this paper is to investigate the effect of individual propensity to collaborate and innovate on the development of…

1093

Abstract

Purpose

Having a shared vision is crucial for innovation. The purpose of this paper is to investigate the effect of individual propensity to collaborate and innovate on the development of a shared vision.

Design/methodology/approach

The authors build a network in which each node represents the vision of one individual and link the network structure to individual propensity of collaboration and innovativeness. During organizational workshops in four multinational organizations, the authors collected individual visions in the form of images as well as text describing the approach to innovation from 85 employees.

Findings

The study maps individual visions for innovation as a cognitive network. The authors find that individual propensity to innovate or collaborate is related to different network centrality. Innovators, individuals who see innovation as an opportunity to change and grow, are located at the center of the cognitive network. Collaborators, who see innovation as an opportunity to collaborate, have a higher closeness centrality inside a cluster.

Research limitations/implications

This paper analyses visions as a network linking recent research in psychology with the managerial longing for a more thorough investigation of group cognition. The study contributes to literature on shared vision creation, suggesting the role which innovators and collaborators can occupy in the process.

Originality/value

This paper proposes how an approach based on a cognitive network can inform innovation management. The findings suggest that visions of innovators summarize the visions of a group, helping the development of an overall shared vision. Collaborators on the other hand are representative of specific clusters and can help developing radical visions.

Details

European Journal of Innovation Management, vol. 25 no. 6
Type: Research Article
ISSN: 1460-1060

Keywords

1 – 10 of 303