Search results

1 – 10 of 692
Article
Publication date: 8 September 2023

Xiancheng Ou, Yuting Chen, Siwei Zhou and Jiandong Shi

With the continuous growth of online education, the quality issue of online educational videos has become increasingly prominent, causing students in online learning to face the…

Abstract

Purpose

With the continuous growth of online education, the quality issue of online educational videos has become increasingly prominent, causing students in online learning to face the dilemma of knowledge confusion. The existing mechanisms for controlling the quality of online educational videos suffer from subjectivity and low timeliness. Monitoring the quality of online educational videos involves analyzing metadata features and log data, which is an important aspect. With the development of artificial intelligence technology, deep learning techniques with strong predictive capabilities can provide new methods for predicting the quality of online educational videos, effectively overcoming the shortcomings of existing methods. The purpose of this study is to find a deep neural network that can model the dynamic and static features of the video itself, as well as the relationships between videos, to achieve dynamic monitoring of the quality of online educational videos.

Design/methodology/approach

The quality of a video cannot be directly measured. According to previous research, the authors use engagement to represent the level of video quality. Engagement is the normalized participation time, which represents the degree to which learners tend to participate in the video. Based on existing public data sets, this study designs an online educational video engagement prediction model based on dynamic graph neural networks (DGNNs). The model is trained based on the video’s static features and dynamic features generated after its release by constructing dynamic graph data. The model includes a spatiotemporal feature extraction layer composed of DGNNs, which can effectively extract the time and space features contained in the video's dynamic graph data. The trained model is used to predict the engagement level of learners with the video on day T after its release, thereby achieving dynamic monitoring of video quality.

Findings

Models with spatiotemporal feature extraction layers consisting of four types of DGNNs can accurately predict the engagement level of online educational videos. Of these, the model using the temporal graph convolutional neural network has the smallest prediction error. In dynamic graph construction, using cosine similarity and Euclidean distance functions with reasonable threshold settings can construct a structurally appropriate dynamic graph. In the training of this model, the amount of historical time series data used will affect the model’s predictive performance. The more historical time series data used, the smaller the prediction error of the trained model.

Research limitations/implications

A limitation of this study is that not all video data in the data set was used to construct the dynamic graph due to memory constraints. In addition, the DGNNs used in the spatiotemporal feature extraction layer are relatively conventional.

Originality/value

In this study, the authors propose an online educational video engagement prediction model based on DGNNs, which can achieve the dynamic monitoring of video quality. The model can be applied as part of a video quality monitoring mechanism for various online educational resource platforms.

Details

International Journal of Web Information Systems, vol. 19 no. 5/6
Type: Research Article
ISSN: 1744-0084

Keywords

Article
Publication date: 18 January 2022

Yang Yi, Yang Sun, Saimei Yuan, Yiji Zhu, Mengyi Zhang and Wenjun Zhu

The purpose of this paper is to provide a fast and accurate network for spatiotemporal action localization in videos. It detects human actions both in time and space…

Abstract

Purpose

The purpose of this paper is to provide a fast and accurate network for spatiotemporal action localization in videos. It detects human actions both in time and space simultaneously in real-time, which is applicable in real-world scenarios such as safety monitoring and collaborative assembly.

Design/methodology/approach

This paper design an end-to-end deep learning network called collaborator only watch once (COWO). COWO recognizes the ongoing human activities in real-time with enhanced accuracy. COWO inherits from the architecture of you only watch once (YOWO), known to be the best performing network for online action localization to date, but with three major structural modifications: COWO enhances the intraclass compactness and enlarges the interclass separability in the feature level. A new correlation channel fusion and attention mechanism are designed based on the Pearson correlation coefficient. Accordingly, a correction loss function is designed. This function minimizes the same class distance and enhances the intraclass compactness. Use a probabilistic K-means clustering technique for selecting the initial seed points. The idea behind this is that the initial distance between cluster centers should be as considerable as possible. CIOU regression loss function is applied instead of the Smooth L1 loss function to help the model converge stably.

Findings

COWO outperforms the original YOWO with improvements of frame mAP 3% and 2.1% at a speed of 35.12 fps. Compared with the two-stream, T-CNN, C3D, the improvement is about 5% and 14.5% when applied to J-HMDB-21, UCF101-24 and AGOT data sets.

Originality/value

COWO extends more flexibility for assembly scenarios as it perceives spatiotemporal human actions in real-time. It contributes to many real-world scenarios such as safety monitoring and collaborative assembly.

Details

Assembly Automation, vol. 42 no. 2
Type: Research Article
ISSN: 0144-5154

Keywords

Article
Publication date: 18 May 2020

Ushapreethi P and Lakshmi Priya G G

To find a successful human action recognition system (HAR) for the unmanned environments.

Abstract

Purpose

To find a successful human action recognition system (HAR) for the unmanned environments.

Design/methodology/approach

This paper describes the key technology of an efficient HAR system. In this paper, the advancements for three key steps of the HAR system are presented to improve the accuracy of the existing HAR systems. The key steps are feature extraction, feature descriptor and action classification, which are implemented and analyzed. The usage of the implemented HAR system in the self-driving car is summarized. Finally, the results of the HAR system and other existing action recognition systems are compared.

Findings

This paper exhibits the proposed modification and improvements in the HAR system, namely the skeleton-based spatiotemporal interest points (STIP) feature and the improved discriminative sparse descriptor for the identified feature and the linear action classification.

Research limitations/implications

The experiments are carried out on captured benchmark data sets and need to be analyzed in a real-time environment.

Practical implications

The middleware support between the proposed HAR system and the self-driven car system provides several other challenging opportunities in research.

Social implications

The authors’ work provides the way to go a step ahead in machine vision especially in self-driving cars.

Originality/value

The method for extracting the new feature and constructing an improved discriminative sparse feature descriptor has been introduced.

Details

International Journal of Intelligent Unmanned Systems, vol. 9 no. 1
Type: Research Article
ISSN: 2049-6427

Keywords

Article
Publication date: 31 January 2024

Tan Zhang, Zhanying Huang, Ming Lu, Jiawei Gu and Yanxue Wang

Rotating machinery is a crucial component of large equipment, and detecting faults in it accurately is critical for reliable operation. Although fault diagnosis methods based on…

Abstract

Purpose

Rotating machinery is a crucial component of large equipment, and detecting faults in it accurately is critical for reliable operation. Although fault diagnosis methods based on deep learning have been significantly developed, the existing methods model spatial and temporal features separately and then weigh them, resulting in the decoupling of spatiotemporal features.

Design/methodology/approach

The authors propose a spatiotemporal long short-term memory (ST-LSTM) method for fault diagnosis of rotating machinery. The authors collected vibration signals from real rolling bearing and gearing test rigs for verification.

Findings

Through these two experiments, the authors demonstrate that machine learning methods still have advantages on small-scale data sets, but our proposed method exhibits a significant advantage due to the simultaneous modeling of the time domain and space domain. These results indicate the potential of the interactive spatiotemporal modeling method for fault diagnosis of rotating machinery.

Originality/value

The authors propose a ST-LSTM method for fault diagnosis of rotating machinery. The authors collected vibration signals from real rolling bearing and gearing test rigs for verification.

Details

Industrial Lubrication and Tribology, vol. 76 no. 2
Type: Research Article
ISSN: 0036-8792

Keywords

Article
Publication date: 28 November 2023

Jiaying Chen, Cheng Li, Liyao Huang and Weimin Zheng

Incorporating dynamic spatial effects exhibits considerable potential in improving the accuracy of forecasting tourism demands. This study aims to propose an innovative deep…

Abstract

Purpose

Incorporating dynamic spatial effects exhibits considerable potential in improving the accuracy of forecasting tourism demands. This study aims to propose an innovative deep learning model for capturing dynamic spatial effects.

Design/methodology/approach

A novel deep learning model founded on the transformer architecture, called the spatiotemporal transformer network, is presented. This model has three components: the temporal transformer, spatial transformer and spatiotemporal fusion modules. The dynamic temporal dependencies of each attraction are extracted efficiently by the temporal transformer module. The dynamic spatial correlations between attractions are extracted efficiently by the spatial transformer module. The extracted dynamic temporal and spatial features are fused in a learnable manner in the spatiotemporal fusion module. Convolutional operations are implemented to generate the final forecasts.

Findings

The results indicate that the proposed model performs better in forecasting accuracy than some popular benchmark models, demonstrating its significant forecasting performance. Incorporating dynamic spatiotemporal features is an effective strategy for improving forecasting. It can provide an important reference to related studies.

Practical implications

The proposed model leverages high-frequency data to achieve accurate predictions at the micro level by incorporating dynamic spatial effects. Destination managers should fully consider the dynamic spatial effects of attractions when planning and marketing to promote tourism resources.

Originality/value

This study incorporates dynamic spatial effects into tourism demand forecasting models by using a transformer neural network. It advances the development of methodologies in related fields.

目的

纳入动态空间效应在提高旅游需求预测的准确性方面具有相当大的潜力。本研究提出了一种捕捉动态空间效应的创新型深度学习模型。

设计/方法/途径

本研究提出了一种基于变压器架构的新型深度学习模型, 称为时空变压器网络。该模型由三个部分组成:时空转换器、空间转换器和时空融合模块。时空转换器模块可有效提取每个景点的动态时间依赖关系。空间转换器模块可有效提取景点之间的动态空间相关性。提取的动态时间和空间特征在时空融合模块中以可学习的方式进行融合。通过卷积运算生成最终预测结果。

研究结果

结果表明, 与一些流行的基准模型相比, 所提出的模型在预测准确性方面表现更好, 证明了其显著的预测性能。纳入动态时空特征是改进预测的有效策略。它可为相关研究提供重要参考。

实践意义

所提出的模型利用高频数据, 通过纳入动态空间效应, 在微观层面上实现了准确预测。旅游目的地管理者在规划和营销推广旅游资源时, 应充分考虑景点的动态空间效应。

原创性/价值

本研究通过使用变压器神经网络, 将动态空间效应纳入旅游需求预测模型。它推动了相关领域方法论的发展。

Objetivo

La incorporación de efectos espaciales dinámicos ofrece un considerable potencial para mejorar la precisión de la previsión de la demanda turística. Este estudio propone un modelo innovador de aprendizaje profundo para capturar los efectos espaciales dinámicos.

Diseño/metodología/enfoque

Se presenta un novedoso modelo de aprendizaje profundo basado en la arquitectura transformadora, denominado red de transformador espaciotemporal. Este modelo tiene tres componentes: el transformador temporal, el transformador espacial y los módulos de fusión espaciotemporal. El módulo transformador temporal extrae de manera eficiente las dependencias temporales dinámicas de cada atracción. El módulo transformador espacial extrae eficientemente las correlaciones espaciales dinámicas entre las atracciones. Las características dinámicas temporales y espaciales extraídas se fusionan de manera que se puede aprender en el módulo de fusión espaciotemporal. Se aplican operaciones convolucionales para generar las previsiones finales.

Conclusiones

Los resultados indican que el modelo propuesto obtiene mejores resultados en la precisión de las previsiones que algunos modelos de referencia conocidos, lo que demuestra su importante capacidad de previsión. La incorporación de características espaciotemporales dinámicas supone una estrategia eficaz para mejorar las previsiones. Esto puede proporcionar una referencia importante para estudios afines.

Implicaciones prácticas

El modelo propuesto aprovecha los datos de alta frecuencia para lograr predicciones precisas a nivel micro incorporando efectos espaciales dinámicos. Los gestores de destinos deberían tener plenamente en cuenta los efectos espaciales dinámicos de las atracciones en la planificación y marketing para la promoción de los recursos turísticos.

Originalidad/valor

Este estudio incorpora efectos espaciales dinámicos a los modelos de previsión de la demanda turística mediante el empleo de una red neuronal transformadora. Supone un avance en el desarrollo de metodologías en campos afines.

Article
Publication date: 20 April 2015

Takuya Sugitani, Masumi Shirakawa, Takahiro Hara and Shojiro Nishio

The purpose of this paper is to propose a method to detect local events in real time using Twitter, an online microblogging platform. The authors especially aim at detecting local…

Abstract

Purpose

The purpose of this paper is to propose a method to detect local events in real time using Twitter, an online microblogging platform. The authors especially aim at detecting local events regardless of the type and scale.

Design/methodology/approach

The method is based on the observation that relevant tweets (Twitter posts) are simultaneously posted from the place where a local event is happening. Specifically, the method first extracts the place where and the time when multiple tweets are posted using a hierarchical clustering technique. It next detects the co-occurrences of key terms in each spatiotemporal cluster to find local events. To determine key terms, it computes the term frequency-inverse document frequency (TFIDF) scores based on the spatiotemporal locality of tweets.

Findings

From the experimental results using geotagged tweet data between 9 a.m. and 3 p.m. on October 9, 2011, the method significantly improved the precision of between 50 and 100 per cent at the same recall compared to a baseline method.

Originality/value

In contrast to existing work, the method described in this paper can detect various types of small-scale local events as well as large-scale ones by incorporating the spatiotemporal feature of tweet postings and the text relevance of tweets. The findings will be useful to researchers who are interested in real-time event detection using microblogs.

Details

International Journal of Web Information Systems, vol. 11 no. 1
Type: Research Article
ISSN: 1744-0084

Keywords

Article
Publication date: 6 July 2023

Mengda Xing, Weilong Ding, Tianpu Zhang and Han Li

Remaining useful life (RUL) prediction for power transformer maintenance is a challenging task on heterogeneous data. Monitoring data of power transformers are not always…

Abstract

Purpose

Remaining useful life (RUL) prediction for power transformer maintenance is a challenging task on heterogeneous data. Monitoring data of power transformers are not always compatible or in an identical format; therefore, RUL predictions traditionally work separately on different data. Moreover, chemical molecules used in RUL prediction can be transformed into each other under different conditions, thus forming a complete graph with uncertain adjacency matrix (UAM). This study aims to find and evaluate a new model to achieve better results of RUL prediction than the other baselines.

Design/methodology/approach

In this work, the authors propose a spatiotemporal complete graph convolutional network (STCGCN) for RUL prediction in two branches, in which daily and hourly features are extracted from correlated heterogeneous data separately. This study provides a thorough evaluation of the proposed model on real-world data and compare the proposed model with state-of-the-art RUL prediction models.

Findings

By using the multibranch structure and EucCos similarity aggregation, STCGCN was able to capture the dynamic spatiotemporal patterns on a variety of heterogeneous data and obtain more accurate prediction results, compared to other time series prediction methods.

Originality/value

In this work, the authors propose a novel multibranch structure to compute feature maps from two heterogeneous data sources efficiently and a novel similarity aggregation method to compute the spatial UAM within the complete graph. Compared with traditional time series prediction models, the model pays attention to the spatial relationships in time series data.

Details

International Journal of Web Information Systems, vol. 19 no. 2
Type: Research Article
ISSN: 1744-0084

Keywords

Open Access
Article
Publication date: 8 November 2022

Yilong Ren and Jianbin Wang

The missing travel time data for roads is a common problem encountered by traffic management departments. Tensor decomposition, as one of the most widely used method for…

Abstract

Purpose

The missing travel time data for roads is a common problem encountered by traffic management departments. Tensor decomposition, as one of the most widely used method for completing missing traffic data, plays a significant role in the intelligent transportation system (ITS). However, existing methods of tensor decomposition focus on the global data structure, resulting in relatively low accuracy in fibrosis missing scenarios. Therefore, this paper aims to propose a novel tensor decomposition model which further considers the local spatiotemporal similarity for fibrosis missing to improve travel time completion accuracy.

Design/methodology/approach

The proposed model can aggregate road sections with similar physical attributes by spatial clustering, and then it calculates the temporal association of road sections by the dynamic longest common subsequence. A similarity relationship matrix in the temporal dimension is constructed and incorporated into the tensor completion model, which can enhance the local spatiotemporal relationship of the missing parts of the fibrosis type.

Findings

The experiment shows that this method is superior and robust. Compared with other baseline models, this method has the smallest error and maintains good completion results despite high missing rates.

Originality/value

This model has higher accuracy for the fibrosis missing and performs good convergence effects in the case of the high missing rate.

Details

Smart and Resilient Transportation, vol. 4 no. 3
Type: Research Article
ISSN: 2632-0487

Keywords

Article
Publication date: 16 August 2022

Liyao Huang, Cheng Li and Weimin Zheng

Given the importance of spatial effects in improving the accuracy of hotel demand forecasting, this study aims to introduce price and online rating, two critical factors…

Abstract

Purpose

Given the importance of spatial effects in improving the accuracy of hotel demand forecasting, this study aims to introduce price and online rating, two critical factors influencing hotel demand, as external variables into the model, and capture the spatial and temporal correlation of hotel demand within the region.

Design/methodology/approach

For high practical implications, the authors conduct the case study in Xiamen, China, where the hotel industry is prosperous. Based on the daily demand data of 118 hotels before and during the COVID-19 period (from January to June 2019 and from January to June 2021), the authors evaluate the prediction performance of the proposed innovative model, that is, a deep learning-based model, incorporating graph convolutional networks (GCN) and gated recurrent units.

Findings

The proposed model simultaneously predicts the daily demand of multiple hotels. It effectively captures the spatial-temporal characteristics of hotel demand. In addition, the features, price and online rating of competing hotels can further improve predictive performance. Meanwhile, the robustness of the model is verified by comparing the forecasting results for different periods (during and before the COVID-19 period).

Practical implications

From a long-term management perspective, long-term observation of market competitors’ rankings and price changes can facilitate timely adjustment of corresponding management measures, especially attention to extremely critical factors affecting forecast demand, such as price. While from a short-term operational perspective, short-term demand forecasting can greatly improve hotel operational efficiency, such as optimizing resource allocation and dynamically adjusting prices. The proposed model not only achieves short-term demand forecasting, but also greatly improves the forecasting accuracy by considering factors related to competitors in the same region.

Originality/value

The originalities of the study are as follows. First, this study represents a pioneering attempt to incorporate demand, price and online rating of other hotels into the forecasting model. Second, integrated deep learning models based on GCN and gated recurrent unit complement existing predictive models using historical data in a methodological sense.

Details

International Journal of Contemporary Hospitality Management, vol. 35 no. 1
Type: Research Article
ISSN: 0959-6119

Keywords

Open Access
Article
Publication date: 13 February 2024

Ke Zhang and Ailing Huang

The purpose of this paper is to provide a guiding framework for studying the travel patterns of PT users. The combination of public transit (PT) users’ travel data and user…

Abstract

Purpose

The purpose of this paper is to provide a guiding framework for studying the travel patterns of PT users. The combination of public transit (PT) users’ travel data and user profiling (UP) technology to draw a portrait of PT users can effectively understand users’ travel patterns, which is important to help optimize the scheduling of PT operations and planning of the network.

Design/methodology/approach

To achieve the purpose, the paper presents a three-level classification method to construct the labeling framework. A station area attribute mining method based on the term frequency-inverse document frequency weighting algorithm is proposed to determine the point of interest attributes of user travel stations, and the spatial correlation patterns of user travel stations are calculated by Moran’s Index. User travel feature labels are extracted from travel data containing Beijing PT data for one consecutive week.

Findings

In this paper, a universal PT user labeling system is obtained and some related methods are conducted including four categories of user-preferred travel area patterns mining and a station area attribute mining method. In the application of the Beijing case, a precise exploration of the spatiotemporal characteristics of PT users is conducted, resulting in the final Beijing PTUP system.

Originality/value

This paper combines UP technology with big data analysis techniques to study the travel patterns of PT users. A user profile label framework is constructed, and data visualization, statistical analysis and K-means clustering are applied to extract specific labels instructed by this system framework. Through these analytical processes, the user labeling system is improved, and its applicability is validated through the analysis of a Beijing PT case.

Details

Smart and Resilient Transportation, vol. 6 no. 1
Type: Research Article
ISSN: 2632-0487

Keywords

1 – 10 of 692