Search results
1 – 10 of 201Xiaojie Xu and Yun Zhang
The Chinese housing market has witnessed rapid growth during the past decade and the significance of housing price forecasting has undoubtedly elevated, becoming an important…
Abstract
Purpose
The Chinese housing market has witnessed rapid growth during the past decade and the significance of housing price forecasting has undoubtedly elevated, becoming an important issue to investors and policymakers. This study aims to examine neural networks (NNs) for office property price index forecasting from 10 major Chinese cities for July 2005–April 2021.
Design/methodology/approach
The authors aim at building simple and accurate NNs to contribute to pure technical forecasts of the Chinese office property market. To facilitate the analysis, the authors explore different model settings over algorithms, delays, hidden neurons and data-spitting ratios.
Findings
The authors reach a simple NN with three delays and three hidden neurons, which leads to stable performance of about 1.45% average relative root mean square error across the 10 cities for the training, validation and testing phases.
Originality/value
The results could be used on a standalone basis or combined with fundamental forecasts to form perspectives of office property price trends and conduct policy analysis.
Details
Keywords
Daniel Šandor and Marina Bagić Babac
Sarcasm is a linguistic expression that usually carries the opposite meaning of what is being said by words, thus making it difficult for machines to discover the actual meaning…
Abstract
Purpose
Sarcasm is a linguistic expression that usually carries the opposite meaning of what is being said by words, thus making it difficult for machines to discover the actual meaning. It is mainly distinguished by the inflection with which it is spoken, with an undercurrent of irony, and is largely dependent on context, which makes it a difficult task for computational analysis. Moreover, sarcasm expresses negative sentiments using positive words, allowing it to easily confuse sentiment analysis models. This paper aims to demonstrate the task of sarcasm detection using the approach of machine and deep learning.
Design/methodology/approach
For the purpose of sarcasm detection, machine and deep learning models were used on a data set consisting of 1.3 million social media comments, including both sarcastic and non-sarcastic comments. The data set was pre-processed using natural language processing methods, and additional features were extracted and analysed. Several machine learning models, including logistic regression, ridge regression, linear support vector and support vector machines, along with two deep learning models based on bidirectional long short-term memory and one bidirectional encoder representations from transformers (BERT)-based model, were implemented, evaluated and compared.
Findings
The performance of machine and deep learning models was compared in the task of sarcasm detection, and possible ways of improvement were discussed. Deep learning models showed more promise, performance-wise, for this type of task. Specifically, a state-of-the-art model in natural language processing, namely, BERT-based model, outperformed other machine and deep learning models.
Originality/value
This study compared the performance of the various machine and deep learning models in the task of sarcasm detection using the data set of 1.3 million comments from social media.
Details
Keywords
Ruchi Kejriwal, Monika Garg and Gaurav Sarin
Stock market has always been lucrative for various investors. But, because of its speculative nature, it is difficult to predict the price movement. Investors have been using both…
Abstract
Purpose
Stock market has always been lucrative for various investors. But, because of its speculative nature, it is difficult to predict the price movement. Investors have been using both fundamental and technical analysis to predict the prices. Fundamental analysis helps to study structured data of the company. Technical analysis helps to study price trends, and with the increasing and easy availability of unstructured data have made it important to study the market sentiment. Market sentiment has a major impact on the prices in short run. Hence, the purpose is to understand the market sentiment timely and effectively.
Design/methodology/approach
The research includes text mining and then creating various models for classification. The accuracy of these models is checked using confusion matrix.
Findings
Out of the six machine learning techniques used to create the classification model, kernel support vector machine gave the highest accuracy of 68%. This model can be now used to analyse the tweets, news and various other unstructured data to predict the price movement.
Originality/value
This study will help investors classify a news or a tweet into “positive”, “negative” or “neutral” quickly and determine the stock price trends.
Details
Keywords
Emerson Norabuena-Figueroa, Roger Rurush-Asencio, K. P. Jaheer Mukthar, Jose Sifuentes-Stratti and Elia Ramírez-Asís
The development of information technologies has led to a considerable transformation in human resource management from conventional or commonly known as personnel management to…
Abstract
The development of information technologies has led to a considerable transformation in human resource management from conventional or commonly known as personnel management to modern one. Data mining technology, which has been widely used in several applications, including those that function on the web, includes clustering algorithms as a key component. Web intelligence is a recent academic field that calls for sophisticated analytics and machine learning techniques to facilitate information discovery, particularly on the web. Human resource data gathered from the web are typically enormous, highly complex, dynamic, and unstructured. Traditional clustering methods need to be upgraded because they are ineffective. Standard clustering algorithms are enhanced and expanded with optimization capabilities to address this difficulty by swarm intelligence, a subset of nature-inspired computing. We collect the initial raw human resource data and preprocess the data wherein data cleaning, data normalization, and data integration takes place. The proposed K-C-means-data driven cuckoo bat optimization algorithm (KCM-DCBOA) is used for clustering of the human resource data. The feature extraction is done using principal component analysis (PCA) and the classification of human resource data is done using support vector machine (SVM). Other approaches from the literature were contrasted with the suggested approach. According to the experimental findings, the suggested technique has extremely promising features in terms of the quality of clustering and execution time.
Details
Keywords
Yansen Wu, Dongsheng Wen, Anmin Zhao, Haobo Liu and Ke Li
This study aims to study the thermal identification issue by harvesting both solar energy and atmospheric thermal updraft for a solar-powered unmanned aerial vehicle (SUAV) and…
Abstract
Purpose
This study aims to study the thermal identification issue by harvesting both solar energy and atmospheric thermal updraft for a solar-powered unmanned aerial vehicle (SUAV) and its electric energy performance under continuous soaring conditions.
Design/methodology/approach
The authors develop a specific dynamic model for SUAVs in both soaring and cruise modes. The support vector machine regression (SVMR) is adopted to estimate the thermal position, and it is combined with feedback control to implement the SUAV soaring in the updraft. Then, the optimal path model is built based on the graph theory considering the existence of several thermals distributed in the environment. The procedure is proposed to estimate the electricity cost of SUAV during flight as well as soaring, and making use of dynamic programming to maximize electric energy.
Findings
The simulation results present the integrated control method could allow SUAV to soar with the updraft. In addition, the proposed approach allows the SUAV to fly to the destination using distributed thermals while reducing the electric energy use.
Originality/value
Two simplified dynamic models are constructed for simulation considering there are different flight mode. Besides, the data-driven-based SVMR method is proposed to support SUAV soaring. Furthermore, instead of using length, the energy cost coefficient in optimization problem is set as electric power, which is more suitable for SUAV because its advantage is to transfer the three-dimensional path planning problem into the two-dimensional.
Details
Keywords
S. Thavasi and T. Revathi
With so many placement opportunities around the students in their final or prefinal year, they start to feel the strain of the season. The students feel the need to be aware of…
Abstract
Purpose
With so many placement opportunities around the students in their final or prefinal year, they start to feel the strain of the season. The students feel the need to be aware of their position and how to increase their chances of being hired. Hence, a system to guide their career is one of the needs of the day.
Design/methodology/approach
The job role prediction system utilizes machine learning techniques such as Naïve Bayes, K-Nearest Neighbor, Support Vector machines (SVM) and Artificial Neural Networks (ANN) to suggest a student’s job role based on their academic performance and course outcomes (CO), out of which ANN performs better. The system uses the Mepco Schlenk Engineering College curriculum, placement and students’ Assessment data sets, in which the CO and syllabus are used to determine the skills that the student has gained from their courses. The necessary skills for a job position are then extracted from the job advertisements. The system compares the student’s skills with the required skills for the job role based on the placement prediction result.
Findings
The system predicts placement possibilities with an accuracy of 93.33 and 98% precision. Also, the skill analysis for students gives the students information about their skill-set strengths and weaknesses.
Research limitations/implications
For skill-set analysis, only the direct assessment of the students is considered. Indirect assessment shall also be considered for future scope.
Practical implications
The model is adaptable and flexible (customizable) to any type of academic institute or universities.
Social implications
The research will be very much useful for the students community to bridge the gap between the academic and industrial needs.
Originality/value
Several works are done for career guidance for the students. However, these career guidance methodologies are designed only using the curriculum and students’ basic personal information. The proposed system will consider the students’ academic performance through direct assessment, along with their curriculum and basic personal information.
Details
Keywords
Koraljka Golub, Osma Suominen, Ahmed Taiye Mohammed, Harriet Aagaard and Olof Osterman
In order to estimate the value of semi-automated subject indexing in operative library catalogues, the study aimed to investigate five different automated implementations of an…
Abstract
Purpose
In order to estimate the value of semi-automated subject indexing in operative library catalogues, the study aimed to investigate five different automated implementations of an open source software package on a large set of Swedish union catalogue metadata records, with Dewey Decimal Classification (DDC) as the target classification system. It also aimed to contribute to the body of research on aboutness and related challenges in automated subject indexing and evaluation.
Design/methodology/approach
On a sample of over 230,000 records with close to 12,000 distinct DDC classes, an open source tool Annif, developed by the National Library of Finland, was applied in the following implementations: lexical algorithm, support vector classifier, fastText, Omikuji Bonsai and an ensemble approach combing the former four. A qualitative study involving two senior catalogue librarians and three students of library and information studies was also conducted to investigate the value and inter-rater agreement of automatically assigned classes, on a sample of 60 records.
Findings
The best results were achieved using the ensemble approach that achieved 66.82% accuracy on the three-digit DDC classification task. The qualitative study confirmed earlier studies reporting low inter-rater agreement but also pointed to the potential value of automatically assigned classes as additional access points in information retrieval.
Originality/value
The paper presents an extensive study of automated classification in an operative library catalogue, accompanied by a qualitative study of automated classes. It demonstrates the value of applying semi-automated indexing in operative information retrieval systems.
Details
Keywords
Shiqin Zeng, Frederick Chung and Baabak Ashuri
Completing Right-of-Way (ROW) acquisition process on schedule is critical to avoid delays and cost overruns on transportation projects. However, transportation agencies face…
Abstract
Purpose
Completing Right-of-Way (ROW) acquisition process on schedule is critical to avoid delays and cost overruns on transportation projects. However, transportation agencies face challenges in accurately forecasting ROW acquisition timelines in the early stage of projects due to complex nature of acquisition process and limited design information. There is a need of improving accuracy of estimating ROW acquisition duration during the early phase of project development and quantitatively identifying risk factors affecting the duration.
Design/methodology/approach
The quantitative research methodology used to develop the forecasting model includes an ensemble algorithm based on decision tree and adaptive boosting techniques. A dataset of Georgia Department of Transportation projects held from 2010 to 2019 is utilized to demonstrate building the forecasting model. Furthermore, sensitivity analysis is performed to identify critical drivers of ROW acquisition durations.
Findings
The forecasting model developed in this research achieves a high accuracy to predict ROW durations by explaining 74% of the variance in ROW acquisition durations using project features, which is outperforming single regression tree, multiple linear regression and support vector machine. Moreover, number of parcels, average cost estimation per parcel, length of projects, number of condemnations, number of relocations and type of work are found to be influential factors as drivers of ROW acquisition duration.
Originality/value
This research contributes to the state of knowledge in estimating ROW acquisition timeline through (1) developing a novel machine learning model to accurately estimate ROW acquisition timelines, and (2) identifying drivers (i.e. risk factors) of ROW acquisition durations. The findings of this research will provide transportation agencies with insights on how to improve practices in scheduling ROW acquisition process.
Details
Keywords
Jianping Zhang, Leilei Wang and Guodong Wang
With the rapid advancement in the automotive industry, the friction coefficient (FC), wear rate (WR) and weight loss (WL) have emerged as crucial parameters to measure the…
Abstract
Purpose
With the rapid advancement in the automotive industry, the friction coefficient (FC), wear rate (WR) and weight loss (WL) have emerged as crucial parameters to measure the performance of automotive braking systems, so the FC, WR and WL of friction material are predicted and analyzed in this work, with an aim of achieving accurate prediction of friction material properties.
Design/methodology/approach
Genetic algorithm support vector machine (GA-SVM) model is obtained by applying GA to optimize the SVM in this work, thus establishing a prediction model for friction material properties and achieving the predictive and comparative analysis of friction material properties. The process parameters are analyzed by using response surface methodology (RSM) and GA-RSM to determine them for optimal friction performance.
Findings
The results indicate that the GA-SVM prediction model has the smallest error for FC, WR and WL, showing that it owns excellent prediction accuracy. The predicted values obtained by response surface analysis are closed to those of GA-SVM model, providing further evidence of the validity and the rationality of the established prediction model.
Originality/value
The relevant results can serve as a valuable theoretical foundation for the preparation of friction material in engineering practice.
Details
Keywords
Muralidhar Vaman Kamath, Shrilaxmi Prashanth, Mithesh Kumar and Adithya Tantri
The compressive strength of concrete depends on many interdependent parameters; its exact prediction is not that simple because of complex processes involved in strength…
Abstract
Purpose
The compressive strength of concrete depends on many interdependent parameters; its exact prediction is not that simple because of complex processes involved in strength development. This study aims to predict the compressive strength of normal concrete and high-performance concrete using four datasets.
Design/methodology/approach
In this paper, five established individual Machine Learning (ML) regression models have been compared: Decision Regression Tree, Random Forest Regression, Lasso Regression, Ridge Regression and Multiple-Linear regression. Four datasets were studied, two of which are previous research datasets, and two datasets are from the sophisticated lab using five established individual ML regression models.
Findings
The five statistical indicators like coefficient of determination (R2), mean absolute error, root mean squared error, Nash–Sutcliffe efficiency and mean absolute percentage error have been used to compare the performance of the models. The models are further compared using statistical indicators with previous studies. Lastly, to understand the variable effect of the predictor, the sensitivity and parametric analysis were carried out to find the performance of the variable.
Originality/value
The findings of this paper will allow readers to understand the factors involved in identifying the machine learning models and concrete datasets. In so doing, we hope that this research advances the toolset needed to predict compressive strength.
Details