Search results

1 – 10 of over 1000
Article
Publication date: 26 August 2014

Bilal M’hamed Abidine, Belkacem Fergani, Mourad Oussalah and Lamya Fergani

The task of identifying activity classes from sensor information in smart home is very challenging because of the imbalanced nature of such data set where some activities occur…

Abstract

Purpose

The task of identifying activity classes from sensor information in smart home is very challenging because of the imbalanced nature of such data set where some activities occur more frequently than others. Typically probabilistic models such as Hidden Markov Model (HMM) and Conditional Random Fields (CRF) are known as commonly employed for such purpose. The paper aims to discuss these issues.

Design/methodology/approach

In this work, the authors propose a robust strategy combining the Synthetic Minority Over-sampling Technique (SMOTE) with Cost Sensitive Support Vector Machines (CS-SVM) with an adaptive tuning of cost parameter in order to handle imbalanced data problem.

Findings

The results have demonstrated the usefulness of the approach through comparison with state of art of approaches including HMM, CRF, the traditional C-Support vector machines (C-SVM) and the Cost-Sensitive-SVM (CS-SVM) for classifying the activities using binary and ubiquitous sensors.

Originality/value

Performance metrics in the experiment/simulation include Accuracy, Precision/Recall and F measure.

Details

Kybernetes, vol. 43 no. 8
Type: Research Article
ISSN: 0368-492X

Keywords

Open Access
Article
Publication date: 10 May 2022

Jindong Song, Jingbao Zhu and Shanyou Li

Using the strong motion data of K-net in Japan, the continuous magnitude prediction method based on support vector machine (SVM) was studied.

Abstract

Purpose

Using the strong motion data of K-net in Japan, the continuous magnitude prediction method based on support vector machine (SVM) was studied.

Design/methodology/approach

In the range of 0.5–10.0 s after the P-wave arrival, the prediction time window was established at an interval of 0.5 s. 12 P-wave characteristic parameters were selected as the model input parameters to construct the earthquake early warning (EEW) magnitude prediction model (SVM-HRM) for high-speed railway based on SVM.

Findings

The magnitude prediction results of the SVM-HRM model were compared with the traditional magnitude prediction model and the high-speed railway EEW current norm. Results show that at the 3.0 s time window, the magnitude prediction error of the SVM-HRM model is obviously smaller than that of the traditional τc method and Pd method. The overestimation of small earthquakes is obviously improved, and the construction of the model is not affected by epicenter distance, so it has generalization performance. For earthquake events with the magnitude range of 3–5, the single station realization rate of the SVM-HRM model reaches 95% at 0.5 s after the arrival of P-wave, which is better than the first alarm realization rate norm required by “The Test Method of EEW and Monitoring System for High-Speed Railway.” For earthquake events with magnitudes ranging from 3 to 5, 5 to 7 and 7 to 8, the single station realization rate of the SVM-HRM model is at 0.5 s, 1.5 s and 0.5 s after the P-wave arrival, respectively, which is better than the realization rate norm of multiple stations.

Originality/value

At the latest, 1.5 s after the P-wave arrival, the SVM-HRM model can issue the first earthquake alarm that meets the norm of magnitude prediction realization rate, which meets the accuracy and continuity requirements of high-speed railway EEW magnitude prediction.

Details

Railway Sciences, vol. 1 no. 2
Type: Research Article
ISSN: 2755-0907

Keywords

Article
Publication date: 18 April 2017

Yanjie Wang, Zhengchao Xie, InChio Lou, Wai Kin Ung and Kai Meng Mok

The purpose of this paper is to examine the applicability and capability of models based on a genetic algorithm and support vector machine (GA-SVM) and a genetic algorithm and…

Abstract

Purpose

The purpose of this paper is to examine the applicability and capability of models based on a genetic algorithm and support vector machine (GA-SVM) and a genetic algorithm and relevance vector machine (GA-RVM) for the prediction of phytoplankton abundances associated with algal blooms in a Macau freshwater reservoir, and compare their performances with an artificial neural network (ANN) model.

Design/methodology/approach

The hybrid models GA-SVM and GA-RVM were developed for the optimal control of parameters for predicting (based on the current month’s variables) and forecasting (based on the previous three months’ variables) phytoplankton dynamics in a Macau freshwater reservoir, MSR, which has experienced cyanobacterial blooms in recent years. There were 15 environmental parameters, including pH, SiO2, alkalinity, bicarbonate (HCO3−), dissolved oxygen (DO), total nitrogen (TN), UV254, turbidity, conductivity, nitrate (NO3−), orthophosphate (PO43−), total phosphorus (TP), suspended solids (SS) and total organic carbon (TOC) selected from the correlation analysis, with eight years (2001-2008) of data for training, and the most recent three years (2009-2011) for testing.

Findings

For both accuracy performance and generalized performance, the ANN, GA-SVM and GA-RVM had similar predictive powers of R2 of 0.73-0.75. However, whereas ANN and GA-RVM models showed very similar forecast performances, GA-SVM models had better forecast performances of R2 (0.862), RMSE (0.266) and MAE (0.0710) with the respective parameters of 0.987, 0.161 and 0.032 optimized using GA.

Originality/value

This is the first application of GA-SVM and GA-RVM models for predicting and forecasting algal bloom in freshwater reservoirs. GA-SVM was shown to be an effective new way for monitoring algal bloom problem in water resources.

Details

Engineering Computations, vol. 34 no. 2
Type: Research Article
ISSN: 0264-4401

Keywords

Article
Publication date: 21 May 2021

Saddam Bensaoucha, Youcef Brik, Sandrine Moreau, Sid Ahmed Bessedik and Aissa Ameur

This paper provides an effective study to detect and locate the inter-turn short-circuit faults (ITSC) in a three-phase induction motor (IM) using the support vector machine…

331

Abstract

Purpose

This paper provides an effective study to detect and locate the inter-turn short-circuit faults (ITSC) in a three-phase induction motor (IM) using the support vector machine (SVM). The characteristics extracted from the analysis of the phase shifts between the stator currents and their corresponding voltages are used as inputs to train the SVM. The latter automatically decides on the IM state, either a healthy motor or a short-circuit fault on one of its three phases.

Design/methodology/approach

To evaluate the performance of the SVM, three supervised algorithms of machine learning, namely, multi-layer perceptron neural networks (MLPNNs), radial basis function neural networks (RBFNNs) and extreme learning machine (ELM) are used along with the SVM in this study. Thus, all classifiers (SVM, MLPNN, RBFNN and ELM) are tested and the results are compared with the same data set.

Findings

The obtained results showed that the SVM outperforms MLPNN, RBFNNs and ELM to diagnose the health status of the IM. Especially, this technique (SVM) provides an excellent performance because it is able to detect a fault of two short-circuited turns (early detection) when the IM is operating under a low load.

Originality/value

The original of this work is to use the SVM algorithm based on the phase shift between the stator currents and their voltages as inputs to detect and locate the ITSC fault.

Details

COMPEL - The international journal for computation and mathematics in electrical and electronic engineering , vol. 40 no. 3
Type: Research Article
ISSN: 0332-1649

Keywords

Open Access
Article
Publication date: 28 July 2020

Harleen Kaur and Vinita Kumari

Diabetes is a major metabolic disorder which can affect entire body system adversely. Undiagnosed diabetes can increase the risk of cardiac stroke, diabetic nephropathy and other…

11357

Abstract

Diabetes is a major metabolic disorder which can affect entire body system adversely. Undiagnosed diabetes can increase the risk of cardiac stroke, diabetic nephropathy and other disorders. All over the world millions of people are affected by this disease. Early detection of diabetes is very important to maintain a healthy life. This disease is a reason of global concern as the cases of diabetes are rising rapidly. Machine learning (ML) is a computational method for automatic learning from experience and improves the performance to make more accurate predictions. In the current research we have utilized machine learning technique in Pima Indian diabetes dataset to develop trends and detect patterns with risk factors using R data manipulation tool. To classify the patients into diabetic and non-diabetic we have developed and analyzed five different predictive models using R data manipulation tool. For this purpose we used supervised machine learning algorithms namely linear kernel support vector machine (SVM-linear), radial basis function (RBF) kernel support vector machine, k-nearest neighbour (k-NN), artificial neural network (ANN) and multifactor dimensionality reduction (MDR).

Article
Publication date: 27 February 2020

Seyed Amin Bagherzadeh

This paper aims to propose a nonlinear model for aeroelastic aircraft that can predict the flight parameters throughout the investigated flight envelopes.

Abstract

Purpose

This paper aims to propose a nonlinear model for aeroelastic aircraft that can predict the flight parameters throughout the investigated flight envelopes.

Design/methodology/approach

A system identification method based on the support vector machine (SVM) is developed and applied to the nonlinear dynamics of an aeroelastic aircraft. In the proposed non-parametric gray-box method, force and moment coefficients are estimated based on the state variables, flight conditions and control commands. Then, flight parameters are estimated using aircraft equations of motion. Nonlinear system identification is performed using the SVM network by minimizing errors between the calculated and estimated force and moment coefficients. To that end, a least squares algorithm is used as the training rule to optimize the generalization bound given for the regression.

Findings

The results confirm that the SVM is successful at the aircraft system identification. The precision of the SVM model is preserved when the models are excited by input commands different from the training ones. Also, the generalization of the SVM model is acceptable at non-trained flight conditions within the trained flight conditions. Considering the precision and generalization of the model, the results indicate that the SVM is more successful than the well-known methods such as artificial neural networks.

Practical implications

In this paper, both the simulated and real flight data of the F/A-18 aircraft are used to provide aeroelastic models for its lateral-directional dynamics.

Originality/value

This paper proposes a non-parametric system identification method for aeroelastic aircraft based on the SVM method for the first time. Up to the author’s best knowledge, the SVM is not used for the aircraft system identification or the aircraft parameter estimation until now.

Details

Aircraft Engineering and Aerospace Technology, vol. 92 no. 3
Type: Research Article
ISSN: 1748-8842

Keywords

Article
Publication date: 28 February 2023

Meltem Aksoy, Seda Yanık and Mehmet Fatih Amasyali

When a large number of project proposals are evaluated to allocate available funds, grouping them based on their similarities is beneficial. Current approaches to group proposals…

Abstract

Purpose

When a large number of project proposals are evaluated to allocate available funds, grouping them based on their similarities is beneficial. Current approaches to group proposals are primarily based on manual matching of similar topics, discipline areas and keywords declared by project applicants. When the number of proposals increases, this task becomes complex and requires excessive time. This paper aims to demonstrate how to effectively use the rich information in the titles and abstracts of Turkish project proposals to group them automatically.

Design/methodology/approach

This study proposes a model that effectively groups Turkish project proposals by combining word embedding, clustering and classification techniques. The proposed model uses FastText, BERT and term frequency/inverse document frequency (TF/IDF) word-embedding techniques to extract terms from the titles and abstracts of project proposals in Turkish. The extracted terms were grouped using both the clustering and classification techniques. Natural groups contained within the corpus were discovered using k-means, k-means++, k-medoids and agglomerative clustering algorithms. Additionally, this study employs classification approaches to predict the target class for each document in the corpus. To classify project proposals, various classifiers, including k-nearest neighbors (KNN), support vector machines (SVM), artificial neural networks (ANN), classification and regression trees (CART) and random forest (RF), are used. Empirical experiments were conducted to validate the effectiveness of the proposed method by using real data from the Istanbul Development Agency.

Findings

The results show that the generated word embeddings can effectively represent proposal texts as vectors, and can be used as inputs for clustering or classification algorithms. Using clustering algorithms, the document corpus is divided into five groups. In addition, the results demonstrate that the proposals can easily be categorized into predefined categories using classification algorithms. SVM-Linear achieved the highest prediction accuracy (89.2%) with the FastText word embedding method. A comparison of manual grouping with automatic classification and clustering results revealed that both classification and clustering techniques have a high success rate.

Research limitations/implications

The proposed model automatically benefits from the rich information in project proposals and significantly reduces numerous time-consuming tasks that managers must perform manually. Thus, it eliminates the drawbacks of the current manual methods and yields significantly more accurate results. In the future, additional experiments should be conducted to validate the proposed method using data from other funding organizations.

Originality/value

This study presents the application of word embedding methods to effectively use the rich information in the titles and abstracts of Turkish project proposals. Existing research studies focus on the automatic grouping of proposals; traditional frequency-based word embedding methods are used for feature extraction methods to represent project proposals. Unlike previous research, this study employs two outperforming neural network-based textual feature extraction techniques to obtain terms representing the proposals: BERT as a contextual word embedding method and FastText as a static word embedding method. Moreover, to the best of our knowledge, there has been no research conducted on the grouping of project proposals in Turkish.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 16 no. 3
Type: Research Article
ISSN: 1756-378X

Keywords

Open Access
Article
Publication date: 11 April 2023

Wenhao Yi, Mingnian Wang, Jianjun Tong, Siguang Zhao, Jiawang Li, Dengbin Gui and Xiao Zhang

The purpose of the study is to quickly identify significant heterogeneity of surrounding rock of tunnel face that generally occurs during the construction of large-section rock…

Abstract

Purpose

The purpose of the study is to quickly identify significant heterogeneity of surrounding rock of tunnel face that generally occurs during the construction of large-section rock tunnels of high-speed railways.

Design/methodology/approach

Relying on the support vector machine (SVM)-based classification model, the nominal classification of blastholes and nominal zoning and classification terms were used to demonstrate the heterogeneity identification method for the surrounding rock of tunnel face, and the identification calculation was carried out for the five test tunnels. Then, the suggestions for local optimization of the support structures of large-section rock tunnels were put forward.

Findings

The results show that compared with the two classification models based on neural networks, the SVM-based classification model has a higher classification accuracy when the sample size is small, and the average accuracy can reach 87.9%. After the samples are replaced, the SVM-based classification model can still reach the same accuracy, whose generalization ability is stronger.

Originality/value

By applying the identification method described in this paper, the significant heterogeneity characteristics of the surrounding rock in the process of two times of blasting were identified, and the identification results are basically consistent with the actual situation of the tunnel face at the end of blasting, and can provide a basis for local optimization of support parameters.

Details

Railway Sciences, vol. 2 no. 1
Type: Research Article
ISSN: 2755-0907

Keywords

Book part
Publication date: 30 September 2020

B. G. Deepa and S. Senthil

Breast cancer (BC) is one of the leading cancer in the world, BC risk has been there for women of the middle age also, it is the malignant tumor. However, identifying BC in the…

Abstract

Breast cancer (BC) is one of the leading cancer in the world, BC risk has been there for women of the middle age also, it is the malignant tumor. However, identifying BC in the early stage will save most of the women’s life. As there is an advancement in the technology research used Machine Learning (ML) algorithm Random Forest for ranking the feature, Support Vector Machine (SVM), and Naïve Bayes (NB) supervised classifiers for selection of best optimized features and prediction of BC accuracy. The estimation of prediction accuracy has been done by using the dataset Wisconsin Breast Cancer Data from University of California Irvine (UCI) ML repository. To perform all these operation, Anaconda one of the open source distribution of Python has been used. The proposed work resulted in extemporize improvement in the NB and SVM classifier accuracy. The performance evaluation of the proposed model is estimated by using classification accuracy, confusion matrix, mean, standard deviation, variance, and root mean-squared error.

The experimental results shows that 70-30 data split will result in best accuracy. SVM acts as a feature optimizer of 12 best features with the result of 97.66% accuracy and improvement of 1.17% after feature reduction. NB results with feature optimizer 17 of best features with the result of 96.49% accuracy and improvement of 1.17% after feature reduction.

The study shows that proposal model works very effectively as compare to the existing models with respect to accuracy measures.

Details

Big Data Analytics and Intelligence: A Perspective for Health Care
Type: Book
ISBN: 978-1-83909-099-8

Keywords

Article
Publication date: 14 December 2021

Arijit Maji and Indrajit Mukherjee

The purpose of this study is to propose an effective unsupervised one-class-classifier (OCC) support vector machine (SVM)-based single multivariate control chart (OCC-SVM) to…

Abstract

Purpose

The purpose of this study is to propose an effective unsupervised one-class-classifier (OCC) support vector machine (SVM)-based single multivariate control chart (OCC-SVM) to simultaneously monitor “location” and “scale” shifts of a manufacturing process.

Design/methodology/approach

The step-by-step approach to developing, implementing and fine-tuning the intrinsic parameters of the OCC-SVM chart is demonstrated based on simulation and two real-life case examples.

Findings

A comparative study, considering varied known and unknown response distributions, indicates that the OCC-SVM is highly effective in detecting process shifts of samples with individual observations. OCC-SVM chart also shows promising results for samples with a rational subgroup of observations. In addition, the results also indicate that the performance of OCC-SVM is unaffected by the small reference sample size.

Research limitations/implications

The sample responses are considered identically distributed with no significant multivariate autocorrelation between sample observations.

Practical implications

The proposed easy-to-implement chart shows satisfactory performance to detect an out-of-control signal with known or unknown response distributions.

Originality/value

Various multivariate (e.g. parametric or nonparametric) control chart(s) are recommended to monitor the mean (e.g. location) and variance (e.g. scale) of multiple correlated responses in a manufacturing process. However, real-life implementation of a parametric control chart may be complex due to its restrictive response distribution assumptions. There is no evidence of work in the open literature that demonstrates the suitability of an unsupervised OCC-SVM chart to simultaneously monitor “location” and “scale” shifts of multivariate responses. Thus, a new efficient OCC-SVM single chart approach is proposed to address this gap to monitor a multivariate manufacturing process with unknown response distributions.

Details

International Journal of Quality & Reliability Management, vol. 40 no. 2
Type: Research Article
ISSN: 0265-671X

Keywords

1 – 10 of over 1000