Search results

1 – 10 of over 8000
Article
Publication date: 2 November 2021

Huimin Li, Limin Su, Jian Zuo, Xiaowei An, Guanghua Dong, Lunyan Wang and Chengyi Zhang

Abstract

Purpose

Unbalanced bidding can seriously impede the government from obtaining the best value for taxpayers' money in public procurement, since it increases the owner's cost and decreases the fairness of the competitive bidding process. Detecting an unbalanced bid is a challenging task for both researchers and practitioners. This study aims to develop a method for identifying unbalanced bidding in the construction industry.

Design/methodology/approach

The identification of unbalanced bidding is treated as a multi-criteria decision-making (MCDM) problem. A data-driven unit price database is built from historical bidding documents to provide reference unit prices as benchmarks. In the proposed extended TOPSIS method, the data-driven unit price is chosen as the positive ideal solution, and the unit price with the furthest absolute distance measure is chosen as the negative ideal solution. The concept of relative distance is introduced to measure the distances between each bidding unit price and the positive and negative ideal solutions. The degree of unbalanced bidding is then ranked by relative distance.
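The ranking idea in this abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' extended TOPSIS formulation: the function name `unbalance_scores`, the normalization by the reference price, the Euclidean aggregation across line items and the sample prices are all assumptions.

```python
import math

def unbalance_scores(reference, bids):
    """Score each bidder between 0 (matches the reference) and 1 (most unbalanced).

    reference: list of reference unit prices (positive ideal), one per line item.
    bids: dict mapping bidder name -> list of unit prices, same item order.
    """
    n_items = len(reference)
    # Negative ideal per item: the submitted price farthest from the reference.
    negative = [
        max((prices[j] for prices in bids.values()),
            key=lambda p: abs(p - reference[j]))
        for j in range(n_items)
    ]
    scores = {}
    for bidder, prices in bids.items():
        # Distances are scaled by the reference price so items are comparable.
        d_pos = math.sqrt(sum(((prices[j] - reference[j]) / reference[j]) ** 2
                              for j in range(n_items)))
        d_neg = math.sqrt(sum(((prices[j] - negative[j]) / reference[j]) ** 2
                              for j in range(n_items)))
        total = d_pos + d_neg
        # Relative distance: large when far from the positive ideal
        # and close to the negative ideal.
        scores[bidder] = d_pos / total if total else 0.0
    return scores

# Hypothetical prices for two line items and three bidders.
reference_prices = [100.0, 50.0]
bid_prices = {"A": [100.0, 50.0], "B": [150.0, 30.0], "C": [110.0, 48.0]}
scores = unbalance_scores(reference_prices, bid_prices)
ranking = sorted(scores, key=scores.get, reverse=True)
```

In this toy input, bidder B deviates most from the reference prices on both items and is therefore ranked as the most unbalanced, while bidder A matches the reference exactly.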

Findings

The proposed model can be used for the quantitative evaluation of unbalanced bidding from a decision-making perspective. The identification process is developed according to the decision-making process. The findings show that the model helps owners identify unbalanced bidding efficiently and effectively in the bid evaluation stage.

Originality/value

The data-driven reference unit prices improve the accuracy of the benchmark used to evaluate unbalanced bidding. The extended TOPSIS model is applied to identify unbalanced bidding, so that owners can make objective decisions to identify and prevent unbalanced bidding at the procurement stage.

Details

Engineering, Construction and Architectural Management, vol. 30 no. 2
Type: Research Article
ISSN: 0969-9988

Open Access
Article
Publication date: 24 June 2021

Bo Wang, Guanwei Wang, Youwei Wang, Zhengzheng Lou, Shizhe Hu and Yangdong Ye

Abstract

Purpose

Vehicle fault diagnosis is a key factor in ensuring the safe and efficient operation of the railway system. Due to the numerous vehicle categories and different fault mechanisms, there is an unbalanced fault category problem. Most of the current methods to solve this problem have complex algorithm structures, low efficiency and require prior knowledge. This study aims to propose a new method which has a simple structure and does not require any prior knowledge to achieve a fast diagnosis of unbalanced vehicle faults.

Design/methodology/approach

This study proposes a novel K-means method with feature learning, the feature learning K-means-improved cluster-centers selection (FKM-ICS) method, which comprises the ICS and the FKM. Specifically, in the ICS, this study defines a cluster-centers approximation to select the initial cluster centers. In the FKM, this study uses an improved term frequency-inverse document frequency to measure and adjust the feature word weights in each cluster, retains the top τ feature words with the highest weight in each cluster and performs the clustering process again. With the FKM-ICS method, clustering performance for unbalanced vehicle fault diagnosis can be significantly enhanced.
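The feature-word selection step described above (weight the words in each cluster, then keep the top τ) can be illustrated with a short sketch. It uses a plain smoothed TF-IDF that treats each cluster as one document, not the paper's improved variant; the function name `top_tau_words` and the sample fault texts are hypothetical.

```python
import math
from collections import Counter

def top_tau_words(clusters, tau):
    """Return the tau highest-weighted words for each cluster.

    clusters: list of clusters, each a list of tokenized texts (lists of words).
    Each cluster is treated as one document for the TF-IDF computation.
    """
    counts = [Counter(tok for doc in cluster for tok in doc) for cluster in clusters]
    n_clusters = len(clusters)
    kept = []
    for cluster_counts in counts:
        total = sum(cluster_counts.values())
        weights = {}
        for word, freq in cluster_counts.items():
            # Number of clusters in which the word occurs.
            df = sum(1 for other in counts if word in other)
            # Smoothed inverse document frequency (an assumed, standard form).
            idf = math.log((1 + n_clusters) / (1 + df)) + 1
            weights[word] = (freq / total) * idf
        # Sort by descending weight, breaking ties alphabetically.
        kept.append(sorted(weights, key=lambda w: (-weights[w], w))[:tau])
    return kept

# Hypothetical tokenized fault reports, already grouped into two clusters.
fault_clusters = [
    [["brake", "leak", "pipe"], ["brake", "worn", "pad"]],
    [["door", "stuck", "lock"], ["door", "leak", "seal"]],
]
kept_words = top_tau_words(fault_clusters, tau=1)
```

Words such as "leak" that occur in both clusters receive a lower weight than cluster-specific words such as "brake" or "door", so reclustering on the retained words should separate the fault categories more sharply.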

Findings

This study finds that the FKM-ICS can achieve a fast diagnosis of vehicle faults on the vehicle fault text (VFT) data set collected from a railway station in 2017. The experimental results on VFT indicate that the proposed method outperforms several state-of-the-art methods.

Originality/value

This is the first effort to address the vehicle fault diagnostic problem, and the proposed method performs effectively and efficiently. The ICS enables the FKM-ICS method to exclude the effect of outliers and addresses the noisy data contained in the fault text data, which effectively enhances the stability of the method. The FKM enhances the distribution of feature words that discriminate between different fault categories and reduces the number of feature words, so that the FKM-ICS method clusters faster and better for unbalanced vehicle fault diagnosis.

Details

Smart and Resilient Transportation, vol. 3 no. 2
Type: Research Article
ISSN: 2632-0487

Article
Publication date: 9 April 2024

Lu Wang, Jiahao Zheng, Jianrong Yao and Yuangao Chen

Abstract

Purpose

With the rapid growth of the domestic lending industry, assessing whether the borrower of each loan is at risk of default is a pressing issue for financial institutions. Although some models handle such problems well, they still have shortcomings in some aspects. The purpose of this paper is to improve the accuracy of credit assessment models.

Design/methodology/approach

In this paper, three stages are used to improve the classification performance of LSTM, so that financial institutions can more accurately identify borrowers at risk of default. In the first stage, the K-Means-SMOTE algorithm is used to eliminate the imbalance within the class. In the second stage, ResNet is used for feature extraction, and a two-layer LSTM is then used for learning to strengthen the ability of neural networks to mine and utilize deep information. Finally, the model performance is improved by using the IDWPSO algorithm for optimization when debugging the neural network.

Findings

On two unbalanced datasets (category ratios of 700:1 and 3:1 respectively), the multi-stage improved model was compared with ten other models using accuracy, precision, specificity, recall, G-measure, F-measure and the nonparametric Wilcoxon test. It was demonstrated that the multi-stage improved model showed a more significant advantage in evaluating the imbalanced credit dataset.
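To see why the evaluation relies on several measures rather than accuracy alone, the following sketch computes them from confusion-matrix counts. It is an illustration, not the paper's evaluation code: the function name `imbalance_metrics` is hypothetical, and the G-measure is assumed here to be the geometric mean of recall and specificity, which may differ from the definition used in the paper.

```python
import math

def imbalance_metrics(tp, fp, tn, fn):
    """Compute common classification measures from confusion-matrix counts."""
    recall = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    f_measure = (2 * precision * recall / (precision + recall)
                 if precision + recall else 0.0)
    # Assumed definition: geometric mean of recall and specificity.
    g_measure = math.sqrt(recall * specificity)
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "specificity": specificity,
        "f_measure": f_measure,
        "g_measure": g_measure,
    }

# A trivial model that predicts "no default" for every loan at a 700:1 ratio.
trivial = imbalance_metrics(tp=0, fp=0, tn=700, fn=1)
```

At a 700:1 ratio, the trivial model scores above 99% accuracy while its recall, F-measure and G-measure are all zero, which is why imbalance-aware measures are reported.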

Originality/value

In this paper, the parameters of the ResNet-LSTM hybrid neural network, which can fully mine and utilize the deep information, are tuned by an innovative intelligent optimization algorithm to strengthen the classification performance of the model.

Details

Kybernetes, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0368-492X

Open Access
Article
Publication date: 9 December 2022

Rui Wang, Shunjie Zhang, Shengqiang Liu, Weidong Liu and Ao Ding

Abstract

Purpose

The purpose is to use a generative adversarial network (GAN) to solve the problem of sample augmentation for imbalanced bearing fault data sets, and to use an improved residual network to improve the diagnostic accuracy of the intelligent bearing fault diagnosis model in environments with high signal noise.

Design/methodology/approach

A bearing vibration data generation model based on conditional GAN (CGAN) framework is proposed. The method generates data based on the adversarial mechanism of GANs and uses a small number of real samples to generate data, thereby effectively expanding imbalanced data sets. Combined with the data augmentation method based on CGAN, a fault diagnosis model of rolling bearing under the condition of data imbalance based on CGAN and improved residual network with attention mechanism is proposed.

Findings

The method proposed in this paper is verified on the Western Reserve data set and the truck bearing test bench data set. The results show that the CGAN-based data generation method can form a high-quality augmented data set, and that the diagnostic model based on CGAN and the improved residual network with attention mechanism achieves better diagnostic accuracy on samples with a low signal-to-noise ratio.

Originality/value

A bearing vibration data generation model based on CGAN framework is proposed. The method generates data based on the adversarial mechanism of GAN and uses a small number of real samples to generate data, thereby effectively expanding imbalanced data sets. Combined with the data augmentation method based on CGAN, a fault diagnosis model of rolling bearing under the condition of data imbalance based on CGAN and improved residual network with attention mechanism is proposed.

Details

Smart and Resilient Transportation, vol. 5 no. 1
Type: Research Article
ISSN: 2632-0487

Book part
Publication date: 6 July 2004

Fathi Fakhfakh

Abstract

This paper uses an unbalanced panel of 129 French firms over the period 1981–1991 to test the effects of two participatory schemes – profit sharing and employee share ownership – on voluntary quits. The effects of sharing schemes on productivity are well documented and most studies show positive and significant effects on productivity but their effects on quits have been less studied. This paper is the first French study looking at the effects of profit sharing and employee share ownership on quits. Our empirical investigation shows that employee share ownership reduces voluntary quits significantly whereas pure profit sharing has no significant effect.

Details

Employee Participation, Firm Performance and Survival
Type: Book
ISBN: 978-0-76231-114-9

Article
Publication date: 5 May 2015

Li Zhou and Calum G. Turvey

Abstract

Purpose

The purpose of this paper is to investigate the linkages between climate change, income dynamics and nutrition intake in rural China.

Design/methodology/approach

Using a system of simultaneous equations in a three-stage least squares model instrumented with carbohydrates, fats, proteins and farm income, the authors generally found that the greatest impact on nutrition would come from changes in temperature.

Findings

The authors do not find that modest changes in precipitation affect nutrient intake, but extreme events such as drought do. Furthermore, the authors found a strong income effect, and this income effect is opposite to the heating effect. This may suggest that large swings in nutrient intake brought about by climate change may be counteracted by equivalent increases in income. The authors also found, in terms of general measures of elasticity, that market effects, especially the price of meats, can affect carbohydrate, fat and protein intake as much as global warming.

Originality/value

The authors believe that three aspects of this manuscript will make it interesting. First, in the short term, poorer households would be the most vulnerable and sensitive to climate change. However, in the long term, all households in rural China appear able to deal with changing climatic conditions through adaptation. Second, the authors do not find evidence of a poverty nutrition trap in rural China. Third, the results also indicate that the nutrition intake of households in rural China is more prone to gradual changes than to extreme events.

Details

China Agricultural Economic Review, vol. 7 no. 2
Type: Research Article
ISSN: 1756-137X

Article
Publication date: 29 November 2021

Ziming Zeng, Tingting Li, Shouqiang Sun, Jingjing Sun and Jie Yin

Abstract

Purpose

Twitter fake accounts are bot accounts created by third-party organizations to influence public opinion, spread commercial propaganda or impersonate others. Identifying bot accounts effectively helps the public judge disseminated information accurately. However, in practice, manually labeling Twitter accounts is expensive and inefficient, and the labeled data are usually unbalanced across classes. To this end, the authors propose a novel framework to solve these problems.

Design/methodology/approach

In the proposed framework, the authors introduce the concept of semi-supervised self-training learning and apply it to a real Twitter account data set from Kaggle. Specifically, the authors first train the classifier on the initial small amount of labeled account data, then use the trained classifier to automatically label large-scale unlabeled account data. Next, they iteratively select high-confidence instances from the unlabeled data to expand the labeled data. Finally, an expanded Twitter account training set is obtained. Notably, the resampling technique is integrated into the self-training process, and the data classes are balanced at the initial stage of the self-training iteration.
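The self-training loop described above can be sketched in a few lines. This is a toy illustration under stated assumptions, not the authors' framework: a nearest-centroid classifier stands in for the base classifiers, random oversampling stands in for the resampling technique, and all function names, the confidence formula, the threshold and the sample points are hypothetical.

```python
import math
import random

def fit(labeled):
    """Nearest-centroid model: one mean feature vector per class."""
    by_class = {}
    for x, y in labeled:
        by_class.setdefault(y, []).append(x)
    return {y: [sum(col) / len(xs) for col in zip(*xs)]
            for y, xs in by_class.items()}

def predict(model, x):
    """Return (label, confidence); assumes at least two classes."""
    ranked = sorted((math.dist(c, x), y) for y, c in model.items())
    (d_best, label), (d_next, _) = ranked[0], ranked[1]
    confidence = 1.0 if d_best + d_next == 0 else d_next / (d_best + d_next)
    return label, confidence

def oversample(labeled, rng):
    """Duplicate random minority examples until all classes are equal in size."""
    by_class = {}
    for x, y in labeled:
        by_class.setdefault(y, []).append(x)
    target = max(len(xs) for xs in by_class.values())
    balanced = []
    for y, xs in by_class.items():
        extra = [rng.choice(xs) for _ in range(target - len(xs))]
        balanced.extend((x, y) for x in xs + extra)
    return balanced

def self_train(labeled, unlabeled, threshold=0.8, max_rounds=10, seed=0):
    rng = random.Random(seed)
    labeled = oversample(list(labeled), rng)  # balance classes at the start
    pool = list(unlabeled)
    for _ in range(max_rounds):
        model = fit(labeled)
        confident, rest = [], []
        for x in pool:
            y, conf = predict(model, x)
            (confident if conf >= threshold else rest).append((x, y))
        if not confident:
            break  # nothing left that the model is sure about
        labeled.extend(confident)  # pseudo-labels join the training set
        pool = [x for x, _ in rest]
    return fit(labeled), labeled, pool

# Hypothetical two-feature accounts: three labeled humans, one labeled bot.
seed_set = [([0.0, 0.0], "human"), ([0.2, 0.1], "human"),
            ([0.1, 0.3], "human"), ([5.0, 5.0], "bot")]
unlabeled_accounts = [[0.1, 0.1], [4.8, 5.1], [5.2, 4.9], [2.5, 2.6]]
model, expanded, remaining = self_train(seed_set, unlabeled_accounts)
```

In this toy run, the three accounts close to a class centroid are pseudo-labeled in the first round, while the ambiguous point midway between the classes stays unlabeled because its confidence never reaches the threshold.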

Findings

The proposed framework effectively improves labeling efficiency and reduces the influence of class imbalance. It shows excellent identification results on six different base classifiers, especially for the initial small-scale labeled Twitter accounts.

Originality/value

This paper provides novel insights into identifying Twitter fake accounts. First, the authors take the lead in introducing a self-training method to automatically label Twitter accounts in a semi-supervised setting. Second, the resampling technique is integrated into the self-training process to effectively reduce the influence of class imbalance on identification performance.

Details

Data Technologies and Applications, vol. 56 no. 3
Type: Research Article
ISSN: 2514-9288

Book part
Publication date: 1 September 2021

Son Nguyen, Phyllis Schumacher, Alan Olinsky and John Quinn

Abstract

We study the performance of various predictive models, including decision trees, random forests, neural networks, and linear discriminant analysis, on an imbalanced data set of home loan applications. During the process, we propose our own undersampling algorithm to cope with the issues created by the imbalance of the data. Our technique is shown to work competitively against popular resampling techniques such as random oversampling, undersampling, synthetic minority oversampling technique (SMOTE), and random oversampling examples (ROSE). We also investigate the relation between the true positive rate, true negative rate, and the imbalance of the data.

Open Access
Article
Publication date: 10 August 2022

Jie Ma, Zhiyuan Hao and Mo Hu

Abstract

Purpose

The density peak clustering algorithm (DP) identifies cluster centers by two parameters, i.e. ρ value (local density) and δ value (the distance between a point and another point with a higher ρ value). According to the center-identifying principle of the DP, the potential cluster centers should have a higher ρ value and a higher δ value than other points. However, this principle may prevent the DP from identifying some categories with multiple centers or centers in lower-density regions. In addition, the improper assignment strategy of the DP could cause a wrong assignment result for the non-center points. This paper aims to address the aforementioned issues and improve the clustering performance of the DP.
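The two quantities that the base DP relies on can be computed as in the sketch below. It illustrates only the baseline center-identifying principle, not the TMsDP method proposed in the paper. The cutoff-count density, the handling of the densest point, the product ρ·δ used for ranking, the cutoff value and the sample points are assumptions; ties in ρ are not treated specially.

```python
import math

def density_peak_scores(points, d_c):
    """Return (rho, delta) lists for the given points.

    rho[i]:   number of other points closer than the cutoff distance d_c.
    delta[i]: distance to the nearest point with a higher rho; for the
              densest point, the distance to the farthest point instead.
    """
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]
    rho = [sum(1 for j in range(n) if j != i and dist[i][j] < d_c)
           for i in range(n)]
    delta = []
    for i in range(n):
        higher = [dist[i][j] for j in range(n) if rho[j] > rho[i]]
        delta.append(min(higher) if higher else max(dist[i]))
    return rho, delta

# Hypothetical data: a dense group of five points and a sparser group of three.
sample_points = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5),
                 (10, 10), (10, 11), (11, 10)]
rho, delta = density_peak_scores(sample_points, d_c=1.2)

# Candidate centers: points where both rho and delta are large.
centers = sorted(range(len(sample_points)),
                 key=lambda i: rho[i] * delta[i], reverse=True)[:2]
```

Here the center of the dense group (index 4) and the corner point of the sparser group (index 5) stand out, because each combines a relatively high local density with a large distance to any denser point.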

Design/methodology/approach

First, to identify as many potential cluster centers as possible, the authors construct a point-domain by introducing the pinhole imaging strategy to extend the searching range of the potential cluster centers. Second, they design different novel calculation methods for calculating the domain distance, point-domain density and domain similarity. Third, they adopt domain similarity to achieve the domain merging process and optimize the final clustering results.

Findings

Experimental results on 12 synthetic data sets and 12 real-world data sets show that two-stage density peak clustering based on multi-strategy optimization (TMsDP) outperforms the DP and other state-of-the-art algorithms.

Originality/value

The authors propose a novel DP-based clustering method, i.e. TMsDP, and transform the relationship between points into a relationship between domains to further optimize the clustering performance of the DP.

Details

Data Technologies and Applications, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9288

Book part
Publication date: 19 December 2012

Hild Marte Bjørnsen and Ashok K. Mishra

Abstract

The objective of this study is to investigate the simultaneity between farm couples’ decisions on labor allocation and production efficiency. Using an unbalanced panel data set of Norwegian farm households (1989–2008), we estimate off-farm labor supply of married farm couples and farm efficiency in a three-equation system of jointly determined endogenous variables. We address the issue of latent heterogeneity between households. We solve the problem by two-stage OLS and GLS estimation where state dependence is accounted for in the reduced form equations. We compare the results against simpler model specifications where we suppress censoring of off-farm labor hours and endogeneity of regressors, respectively. In the reduced form specification, a considerable number of parameters are statistically significant. The Davidson–MacKinnon test of exogeneity confirms that both operator and spouse's off-farm labor supply should be treated as endogenous in estimating farming efficiency. The parameter estimates seem robust across model specifications. Off-farm labor supply of farm operators and spouses is jointly determined. Off-farm work by farm operators and spouses positively affects farming efficiency. Farming efficiency increases with operator's age, farm size, agricultural subsidies, and share of current investment to total farm capital stock.

Details

Essays in Honor of Jerry Hausman
Type: Book
ISBN: 978-1-78190-308-7
