Search results
1 – 10 of 198Reza Edris Abadi, Mohammad Javad Ershadi and Seyed Taghi Akhavan Niaki
The overall goal of the data mining process is to extract information from an extensive data set and make it understandable for further use. When working with large volumes of…
Abstract
Purpose
The overall goal of the data mining process is to extract information from an extensive data set and make it understandable for further use. When working with large volumes of unstructured data in research information systems, it is necessary to divide the information into logical groupings after examining their quality before attempting to analyze it. On the other hand, data quality results are valuable resources for defining quality excellence programs of any information system. Hence, the purpose of this study is to discover and extract knowledge to evaluate and improve data quality in research information systems.
Design/methodology/approach
Clustering in data analysis and exploiting the outputs allows practitioners to gain an in-depth and extensive look at their information to form some logical structures based on what they have found. In this study, data extracted from an information system are used in the first stage. Then, the data quality results are classified into an organized structure based on data quality dimension standards. Next, clustering algorithms (K-Means), density-based clustering (density-based spatial clustering of applications with noise [DBSCAN]) and hierarchical clustering (balanced iterative reducing and clustering using hierarchies [BIRCH]) are applied to compare and find the most appropriate clustering algorithms in the research information system.
Findings
This paper showed that quality control results of an information system could be categorized through well-known data quality dimensions, including precision, accuracy, completeness, consistency, reputation and timeliness. Furthermore, among different well-known clustering approaches, the BIRCH algorithm of hierarchical clustering methods performs better in data clustering and gives the highest silhouette coefficient value. Next in line is the DBSCAN method, which performs better than the K-Means method.
Research limitations/implications
In the data quality assessment process, the discrepancies identified and the lack of proper classification for inconsistent data have led to unstructured reports, making the statistical analysis of qualitative metadata problems difficult and thus impossible to root out the observed errors. Therefore, in this study, the evaluation results of data quality have been categorized into various data quality dimensions, based on which multiple analyses have been performed in the form of data mining methods.
Originality/value
Although several pieces of research have been conducted to assess data quality results of research information systems, knowledge extraction from obtained data quality scores is a crucial work that has rarely been studied in the literature. Besides, clustering in data quality analysis and exploiting the outputs allows practitioners to gain an in-depth and extensive look at their information to form some logical structures based on what they have found.
Details
Keywords
Faruk Bulut, Melike Bektaş and Abdullah Yavuz
In this study, supervision and control of the possible problems among people over a large area with a limited number of drone cameras and security staff is established.
Abstract
Purpose
In this study, supervision and control of the possible problems among people over a large area with a limited number of drone cameras and security staff is established.
Design/methodology/approach
These drones, namely unmanned aerial vehicles (UAVs) will be adaptively and automatically distributed over the crowds to control and track the communities by the proposed system. Since crowds are mobile, the design of the drone clusters will be simultaneously re-organized according to densities and distributions of people. An adaptive and dynamic distribution and routing mechanism of UAV fleets for crowds is implemented to control a specific given region. The nine popular clustering algorithms have been used and tested in the presented mechanism to gain better performance.
Findings
The nine popular clustering algorithms have been used and tested in the presented mechanism to gain better performance. An outperformed clustering performance from the aggregated model has been received when compared with a singular clustering method over five different test cases about crowds of human distributions. This study has three basic components. The first one is to divide the human crowds into clusters. The second one is to determine an optimum route of UAVs over clusters. The last one is to direct the most appropriate security personnel to the events that occurred.
Originality/value
This study has three basic components. The first one is to divide the human crowds into clusters. The second one is to determine an optimum route of UAVs over clusters. The last one is to direct the most appropriate security personnel to the events that occurred.
Details
Keywords
Greta Krešić, Elena Dujmić, Dina Lončarić, Snježana Zrnčić, Nikolina Liović and Jelka Pleadin
This study aims to investigate the influence of sociodemographic characteristics, perceived risks, health and nutritional motives and taste preference on at-home fish consumption.
Abstract
Purpose
This study aims to investigate the influence of sociodemographic characteristics, perceived risks, health and nutritional motives and taste preference on at-home fish consumption.
Design/methodology/approach
Data were collected from a nationally representative sample of people responsible for food purchasing in households, using the CAWI (computer-aided web interviewing) method. The eligible study sample comprised 977 participants in Croatia and 967 in Italy, who reported fishery products consumption in the previous 12 months. A questionnaire was used to examine sociodemographic characteristics, fish consumption frequency and factors affecting fish consumption. Determinants of white and fatty fish consumption were estimated with ordered probit models, along with marginal effects for each factor in the models.
Findings
Common positive determinants of white and fatty fish consumption in Croatia and Italy were health and nutritional motives (p < 0.001, p = 0.001, p < 0.001, p = 0.010), taste preference (p < 0.001, p < 0.001, p = 0.001, p = 0.001) and maritime nature of the living region. The common negative determinant of white and fatty fish consumption in Croatia and Italy was financial risk (p < 0.001, p < 0.001, p < 0.001, p = 0.005). The country-specific positive determinant in Croatia was the number of household members (p < 0.001), while negative determinants for white and fatty fish were functional risk (p = 0.004, p = 0.013), number of children (p = 0.030, p = 0.001) and female gender (for fatty fish) (p = 0.028). In Italy, older age negatively affected (p < 0.001) fish consumption, while number of children (p = 0.009) and household income positively affected white fish consumption.
Originality/value
An adequate probabilistic model of national representative samples ensures credibility of results. Policy and marketing activities are proposed that can encourage higher fish consumption.
Details
Keywords
Tingting Tian, Hongjian Shi, Ruhui Ma and Yuan Liu
For privacy protection, federated learning based on data separation allows machine learning models to be trained on remote devices or in isolated data devices. However, due to the…
Abstract
Purpose
For privacy protection, federated learning based on data separation allows machine learning models to be trained on remote devices or in isolated data devices. However, due to the limited resources such as bandwidth and power of local devices, communication in federated learning can be much slower than in local computing. This study aims to improve communication efficiency by reducing the number of communication rounds and the size of information transmitted in each round.
Design/methodology/approach
This paper allows each user node to perform multiple local trainings, then upload the local model parameters to a central server. The central server updates the global model parameters by weighted averaging the parameter information. Based on this aggregation, user nodes first cluster the parameter information to be uploaded and then replace each value with the mean value of its cluster. Considering the asymmetry of the federated learning framework, adaptively select the optimal number of clusters required to compress the model information.
Findings
While maintaining the loss convergence rate similar to that of federated averaging, the test accuracy did not decrease significantly.
Originality/value
By compressing uplink traffic, the work can improve communication efficiency on dynamic networks with limited resources.
Details
Keywords
Thalia Anthony, Juanita Sherwood, Harry Blagg and Kieran Tranter
Florent Govaerts and Svein Ottar Olsen
This study aimed to identify and profile segments of seaweed consumers in the United Kingdom.
Abstract
Purpose
This study aimed to identify and profile segments of seaweed consumers in the United Kingdom.
Design/methodology/approach
Hierarchical k-means cluster analysis was used to identify consumer segments based on consumers' self-identity and environmental values. In addition, the study used subjective knowledge, intentions and consumption to profile different consumer segments. The data were collected in 2022 through a consumer survey with a representative sample from the United Kingdom (n = 1,110).
Findings
Cluster analysis segmented consumers into three groups: progressive (39%), conservative (33%) and egoistic (28%). The progressive segment was most likely to consume seaweed food products. Consumers in the progressive segment identify themselves as food innovative and healthy; they also highly value the environment and their pleasure. Conservative and egoistic consumers were significantly less likely to consume seaweed food products.
Practical implications
The results suggest that public policy officers and marketers promote seaweed food products by emphasizing biospheric values for innovative (younger) consumers, as well as seaweed’s good taste and nutritional/health qualities.
Originality/value
This study identifies and examines the profiles and characteristics of seaweed consumers based on their values and self-identity. Through this research, the authors have discovered how environmental values and self-identity can effectively group consumers into homogeneous segments. Moreover, the authors have identified a specific consumer group in the UK that is more likely to consume seaweed food products.
Details
Keywords
Entrepreneurship is a prominent area of inquiry which is enriched by an ample literature base and challenged by definitional deficiencies. Over the years, multiple perspectives of…
Abstract
Entrepreneurship is a prominent area of inquiry which is enriched by an ample literature base and challenged by definitional deficiencies. Over the years, multiple perspectives of entrepreneurship have emerged and a holistic approach to entrepreneurship has been proposed. This can facilitate the continued enlargement of the entrepreneurship field and allow for interdisciplinary research within the African region. This chapter contributes to the literature on entrepreneurship in developing economies by providing an extensive review of the various approaches that entrepreneurship has been conceptualised. Nine themes are explored: the great person, economic perspective, psychological perspective, sociological perspective, behavioural perspective, management, intrapreneurship, cognitive perspective and leadership perspective. This is followed by an examination of entrepreneurship as a process, as a new venture creation and as an art of opportunity recognition and exploitation. In the last section of this chapter, a clarion call is made for more African scholarship and research in the field of entrepreneurship.
Details
Keywords
Prajakta Thakare and Ravi Sankar V.
Agriculture is the backbone of a country, contributing more than half of the sector of economy throughout the world. The need for precision agriculture is essential in evaluating…
Abstract
Purpose
Agriculture is the backbone of a country, contributing more than half of the sector of economy throughout the world. The need for precision agriculture is essential in evaluating the conditions of the crops with the aim of determining the proper selection of pesticides. The conventional method of pest detection fails to be stable and provides limited accuracy in the prediction. This paper aims to propose an automatic pest detection module for the accurate detection of pests using the hybrid optimization controlled deep learning model.
Design/methodology/approach
The paper proposes an advanced pest detection strategy based on deep learning strategy through wireless sensor network (WSN) in the agricultural fields. Initially, the WSN consisting of number of nodes and a sink are clustered as number of clusters. Each cluster comprises a cluster head (CH) and a number of nodes, where the CH involves in the transfer of data to the sink node of the WSN and the CH is selected using the fractional ant bee colony optimization (FABC) algorithm. The routing process is executed using the protruder optimization algorithm that helps in the transfer of image data to the sink node through the optimal CH. The sink node acts as the data aggregator and the collection of image data thus obtained acts as the input database to be processed to find the type of pest in the agricultural field. The image data is pre-processed to remove the artifacts present in the image and the pre-processed image is then subjected to feature extraction process, through which the significant local directional pattern, local binary pattern, local optimal-oriented pattern (LOOP) and local ternary pattern (LTP) features are extracted. The extracted features are then fed to the deep-convolutional neural network (CNN) in such a way to detect the type of pests in the agricultural field. The weights of the deep-CNN are tuned optimally using the proposed MFGHO optimization algorithm that is developed with the combined characteristics of navigating search agents and the swarming search agents.
Findings
The analysis using insect identification from habitus image Database based on the performance metrics, such as accuracy, specificity and sensitivity, reveals the effectiveness of the proposed MFGHO-based deep-CNN in detecting the pests in crops. The analysis proves that the proposed classifier using the FABC+protruder optimization-based data aggregation strategy obtains an accuracy of 94.3482%, sensitivity of 93.3247% and the specificity of 94.5263%, which is high as compared to the existing methods.
Originality/value
The proposed MFGHO optimization-based deep-CNN is used for the detection of pest in the crop fields to ensure the better selection of proper cost-effective pesticides for the crop fields in such a way to increase the production. The proposed MFGHO algorithm is developed with the integrated characteristic features of navigating search agents and the swarming search agents in such a way to facilitate the optimal tuning of the hyperparameters in the deep-CNN classifier for the detection of pests in the crop fields.
Details