Search results

1 – 10 of over 91000
Article
Publication date: 9 May 2016

Shi Cheng, Qingyu Zhang and Quande Qin

The quality and quantity of data are vital for the effectiveness of problem solving. Nowadays, big data analytics, which require managing an immense amount of data rapidly, has…

6133

Abstract

Purpose

The quality and quantity of data are vital for the effectiveness of problem solving. Nowadays, big data analytics, which require managing an immense amount of data rapidly, has attracted more and more attention. It is a new research area in the field of information processing techniques. It faces the big challenges and difficulties of a large amount of data, high dimensionality, and dynamical change of data. However, such issues might be addressed with the help from other research fields, e.g., swarm intelligence (SI), which is a collection of nature-inspired searching techniques. The paper aims to discuss these issues.

Design/methodology/approach

In this paper, the potential application of SI in big data analytics is analyzed. The correspondence and association between big data analytics and SI techniques are discussed. As an example of the application of the SI algorithms in the big data processing, a commodity routing system in a port in China is introduced. Another example is the economic load dispatch problem in the planning of a modern power system.

Findings

The characteristics of big data include volume, variety, velocity, veracity, and value. In the SI algorithms, these features can be, respectively, represented as large scale, high dimensions, dynamical, noise/surrogates, and fitness/objective problems, which have been effectively solved.

Research limitations/implications

In current research, the example problem of the port is formulated but not solved yet given the ongoing nature of the project. The example could be understood as advanced IT or data processing technology, however, its underlying mechanism could be the SI algorithms. This paper is the first step in the research to utilize the SI algorithm to a big data analytics problem. The future research will compare the performance of the method and fit it in a dynamic real system.

Originality/value

Based on the combination of SI and data mining techniques, the authors can have a better understanding of the big data analytics problems, and design more effective algorithms to solve real-world big data analytical problems.

Details

Industrial Management & Data Systems, vol. 116 no. 4
Type: Research Article
ISSN: 0263-5577

Keywords

Article
Publication date: 6 January 2022

Ahmad Latifian

Big data has posed problems for businesses, the Information Technology (IT) sector and the science community. The problems posed by big data can be effectively addressed using…

Abstract

Purpose

Big data has posed problems for businesses, the Information Technology (IT) sector and the science community. The problems posed by big data can be effectively addressed using cloud computing and associated distributed computing technology. Cloud computing and big data are two significant past-year problems that allow high-efficiency and competitive computing tools to be delivered as IT services. The paper aims to examine the role of the cloud as a tool for managing big data in various aspects to help businesses.

Design/methodology/approach

This paper delivers solutions in the cloud for storing, compressing, analyzing and processing big data. Hence, articles were divided into four categories: articles on big data storage, articles on big data processing, articles on analyzing and finally, articles on data compression in cloud computing. This article is based on a systematic literature review. Also, it is based on a review of 19 published papers on big data.

Findings

From the results, it can be inferred that cloud computing technology has features that can be useful for big data management. Challenging issues are raised in each section. For example, in storing big data, privacy and security issues are challenging.

Research limitations/implications

There were limitations to this systematic review. The first limitation is that only English articles were reviewed. Also, articles that matched the keywords were used. Finally, in this review, authoritative articles were reviewed, and slides and tutorials were avoided.

Practical implications

The research presents new insight into the business value of cloud computing in interfirm collaborations.

Originality/value

Previous research has often examined other aspects of big data in the cloud. This article takes a new approach to the subject. It allows big data researchers to comprehend the various aspects of big data management in the cloud. In addition, setting an agenda for future research saves time and effort for readers searching for topics within big data.

Details

Kybernetes, vol. 51 no. 6
Type: Research Article
ISSN: 0368-492X

Keywords

Article
Publication date: 21 December 2021

Laouni Djafri

This work can be used as a building block in other settings such as GPU, Map-Reduce, Spark or any other. Also, DDPML can be deployed on other distributed systems such as P2P…

384

Abstract

Purpose

This work can be used as a building block in other settings such as GPU, Map-Reduce, Spark or any other. Also, DDPML can be deployed on other distributed systems such as P2P networks, clusters, clouds computing or other technologies.

Design/methodology/approach

In the age of Big Data, all companies want to benefit from large amounts of data. These data can help them understand their internal and external environment and anticipate associated phenomena, as the data turn into knowledge that can be used for prediction later. Thus, this knowledge becomes a great asset in companies' hands. This is precisely the objective of data mining. But with the production of a large amount of data and knowledge at a faster pace, the authors are now talking about Big Data mining. For this reason, the authors’ proposed works mainly aim at solving the problem of volume, veracity, validity and velocity when classifying Big Data using distributed and parallel processing techniques. So, the problem that the authors are raising in this work is how the authors can make machine learning algorithms work in a distributed and parallel way at the same time without losing the accuracy of classification results. To solve this problem, the authors propose a system called Dynamic Distributed and Parallel Machine Learning (DDPML) algorithms. To build it, the authors divided their work into two parts. In the first, the authors propose a distributed architecture that is controlled by Map-Reduce algorithm which in turn depends on random sampling technique. So, the distributed architecture that the authors designed is specially directed to handle big data processing that operates in a coherent and efficient manner with the sampling strategy proposed in this work. This architecture also helps the authors to actually verify the classification results obtained using the representative learning base (RLB). In the second part, the authors have extracted the representative learning base by sampling at two levels using the stratified random sampling method. This sampling method is also applied to extract the shared learning base (SLB) and the partial learning base for the first level (PLBL1) and the partial learning base for the second level (PLBL2). The experimental results show the efficiency of our solution that the authors provided without significant loss of the classification results. Thus, in practical terms, the system DDPML is generally dedicated to big data mining processing, and works effectively in distributed systems with a simple structure, such as client-server networks.

Findings

The authors got very satisfactory classification results.

Originality/value

DDPML system is specially designed to smoothly handle big data mining classification.

Details

Data Technologies and Applications, vol. 56 no. 4
Type: Research Article
ISSN: 2514-9288

Keywords

Abstract

Details

Lean Six Sigma in Higher Education
Type: Book
ISBN: 978-1-78769-929-8

Article
Publication date: 19 October 2015

Kun Chen, Xin Li and Huaiqing Wang

Although big data analytics has reaped great business rewards, big data system design and integration still face challenges resulting from the demanding environment, including…

2720

Abstract

Purpose

Although big data analytics has reaped great business rewards, big data system design and integration still face challenges resulting from the demanding environment, including challenges involving variety, uncertainty, and complexity. These characteristics in big data systems demand flexible and agile integration architectures. Furthermore, a formal model is needed to support design and verification. The purpose of this paper is to resolve the two problems with a collective intelligence (CI) model.

Design/methodology/approach

In the conceptual CI framework as proposed by Schut (2010), a CI design should be comprised of a general model, which has formal form for verification and validation, and also a specific model, which is an implementable system architecture. After analyzing the requirements of system integration in big data environments, the authors apply the CI framework to resolve the integration problem. In the model instantiation, the authors use multi-agent paradigm as the specific model, and the hierarchical colored Petri Net (PN) as the general model.

Findings

First, multi-agent paradigm is a good implementation for reuse and integration of big data analytics modules in an agile and loosely coupled method. Second, the PN models provide effective simulation results in the system design period. It gives advice on business process design and workload balance control. Third, the CI framework provides an incrementally build and deployed method for system integration. It is especially suitable to the dynamic data analytics environment. These findings have both theoretical and managerial implications.

Originality/value

In this paper, the authors propose a CI framework, which includes both practical architectures and theoretical foundations, to solve the system integration problem in big data environment. It provides a new point of view to dynamically integrate large-scale modules in an organization. This paper also has practical suggestions for Chief Technical Officers, who want to employ big data technologies in their companies.

Details

Industrial Management & Data Systems, vol. 115 no. 9
Type: Research Article
ISSN: 0263-5577

Keywords

Book part
Publication date: 19 September 2019

Emre Soyer, Koen Pauwels and Steven H. Seggie

While Big Data offer marketing managers information that is high in volume, variety, velocity, and veracity (the 4Vs), these features wouldn’t necessarily improve their…

Abstract

While Big Data offer marketing managers information that is high in volume, variety, velocity, and veracity (the 4Vs), these features wouldn’t necessarily improve their decision-making. Managers would still be vulnerable to confirmation bias, control illusions, communication problems, and confidence issues (the 4Cs). The authors argue that traditional remedies for such biases don’t go far enough and propose a lean start-up approach to data-based learning in marketing management. Specifically, they focus on the marketing analytics component of Big Data and how adaptations of the lean start-up methodology can be used in some combination with such analytics to help marketing managers improve their decision-making and innovation process. Beyond the often discussed technical obstacles and operational costs associated with handling Big Data, this chapter contributes by analyzing the various learning and decision-making problems that can emerge once the 4Vs of Big Data have materialized.

Details

Marketing in a Digital World
Type: Book
ISBN: 978-1-78756-339-1

Keywords

Book part
Publication date: 30 September 2020

Arindam Chakrabarty and Uday Sankar Das

History teaches us that the glorious victory of mankind across the centuries was accomplished through the successful use of information. The gigantic progressions and rapid…

Abstract

History teaches us that the glorious victory of mankind across the centuries was accomplished through the successful use of information. The gigantic progressions and rapid transformation of human societies have endorsed legitimacy of abundant data, multiple dynamic variables & critical complexities which reinforce the academia and researchers for understanding and pioneering into ‘Big Data Analytics (BDA)’. Health is one of the vibrant socio-economic variables which have correlations with other aspects of life, that is, education, poverty, income, etc. In fact, there are unending debates whether health can be a basic input for a holistic developmental process or it is the outcome of various developmental factors. BDAs are being used across various sectors of the economy. The developed nations have been yielding most feasible solutions using various forms of analysis of big data. Astronomical research has been using a large quantum of data for accomplishing various satellite projects, space technology, and numerous space missions for the astronaut. With the advent of fourth industrial revolution, the world community has been thriving toward a new age technological innovations that include artificial intelligence, machine learning, block chain technology, etc., which act a pivotal tool for BDAs. In the health sector, application of BDAs has been attempted and experimented in the developed nations which have resulted prolific and sustainable solutions to the most typical cumbersome problems. This chapter has demonstrated how BDAs can make progressive reforms in the Indian Health sector outlining the present status and emerging challenges.

Details

Big Data Analytics and Intelligence: A Perspective for Health Care
Type: Book
ISBN: 978-1-83909-099-8

Keywords

Article
Publication date: 12 June 2017

Chad Laux, Na Li, Corey Seliger and John Springer

The purpose of this paper is to develop a framework for utilizing Six Sigma (SS) principles and Big Data analytics at a US public university for the improvement of student…

2399

Abstract

Purpose

The purpose of this paper is to develop a framework for utilizing Six Sigma (SS) principles and Big Data analytics at a US public university for the improvement of student success. This research utilizes findings from the Gallup index to identify performance factors of higher education. The goal is to offer a reimagined SS DMAIC methodology that incorporates Big Data principles.

Design/methodology/approach

The authors utilize a conceptual research design methodology based upon theory building consisting of discovery, description, explanation of the disciplines of SS and Big Data.

Findings

The authors have found that the interdisciplinary approach to SS and Big Data may be grounded in a framework that reimagines the define, measure, analyze, improve and control (DMAIC) methodology that incorporates Big Data principles. The authors offer propositions of SS DMAIC to be theory tested in subsequent study and offer the practitioner managing the performance of higher education institutions (HEIs) indicators and examples for managing the student success mission of the organization.

Research limitations/implications

The study is limited to conceptual research design with regard to the SS and Big Data interdisciplinary research. For performance management, this study is limited to HEIs and non-FERPA student data. Implications of this study include a detailed framework for conducting SS Big Data projects.

Practical implications

Devising a more effective management approach for higher education needs to be based upon student success and performance indicators that accurately measure and support the higher education mission. A proactive approach should utilize the data rich environment being generated. The individual that is most successful in engaging and managing this effort will have the knowledge and skills that are found in both SS and Big Data.

Social implications

HEIs have historically been significant contributors to the development of meritocracy in democratic societies. Due to a variety of factors, HEIs, especially publicly funded institutions, have been under stress due to a reduction of public funding, resulting in more limited access to the public in which they serve.

Originality/value

This paper examines Big Data and SS in interdisciplinary effort, an important contribution to SS but lacking a conceptual foundation in the literature. Higher education, as an industry, lacks penetration and adoption of continuous improvement efforts, despite being under tremendous cost pressures and ripe for disruption.

Details

International Journal of Productivity and Performance Management, vol. 66 no. 5
Type: Research Article
ISSN: 1741-0401

Keywords

Article
Publication date: 22 February 2024

Ranjeet Kumar Singh

Although the challenges associated with big data are increasing, the question of the most suitable big data analytics (BDA) platform in libraries is always significant. The…

54

Abstract

Purpose

Although the challenges associated with big data are increasing, the question of the most suitable big data analytics (BDA) platform in libraries is always significant. The purpose of this study is to propose a solution to this problem.

Design/methodology/approach

The current study identifies relevant literature and provides a review of big data adoption in libraries. It also presents a step-by-step guide for the development of a BDA platform using the Apache Hadoop Ecosystem. To test the system, an analysis of library big data using Apache Pig, which is a tool from the Apache Hadoop Ecosystem, was performed. It establishes the effectiveness of Apache Hadoop Ecosystem as a powerful BDA solution in libraries.

Findings

It can be inferred from the literature that libraries and librarians have not taken the possibility of big data services in libraries very seriously. Also, the literature suggests that there is no significant effort made to establish any BDA architecture in libraries. This study establishes the Apache Hadoop Ecosystem as a possible solution for delivering BDA services in libraries.

Research limitations/implications

The present work suggests adapting the idea of providing various big data services in a library by developing a BDA platform, for instance, providing assistance to the researchers in understanding the big data, cleaning and curation of big data by skilled and experienced data managers and providing the infrastructural support to store, process, manage, analyze and visualize the big data.

Practical implications

The study concludes that Apache Hadoops’ Hadoop Distributed File System and MapReduce components significantly reduce the complexities of big data storage and processing, respectively, and Apache Pig, using Pig Latin scripting language, is very efficient in processing big data and responding to queries with a quick response time.

Originality/value

According to the study, there are significantly fewer efforts made to analyze big data from libraries. Furthermore, it has been discovered that acceptance of the Apache Hadoop Ecosystem as a solution to big data problems in libraries are not widely discussed in the literature, although Apache Hadoop is regarded as one of the best frameworks for big data handling.

Details

Digital Library Perspectives, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2059-5816

Keywords

Article
Publication date: 1 February 2016

Derina Holtzhausen

The purpose of this paper is to consider the threats and potential of Big Data for strategic communication. It explains the concepts of datafication and Big Data and establishes…

7603

Abstract

Purpose

The purpose of this paper is to consider the threats and potential of Big Data for strategic communication. It explains the concepts of datafication and Big Data and establishes the social and cultural context of Big Data from the way those constructing algorithms superimpose their value systems and cultural references onto the data. It links Big Data and strategic communication through the segmentation devices and strategies both use and propose discourse analysis as a valid method for the critique of Big Data. The importance of strategic communication for the public sphere suggests that Big Data can pose a serious threat to public discourse.

Design/methodology/approach

This is a conceptual and theoretical paper that first explains and interprets various new terms and concepts and then uses established theoretical approaches to analyze these phenomena.

Findings

The use of Big Data for the micro-segmentation of audiences establishes its relationship with strategic communication. Big Data analyses and algorithms are not neutral. Treating algorithms as language and communication allow them to be subjected to discourse analysis to expose underlying power relations for resistance strategies to emerge. Strategic communicators should guard the public sphere and take an activist stance against the potential harm of Big Data. That requires a seat at the institutional technology table and speaking out against discriminatory practices. However, Big Data can also greatly benefit society and improve discourse in the public sphere.

Research limitations/implications

There is not yet empirical data available on the impact of datafication on communication practice, which might be a problem well into the future. It also might be hard to do empirical research on its impact on practice and the public sphere. The heuristic value of this piece is that it laid down the theoretical foundations of the phenomena to be studied, which can in future be used for ethnographic research or qualitative studies. It might eventually be possible to follow personalized messages generated through datafication to study if they actually lead to behavior change in specific audience members.

Practical/implications

As guardians of the public sphere strategic communication practitioners have to educate themselves on the realities of Big Data and should consciously acquire a seat at the institutional technology table. Practitioners will need to be involved in decisions on how algorithms are formulated and who they target. This will require them to serve as activists to ensure social justice. They also will need to contribute to organizational transparency by making organizational information widely available and accessible through media bought, owned, and earned. Strategic communicators need to create a binary partnership with journalists of all kinds to secure the public sphere.

Social/implications

The paper exposes the role of algorithms in the construction of data and the extent to which algorithms are products of people who impose their own values and belief systems on them. Algorithms and the data they generate are subjective and value-laden. The concept of algorithms as language and communication and the use of Big Data for the segmentation of society for purposes of communication establish the connection between Big Data and strategic communication. The paper also exposes the potential for harm in the use of Big Data, as well as its potential for improving society and bringing about social justice.

Originality/value

The value of this paper is that it introduces the concept of datafication to communication studies and proposes theoretical foundations for the study of Big Data in the context of strategic communications. It provides a theoretical and social foundation for the inclusion of the public sphere in a definition of strategic communication and emphasizes strategic communicators’ commitment to the public sphere as more important than ever before. It highlights how communication practice and society can impact each other positively and negatively and that Big Data should not be the future of strategic communication but only a part of it.

Details

Journal of Communication Management, vol. 20 no. 1
Type: Research Article
ISSN: 1363-254X

Keywords

1 – 10 of over 91000