Search results
1 – 10 of over 12000Abstract
Purpose
Identifying the frontiers of a specific research field is one of the most basic tasks in bibliometrics and research published in leading conferences is crucial to the data mining research community, whereas few research studies have focused on it. The purpose of this study is to detect the intellectual structure of data mining based on conference papers.
Design/methodology/approach
This study takes the authoritative conference papers of the ranking 9 in the data mining field provided by Google Scholar Metrics as a sample. According to paper amount, this paper first detects the annual situation of the published documents and the distribution of the published conferences. Furthermore, from the research perspective of keywords, CiteSpace was used to dig into the conference papers to identify the frontiers of data mining, which focus on keywords term frequency, keywords betweenness centrality, keywords clustering and burst keywords.
Findings
Research showed that the research heat of data mining had experienced a linear upward trend during 2007 and 2016. The frontier identification based on the conference papers showed that there were five research hotspots in data mining, including clustering, classification, recommendation, social network analysis and community detection. The research contents embodied in the conference papers were also very rich.
Originality/value
This study detected the research frontier from leading data mining conference papers. Based on the keyword co-occurrence network, from four dimensions of keyword term frequency, betweeness centrality, clustering analysis and burst analysis, this paper identified and analyzed the research frontiers of data mining discipline from 2007 to 2016.
Details
Keywords
Zhiwen Pan, Jiangtian Li, Yiqiang Chen, Jesus Pacheco, Lianjun Dai and Jun Zhang
The General Society Survey(GSS) is a kind of government-funded survey which aims at examining the Socio-economic status, quality of life, and structure of contemporary society…
Abstract
Purpose
The General Society Survey(GSS) is a kind of government-funded survey which aims at examining the Socio-economic status, quality of life, and structure of contemporary society. GSS data set is regarded as one of the authoritative source for the government and organization practitioners to make data-driven policies. The previous analytic approaches for GSS data set are designed by combining expert knowledges and simple statistics. By utilizing the emerging data mining algorithms, we proposed a comprehensive data management and data mining approach for GSS data sets.
Design/methodology/approach
The approach are designed to be operated in a two-phase manner: a data management phase which can improve the quality of GSS data by performing attribute pre-processing and filter-based attribute selection; a data mining phase which can extract hidden knowledge from the data set by performing data mining analysis including prediction analysis, classification analysis, association analysis and clustering analysis.
Findings
According to experimental evaluation results, the paper have the following findings: Performing attribute selection on GSS data set can increase the performance of both classification analysis and clustering analysis; all the data mining analysis can effectively extract hidden knowledge from the GSS data set; the knowledge generated by different data mining analysis can somehow cross-validate each other.
Originality/value
By leveraging the power of data mining techniques, the proposed approach can explore knowledge in a fine-grained manner with minimum human interference. Experiments on Chinese General Social Survey data set are conducted at the end to evaluate the performance of our approach.
Details
Keywords
Nicola Castellano, Roberto Del Gobbo and Lorenzo Leto
The concept of productivity is central to performance management and decision-making, although it is complex and multifaceted. This paper aims to describe a methodology based on…
Abstract
Purpose
The concept of productivity is central to performance management and decision-making, although it is complex and multifaceted. This paper aims to describe a methodology based on the use of Big Data in a cluster analysis combined with a data envelopment analysis (DEA) that provides accurate and reliable productivity measures in a large network of retailers.
Design/methodology/approach
The methodology is described using a case study of a leading kitchen furniture producer. More specifically, Big Data is used in a two-step analysis prior to the DEA to automatically cluster a large number of retailers into groups that are homogeneous in terms of structural and environmental factors and assess a within-the-group level of productivity of the retailers.
Findings
The proposed methodology helps reduce the heterogeneity among the units analysed, which is a major concern in DEA applications. The data-driven factorial and clustering technique allows for maximum within-group homogeneity and between-group heterogeneity by reducing subjective bias and dimensionality, which is embedded with the use of Big Data.
Practical implications
The use of Big Data in clustering applied to productivity analysis can provide managers with data-driven information about the structural and socio-economic characteristics of retailers' catchment areas, which is important in establishing potential productivity performance and optimizing resource allocation. The improved productivity indexes enable the setting of targets that are coherent with retailers' potential, which increases motivation and commitment.
Originality/value
This article proposes an innovative technique to enhance the accuracy of productivity measures through the use of Big Data clustering and DEA. To the best of the authors’ knowledge, no attempts have been made to benefit from the use of Big Data in the literature on retail store productivity.
Details
Keywords
Orlando Troisi, Anna Visvizi and Mara Grimaldi
Digitalization accelerates the need of tourism and hospitality ecosystems to reframe business models in line with a data-driven orientation that can foster value creation and…
Abstract
Purpose
Digitalization accelerates the need of tourism and hospitality ecosystems to reframe business models in line with a data-driven orientation that can foster value creation and innovation. Since the question of data-driven business models (DDBMs) in hospitality remains underexplored, this paper aims at (1) revealing the key dimensions of the data-driven redefinition of business models in smart hospitality ecosystems and (2) conceptualizing the key drivers underlying the emergence of innovation in these ecosystems.
Design/methodology/approach
The empirical research is based on semi-structured interviews collected from a sample of hospitality managers, employed in three different accommodation services, i.e. hotels, bed and breakfast (B&Bs) and guesthouses, to explore data-driven strategies and practices employed on site.
Findings
The findings allow to devise a conceptual framework that classifies the enabling dimensions of DDBMs in smart hospitality ecosystems. Here, the centrality of strategy conducive to the development of data-driven innovation is stressed.
Research limitations/implications
The study thus developed a conceptual framework that will serve as a tool to examine the impact of digitalization in other service industries. This study will also be useful for small and medium-sized enterprises (SMEs) managers, who seek to understand the possibilities data-driven management strategies offer in view of stimulating innovation in the managers' companies.
Originality/value
The paper reinterprets value creation practices in business models through the lens of data-driven approaches. In this way, this paper offers a new (conceptual and empirical) perspective to investigate how the hospitality sector at large can use the massive amounts of data available to foster innovation in the sector.
Details
Keywords
The purpose of this paper is to explore why autistic people and their caregivers choose interventions other than applied behavior analysis (ABA), and how their decision impacts…
Abstract
Purpose
The purpose of this paper is to explore why autistic people and their caregivers choose interventions other than applied behavior analysis (ABA), and how their decision impacts them over their lifespan. The focus group was divided into those who pursued augmentative and alternative communication (AAC)-based supports, received ABA, selected other interventions or received no intervention at all. The reported posttraumatic stress symptoms (PTSS) of ABA recipients were compared to non-ABA recipients in order to evaluate the long-term impacts of all intervention types. Using a mixed-method thematic analysis, optional comments submitted alongside a quantitative online survey were reviewed for emergent themes. These comments augmented the survey Likert scores with a qualitative impression of the diverse intervention-related attitudes among participants. Investigating the lived experiences of autism intervention recipients illuminated the scope of the long-term impacts of each intervention that was chosen. Overall, autistics who received no intervention fared best, based on the lowest reported PTSS. These findings may inform the potential redesign of autism interventions based on the firsthand reported experiences and opinions of autistics.
Design/methodology/approach
The aim of this study was to conduct research that is both question-driven and data-driven to aid in the analysis of existing data (Van Helden, 2013). In the research question-driven approach, the independent variables were the intervention type and duration of exposure relative to lifespan; the dependent variables were the PTSS severity score and binary indicator of meeting PTSS criteria. The analyses that were conducted included linear regression analyses of severity score on intervention type and duration, and χ2 tests for independence of the probabilities of PTSS criterion satisfaction and intervention type. This experiment was designed to test the data-driven hypothesis that the prevalence and severity of PTSS are dependent on the type of autism intervention and duration of exposure. After reviewing the primary data set, the data-driven inquiry determined that the sample for secondary analysis should be categorized by communication-based vs non-communication-based intervention type in order to best complement the limitations and strengths of the published findings from the primary analysis.
Findings
Autistics who received no intervention had a 59 percent lower likelihood of meeting the PTSS criteria when compared to their ABA peers, and they remained 99.6 percent stable in their reported symptoms throughout their lifespan (R2=0.004). ABA recipients were 1.74 times more likely to meet the PTSS criteria when compared to their AAC peers. Within the 23 percent who selected an intervention other than ABA, consisting of psychotherapy, mental health, son-rise and other varying interventions, 63 percent were asymptomatic. This suggests that the combined benefits of communication-based interventions over behaviorism-influenced ABA practices may contribute to enhanced quality of life. Although not generalizable beyond the scope of this study, it is indicated from the data that autistics who received no intervention at all fared best over their lifetimes.
Research limitations/implications
The obvious advantage of a secondary analysis is to uncover key findings that may have been overlooked in the preliminary study. Omitted variables in the preliminary data leave the researcher naive to crucially significant findings, which may be mitigated by subsequent testing in follow-up studies (Cheng and Phillips, 2014, p. 374). Frequency tables and cross-tabulations of all variables included in the primary analysis were reproduced. The secondary analysis of existing data was conducted from the design variables used in the original study and applied in the secondary analyses to generate less biased estimates (Lohr, 2010; Graubard and Korn, 1996). Inclusion criteria for each intervention group, PTSS scores and exposure duration, were inherited from the primary analysis, to allow for strategic judgment about the coding of the core variables pertaining to AAC and PTSS. The data sample from 460 respondents was reduced to a non-ABA group of n=330. An external statistician scored each respondent, and interrater reliability was assessed using Cohen’s κ coefficient (κ=1).
Practical implications
Including the autistic voice in the long-term planning of childhood interventions is essential to those attempting to meet the needs of the individuals, their families and communities. Both parents and autistic participant quotes were obtained directly from the optional comments to reveal why parents quit or persisted with an autism intervention.
Social implications
Practitioners and intervention service providers must consider this feedback from those who are directly impacted by the intervention style, frequency or intensity. The need for such work is confirmed in the recent literature as well, such as community-based participatory research (Raymaker, 2016). Autistics should be recognized as experts in their own experience (Milton, 2014). Community–academic partnerships are necessary to investigate the needs of the autistic population (Meza et al., 2016).
Originality/value
Most autistic people do not consider autism to be a mental illness nor a behavior disorder. It is imperative to recognize that when injurious behavior persists, and disturbance in mood, cognition, sleep pattern and focus are exacerbated, the symptoms are unrelated to autism and closely align to the diagnostic criteria for posttraumatic stress disorder (PTSD). When PTSD is underdiagnosed and untreated, the autistic individual may experience hyperarousal and become triggered by otherwise agreeable stimuli. Since autism interventions are typically structured around high contact, prolonged hours and 1:1 engagement, the nature of the intervention must be re-evaluated as a potentially traumatic event for an autistic person in the hyperarousal state. Any interventions which trigger more than it helps should be avoided and discontinued when PTSS emerge.
Details
Keywords
Kalliopi Platanou, Kristiina Mäkelä, Anton Beletskiy and Anatoli Colicev
The purpose of this paper is to propose new directions for human resource management (HRM) research by drawing attention to online data as a complementary data source to…
Abstract
Purpose
The purpose of this paper is to propose new directions for human resource management (HRM) research by drawing attention to online data as a complementary data source to traditional quantitative and qualitative data, and introducing network text analysis as a method for large quantities of textual material.
Design/methodology/approach
The paper first presents the added value and potential challenges of utilising online data in HRM research, and then proposes a four-step process for analysing online data with network text analysis.
Findings
Online data represent a naturally occuring source of real-time behavioural data that do not suffer from researcher intervention or hindsight bias. The authors argue that as such, this type of data provides a promising yet currently largely untapped empirical context for HRM research that is particularly suited for examining discourses and behavioural and social patterns over time.
Practical implications
While online data hold promise for many novel research questions, it is less appropriate for research questions that seek to establish causality between variables. When using online data, particular attention must be paid to ethical considerations, as well as the validity and representativeness of the sample.
Originality/value
The authors introduce online data and network text analysis as a new avenue for HRM research, with potential to address novel research questions at micro-, meso- and macro-levels of analysis.
Details
Keywords
Thomas G. Cech, Trent J. Spaulding and Joseph A. Cazier
The purpose of this paper is to lay out the data competence maturity model (DCMM) and discuss how the application of the model can serve as a foundation for a measured and…
Abstract
Purpose
The purpose of this paper is to lay out the data competence maturity model (DCMM) and discuss how the application of the model can serve as a foundation for a measured and deliberate use of data in secondary education.
Design/methodology/approach
Although the model is new, its implications, and its application are derived from key findings and best practices from the software development, data analytics and secondary education performance literature. These principles can guide educators to better manage student and operational outcomes. This work builds and applies the DCMM model to secondary education.
Findings
The conceptual model reveals significant opportunities to improve data-driven decision making in schools and local education agencies (LEAs). Moving past the first and second stages of the data competency maturity model should allow educators to better incorporate data into the regular decision-making process.
Practical implications
Moving up the DCMM to better integrate data into their decision-making process has the potential to produce profound improvements for schools and LEAs. Data science is about making better decisions. Understanding the path laid out in the DCMM to helping an organization move to a more mature data-driven decision-making process will help improve both student and operational outcomes.
Originality/value
This paper brings a new concept, the DCMM, to the educational literature and discusses how these principles can be applied to improve decision making by integrating them into their decision-making process and trying to help the organization mature within this framework.
Details
Keywords
Simone Fanelli, Lorenzo Pratici, Fiorella Pia Salvatore, Chiara Carolina Donelli and Antonello Zangrandi
This study aims to provide a picture of the current state of art in the use of big data for decision-making processes for the management of health-care organizations.
Abstract
Purpose
This study aims to provide a picture of the current state of art in the use of big data for decision-making processes for the management of health-care organizations.
Design/methodology/approach
A systematic literature review was carried out. The research uses two analyses: descriptive analysis, describing the evolution of citations; keywords; and the ten most influential papers, and bibliometric analysis, for content evaluation, for which a cluster analysis was performed.
Findings
A total of 48 articles were selected for bibliographic coupling out of an initial sample of more than 5,000 papers. Of the 48 articles, 29 are linked on the basis of their bibliography. Clustering the 29 articles on the basis of actual content, four research areas emerged: quality of care, quality of service, crisis management and data management.
Originality/value
Health-care organizations believe strongly that big data can become the most effective tool for correctly influencing the decision-making processes. Thus, more and more organizations continue to invest in big data analytics, and the literature on this topic has expanded rapidly. This study seeks to provide a comprehensive picture of the different streams of literature existing, together with gaps in research and future perspectives. The literature is mature enough for an analysis to be made and provide managers with useful insights on opportunities, criticisms and perspectives on the use of big data for health-care organizations. However, to date, there is no comprehensive literature review on the big data analysis in health care. Furthermore, as big data is a “sexy catchphrase,” more clarity on its usage may be needed. It represents an important tool to be investigated and its great potential is often yet to be discovered. This study thus sheds light on emerging issues and suggests further research that may be needed.
Details
Keywords
Gustavo Grander, Luciano Ferreira da Silva and Ernesto Del Rosário Santibañez Gonzalez
This paper aims to analyze how decision support systems manage Big data to obtain value.
Abstract
Purpose
This paper aims to analyze how decision support systems manage Big data to obtain value.
Design/methodology/approach
A systematic literature review was performed with screening and analysis of 72 articles published between 2012 and 2019.
Findings
The findings reveal that techniques of big data analytics, machine learning algorithms and technologies predominantly related to computer science and cloud computing are used on decision support systems. Another finding was that the main areas that these techniques and technologies are been applied are logistic, traffic, health, business and market. This article also allows authors to understand the relationship in which descriptive, predictive and prescriptive analyses are used according to an inverse relationship of complexity in data analysis and the need for human decision-making.
Originality/value
As it is an emerging theme, this study seeks to present an overview of the techniques and technologies that are being discussed in the literature to solve problems in their respective areas, as a form of theoretical contribution. The authors also understand that there is a practical contribution to the maturity of the discussion and with reflections even presented as suggestions for future research, such as the ethical discussion. This study’s descriptive classification can also serve as a guide for new researchers who seek to understand the research involving decision support systems and big data to gain value in our society.
Details
Keywords
The purpose this paper is to review some of the statistical methods used in the field of social sciences.
Abstract
Purpose
The purpose this paper is to review some of the statistical methods used in the field of social sciences.
Design/methodology/approach
A review of some of the statistical methodologies used in areas like survey methodology, official statistics, sociology, psychology, political science, criminology, public policy, marketing research, demography, education and economics.
Findings
Several areas are presented such as parametric modeling, nonparametric modeling and multivariate methods. Focus is also given to time series modeling, analysis of categorical data and sampling issues and other useful techniques for the analysis of data in the social sciences. Indicative references are given for all the above methods along with some insights for the application of these techniques.
Originality/value
This paper reviews some statistical methods that are used in social sciences and the authors draw the attention of researchers on less popular methods. The purpose is not to give technical details and also not to refer to all the existing techniques or to all the possible areas of statistics. The focus is mainly on the applied aspect of the techniques and the authors give insights about techniques that can be used to answer problems in the abovementioned areas of research.
Details