Search results
1 – 10 of over 15000
This chapter conceptualises a link between Industrial Revolution 4.0 (IR 4.0), big data, data science and sustainable tourism.
Abstract
Purpose
This chapter conceptualises a link between Industrial Revolution 4.0 (IR 4.0), big data, data science and sustainable tourism.
Design/Methodology/Approach
The author adopts a grounded theory and conceptual approach to endeavour in this exploratory research.
Findings
The outcome shows a significant rise of big data in the tourism sector under three major dimensions, i.e. business, governance and research. And, some exemplary evidence of institutions promoting the use of big data and data science for sustainable tourism has been discussed.
Originality/Value
The conceptualised interlinkage of concepts like IR 4.0, big data, data science and sustainable development provides a valuable knowledge resource to policy-makers, researchers, businesses and students.
Details
Keywords
Victoria Delaney and Victor R. Lee
With increased focus on data literacy and data science education in K-12, little is known about what makes a data set preferable for use by classroom teachers. Given that…
Abstract
Purpose
With increased focus on data literacy and data science education in K-12, little is known about what makes a data set preferable for use by classroom teachers. Given that educational designers often privilege authenticity, the purpose of this study is to examine how teachers use features of data sets to determine their suitability for authentic data science learning experiences with their students.
Design/methodology/approach
Interviews with 12 practicing high school mathematics and statistics teachers were conducted and video-recorded. Teachers were given two different data sets about the same context and asked to explain which one would be better suited for an authentic data science experience. Following knowledge analysis methods, the teachers’ responses were coded and iteratively reviewed to find themes that appeared across multiple teachers related to their aesthetic judgments.
Findings
Three aspects of authenticity for data sets for this task were identified. These include thinking of authentic data sets as being “messy,” as requiring more work for the student or analyst to pore through than other data sets and as involving computation.
Originality/value
Analysis of teachers’ aesthetics of data sets is a new direction for work on data literacy and data science education. The findings invite the field to think critically about how to help teachers develop new aesthetics and to provide data sets in curriculum materials that are suited for classroom use.
Details
Keywords
Ibrahim Oluwajoba Adisa, Danielle Herro, Oluwadara Abimbade and Golnaz Arastoopour Irgens
This study is part of a participatory design research project and aims to develop and study pedagogical frameworks and tools for integrating computational thinking (CT) concepts…
Abstract
Purpose
This study is part of a participatory design research project and aims to develop and study pedagogical frameworks and tools for integrating computational thinking (CT) concepts and data science practices into elementary school classrooms.
Design/methodology/approach
This paper describes a pedagogical approach that uses a data science framework the research team developed to assist teachers in providing data science instruction to elementary-aged students. Using phenomenological case study methodology, the authors use classroom observations, student focus groups, video recordings and artifacts to detail ways learners engage in data science practices and understand how they perceive their engagement during activities and learning.
Findings
Findings suggest student engagement in data science is enhanced when data problems are contextualized and connected to students’ lived experiences; data analysis and data-based decision-making is practiced in multiple ways; and students are given choices to communicate patterns, interpret graphs and tell data stories. The authors note challenges students experienced with data practices including conflict between inconsistencies in data patterns and lived experiences and focusing on data visualization appearances versus relationships between variables.
Originality/value
Data science instruction in elementary schools is an understudied, emerging and important area of data science education. Most elementary schools offer limited data science instruction; few elementary schools offer data science curriculum with embedded CT practices integrated across disciplines. This research assists elementary educators in fostering children's data science engagement and agency while developing their ability to reason, visualize and make decisions with data.
Details
Keywords
Lukas Goretzki, Martin Messner and Maria Wurm
Data science promises new opportunities for organizational decision-making. Data scientists arguably play an important role in this regard and one can even observe a certain…
Abstract
Purpose
Data science promises new opportunities for organizational decision-making. Data scientists arguably play an important role in this regard and one can even observe a certain “buzz” around this nascent occupation. This paper enquires into how data scientists construct their occupational identity and the challenges they experience when enacting it.
Design/methodology/approach
Based on semi-structured interviews with data scientists working in different industries, the authors explore how these actors draw on their educational background, work experiences and perception of the contemporary digitalization discourse to craft their occupational identities.
Findings
The authors identify three main components of data scientists’ occupational identity: a scientific mindset, an interest in sophisticated forms of data work and a problem-solving attitude. The authors demonstrate how enacting this identity is sometimes challenged through what data scientists perceive as either too low or too high expectations that managers form towards them. To address those expectations, they engage in outward-facing identity work by carrying out educational work within the organization and (paradoxically) stressing both prestigious and non-prestigious parts of their work to “tame” the ambiguity and hype they perceive in managers’ expectations. In addition, they act upon themselves to better appreciate managers’ perspectives and expectations.
Originality/value
This study contributes to research on data scientists as well as the accounting literature that often refers to data scientists as new competitors for accountants. It cautions scholars and practitioners alike to be careful when discussing the possibilities and limitations of data science concerning advancements in accounting and control.
Details
Keywords
Temidayo Oluwasola Osunsanmi, Timothy O. Olawumi, Andrew Smith, Suha Jaradat, Clinton Aigbavboa, John Aliu, Ayodeji Oke, Oluwaseyi Ajayi and Opeyemi Oyeyipo
The study aims to develop a model that supports the application of data science techniques for real estate professionals in the fourth industrial revolution (4IR) era. The present…
Abstract
Purpose
The study aims to develop a model that supports the application of data science techniques for real estate professionals in the fourth industrial revolution (4IR) era. The present 4IR era gave birth to big data sets and is beyond real estate professionals' analysis techniques. This has led to a situation where most real estate professionals rely on their intuition while neglecting a rigorous analysis for real estate investment appraisals. The heavy reliance on their intuition has been responsible for the under-performance of real estate investment, especially in Africa.
Design/methodology/approach
This study utilised a survey questionnaire to randomly source data from real estate professionals. The questionnaire was analysed using a combination of Statistical package for social science (SPSS) V24 and Analysis of a Moment Structures (AMOS) graphics V27 software. Exploratory factor analysis was employed to break down the variables (drivers) into meaningful dimensions helpful in developing the conceptual framework. The framework was validated using covariance-based structural equation modelling. The model was validated using fit indices like discriminant validity, standardised root mean square (SRMR), comparative fit index (CFI), Normed Fit Index (NFI), etc.
Findings
The model revealed that an inclusive educational system, decentralised real estate market and data management system are the major drivers for applying data science techniques to real estate professionals. Also, real estate professionals' application of the drivers will guarantee an effective data analysis of real estate investments.
Originality/value
Numerous studies have clamoured for adopting data science techniques for real estate professionals. There is a lack of studies on the drivers that will guarantee the successful adoption of data science techniques. A modern form of data analysis for real estate professionals was also proposed in the study.
Details
Keywords
Sukjin You, Soohyung Joo and Marie Katsurai
The purpose of this study is to explore to which extent data mining research would be associated with the library and information science (LIS) discipline. This study aims to…
Abstract
Purpose
The purpose of this study is to explore to which extent data mining research would be associated with the library and information science (LIS) discipline. This study aims to identify data mining related subject terms and topics in representative LIS scholarly publications.
Design/methodology/approach
A large set of bibliographic records over 38,000 was collected from a scholarly database representing the fields of LIS and the data mining, respectively. A multitude of text mining techniques were applied to investigate prevailing subject terms and research topics, such as influential term analysis and Dirichlet multinomial regression topic modeling.
Findings
The findings of this study revealed the relationship between the LIS and data mining research domains. Various data mining method terms were observed in recent LIS publications, such as machine learning, artificial intelligence and neural networks. The topic modeling result identified prevailing data mining related research topics in LIS, such as machine learning, deep learning, big data and among others. In addition, this study investigated the trends of popular topics in LIS over time in the recent decade.
Originality/value
This investigation is one of a few studies that empirically investigated the relationships between the LIS and data mining research domains. Multiple text mining techniques were employed to delineate to which extent the two research domains would be associated with each other based on both at the term-level and topic-level analysis. Methodologically, the study identified influential terms in each domain using multiple feature selection indices. In addition, Dirichlet multinomial regression was applied to explore LIS topics in relation to data mining.
Details
Keywords
Magdalena Saldana-Perez, Giovanni Guzmán, Carolina Palma-Preciado, Amadeo Argüelles-Cruz and Marco Moreno-Ibarra
Climate change is a problem that concerns all of us. Despite the information produced by organizations such as the Expert Team on Climate Change Detection and Indices and the…
Abstract
Purpose
Climate change is a problem that concerns all of us. Despite the information produced by organizations such as the Expert Team on Climate Change Detection and Indices and the United Nations, only a few cities have been planned taking into account the climate changes indices. This paper aims to study climatic variations, how climate conditions might change in the future and how these changes will affect the activities and living conditions in cities, specifically focusing on Mexico city.
Design/methodology/approach
In this approach, two distinct machine learning regression models, k-Nearest Neighbors and Support Vector Regression, were used to predict variations in climate change indices within select urban areas of Mexico city. The calculated indices are based on maximum, minimum and average temperature data collected from the National Water Commission in Mexico and the Scientific Research Center of Ensenada. The methodology involves pre-processing temperature data to create a training data set for regression algorithms. It then computes predictions for each temperature parameter and ultimately assesses the performance of these algorithms based on precision metrics scores.
Findings
This paper combines a geospatial perspective with computational tools and machine learning algorithms. Among the two regression algorithms used, it was observed that k-Nearest Neighbors produced superior results, achieving an R2 score of 0.99, in contrast to Support Vector Regression, which yielded an R2 score of 0.74.
Originality/value
The full potential of machine learning algorithms has not been fully harnessed for predicting climate indices. This paper also identifies the strengths and weaknesses of each algorithm and how the generated estimations can then be considered in the decision-making process.
Details
Keywords
Sarah Amber Evans, Lingzi Hong, Jeonghyun Kim, Erin Rice-Oyler and Irhamni Ali
Data literacy empowers college students, equipping them with essential skills necessary for their personal lives and careers in today’s data-driven world. This study aims to…
Abstract
Purpose
Data literacy empowers college students, equipping them with essential skills necessary for their personal lives and careers in today’s data-driven world. This study aims to explore how community college students evaluate their data literacy and further examine demographic and educational/career advancement disparities in their self-assessed data literacy levels.
Design/methodology/approach
An online survey presenting a data literacy self-assessment scale was distributed and completed by 570 students at four community colleges. Statistical tests were performed between the data literacy factor scores and students’ demographic and educational/career advancement variables.
Findings
Male students rated their data literacy skills higher than females. The 18–19 age group has relatively lower confidence in their data literacy scores than other age groups. High school graduates do not feel proficient in data literacy to the level required for college and the workplace. Full-time employed students demonstrate more confidence in their data literacy than part-time and nonemployed students.
Originality/value
Given the lack of research on community college students’ data literacy, the findings of this study can be valuable in designing and implementing data literacy training programs for different groups of community college students.
Details
Keywords
Archana S.N. and Padmakumar P.K.
The purpose of this study was to understand the landscape of Indian research data repositories (RDRs) indexed in the re3data.org. The study analysed the metadata elements of…
Abstract
Purpose
The purpose of this study was to understand the landscape of Indian research data repositories (RDRs) indexed in the re3data.org. The study analysed the metadata elements of Indian RDRs to identify their disciplinary orientations, typology, standards adopted, foreign collaborations, etc. The study ascertained the current status of the Indian RDRs by visiting their respective websites and tried to identify and map the exact disciplinary orientation of each RDR.
Design/methodology/approach
The study used “content analysis” of the metadata elements extracted from re3data.org along with the information analysis of the respective websites of the registered RDRs.
Findings
The study identified that only 80% of the Indian RDRs listed by the re3data.org is currently active. Most of the Indian RDRs are hosted by the central and state governments and are almost equally distributed among Life Sciences, Natural Sciences and Social Sciences domains. The data provided by the re3data.org for the Indian RDRs are not complete and up-to-date.
Practical implications
The findings indicate the presence of a good number of inactive RDRs in the re3data.org. The study suggests using a revised version of the DFG subject classification scheme or considering a standard classification scheme for subject indexing.
Originality/value
To the best of the authors’ knowledge, this study is the first of its kind that critically analysed the metadata values extracted and moved further to identify the current status of Indian RDRs.
Details
Keywords
Neema Florence Mosha and Patrick Ngulube
The study aims to investigate the utilisation of open research data repositories (RDRs) for storing and sharing research data in higher learning institutions (HLIs) in Tanzania.
Abstract
Purpose
The study aims to investigate the utilisation of open research data repositories (RDRs) for storing and sharing research data in higher learning institutions (HLIs) in Tanzania.
Design/methodology/approach
A survey research design was employed to collect data from postgraduate students at the Nelson Mandela African Institution of Science and Technology (NM-AIST) in Arusha, Tanzania. The data were collected and analysed quantitatively and qualitatively. A census sampling technique was employed to select the sample size for this study. The quantitative data were analysed using the Statistical Package for the Social Sciences (SPSS), whilst the qualitative data were analysed thematically.
Findings
Less than half of the respondents were aware of and were using open RDRs, including Zenodo, DataVerse, Dryad, OMERO, GitHub and Mendeley data repositories. More than half of the respondents were not willing to share research data and cited a lack of ownership after storing their research data in most of the open RDRs and data security. HILs need to conduct training on using trusted repositories and motivate postgraduate students to utilise open repositories (ORs). The challenges for underutilisation of open RDRs were a lack of policies governing the storage and sharing of research data and grant constraints.
Originality/value
Research data storage and sharing are of great interest to researchers in HILs to inform them to implement open RDRs to support these researchers. Open RDRs increase visibility within HILs and reduce research data loss, and research works will be cited and used publicly. This paper identifies the potential for additional studies focussed on this area.
Details