Search results
1 – 10 of over 5000Zhiwen Pan, Jiangtian Li, Yiqiang Chen, Jesus Pacheco, Lianjun Dai and Jun Zhang
The General Society Survey(GSS) is a kind of government-funded survey which aims at examining the Socio-economic status, quality of life, and structure of contemporary society…
Abstract
Purpose
The General Society Survey(GSS) is a kind of government-funded survey which aims at examining the Socio-economic status, quality of life, and structure of contemporary society. GSS data set is regarded as one of the authoritative source for the government and organization practitioners to make data-driven policies. The previous analytic approaches for GSS data set are designed by combining expert knowledges and simple statistics. By utilizing the emerging data mining algorithms, we proposed a comprehensive data management and data mining approach for GSS data sets.
Design/methodology/approach
The approach are designed to be operated in a two-phase manner: a data management phase which can improve the quality of GSS data by performing attribute pre-processing and filter-based attribute selection; a data mining phase which can extract hidden knowledge from the data set by performing data mining analysis including prediction analysis, classification analysis, association analysis and clustering analysis.
Findings
According to experimental evaluation results, the paper have the following findings: Performing attribute selection on GSS data set can increase the performance of both classification analysis and clustering analysis; all the data mining analysis can effectively extract hidden knowledge from the GSS data set; the knowledge generated by different data mining analysis can somehow cross-validate each other.
Originality/value
By leveraging the power of data mining techniques, the proposed approach can explore knowledge in a fine-grained manner with minimum human interference. Experiments on Chinese General Social Survey data set are conducted at the end to evaluate the performance of our approach.
Details
Keywords
Ruchi Kejriwal, Monika Garg and Gaurav Sarin
Stock market has always been lucrative for various investors. But, because of its speculative nature, it is difficult to predict the price movement. Investors have been using both…
Abstract
Purpose
Stock market has always been lucrative for various investors. But, because of its speculative nature, it is difficult to predict the price movement. Investors have been using both fundamental and technical analysis to predict the prices. Fundamental analysis helps to study structured data of the company. Technical analysis helps to study price trends, and with the increasing and easy availability of unstructured data have made it important to study the market sentiment. Market sentiment has a major impact on the prices in short run. Hence, the purpose is to understand the market sentiment timely and effectively.
Design/methodology/approach
The research includes text mining and then creating various models for classification. The accuracy of these models is checked using confusion matrix.
Findings
Out of the six machine learning techniques used to create the classification model, kernel support vector machine gave the highest accuracy of 68%. This model can be now used to analyse the tweets, news and various other unstructured data to predict the price movement.
Originality/value
This study will help investors classify a news or a tweet into “positive”, “negative” or “neutral” quickly and determine the stock price trends.
Details
Keywords
Kalervo Järvelin and Pertti Vakkari
This paper analyses the research in Library and Information Science (LIS) and reports on (1) the status of LIS research in 2015 and (2) on the evolution of LIS research…
Abstract
Purpose
This paper analyses the research in Library and Information Science (LIS) and reports on (1) the status of LIS research in 2015 and (2) on the evolution of LIS research longitudinally from 1965 to 2015.
Design/methodology/approach
The study employs a quantitative intellectual content analysis of articles published in 30+ scholarly LIS journals, following the design by Tuomaala et al. (2014). In the content analysis, we classify articles along eight dimensions covering topical content and methodology.
Findings
The topical findings indicate that the earlier strong LIS emphasis on L&I services has declined notably, while scientific and professional communication has become the most popular topic. Information storage and retrieval has given up its earlier strong position towards the end of the years analyzed. Individuals are increasingly the units of observation. End-user's and developer's viewpoints have strengthened at the cost of intermediaries' viewpoint. LIS research is methodologically increasingly scattered since survey, scientometric methods, experiment, case studies and qualitative studies have all gained in popularity. Consequently, LIS may have become more versatile in the analysis of its research objects during the years analyzed.
Originality/value
Among quantitative intellectual content analyses of LIS research, the study is unique in its scope: length of analysis period (50 years), width (8 dimensions covering topical content and methodology) and depth (the annual batch of 30+ scholarly journals).
Details
Keywords
Zhiwen Pan, Wen Ji, Yiqiang Chen, Lianjun Dai and Jun Zhang
The disability datasets are the datasets that contain the information of disabled populations. By analyzing these datasets, professionals who work with disabled populations can…
Abstract
Purpose
The disability datasets are the datasets that contain the information of disabled populations. By analyzing these datasets, professionals who work with disabled populations can have a better understanding of the inherent characteristics of the disabled populations, so that working plans and policies, which can effectively help the disabled populations, can be made accordingly.
Design/methodology/approach
In this paper, the authors proposed a big data management and analytic approach for disability datasets.
Findings
By using a set of data mining algorithms, the proposed approach can provide the following services. The data management scheme in the approach can improve the quality of disability data by estimating miss attribute values and detecting anomaly and low-quality data instances. The data mining scheme in the approach can explore useful patterns which reflect the correlation, association and interactional between the disability data attributes. Experiments based on real-world dataset are conducted at the end to prove the effectiveness of the approach.
Originality/value
The proposed approach can enable data-driven decision-making for professionals who work with disabled populations.
Details
Keywords
Paramita Ray and Amlan Chakrabarti
Social networks have changed the communication patterns significantly. Information available from different social networking sites can be well utilized for the analysis of users…
Abstract
Social networks have changed the communication patterns significantly. Information available from different social networking sites can be well utilized for the analysis of users opinion. Hence, the organizations would benefit through the development of a platform, which can analyze public sentiments in the social media about their products and services to provide a value addition in their business process. Over the last few years, deep learning is very popular in the areas of image classification, speech recognition, etc. However, research on the use of deep learning method in sentiment analysis is limited. It has been observed that in some cases the existing machine learning methods for sentiment analysis fail to extract some implicit aspects and might not be very useful. Therefore, we propose a deep learning approach for aspect extraction from text and analysis of users sentiment corresponding to the aspect. A seven layer deep convolutional neural network (CNN) is used to tag each aspect in the opinionated sentences. We have combined deep learning approach with a set of rule-based approach to improve the performance of aspect extraction method as well as sentiment scoring method. We have also tried to improve the existing rule-based approach of aspect extraction by aspect categorization with a predefined set of aspect categories using clustering method and compared our proposed method with some of the state-of-the-art methods. It has been observed that the overall accuracy of our proposed method is 0.87 while that of the other state-of-the-art methods like modified rule-based method and CNN are 0.75 and 0.80 respectively. The overall accuracy of our proposed method shows an increment of 7–12% from that of the state-of-the-art methods.
Details
Keywords
Abstract
Details
Keywords
Tiina Kalliomäki-Levanto and Antti Ukkonen
Interruptions are prevalent in knowledge work, and their negative consequences have driven research to find ways for interruption management. However, these means almost always…
Abstract
Purpose
Interruptions are prevalent in knowledge work, and their negative consequences have driven research to find ways for interruption management. However, these means almost always leave the responsibility and burden of interruptions with individual knowledge workers. System-level approaches for interruption management, on the other hand, have the potential to reduce the burden on employees. This paper’s objective is to pave way for system-level interruption management by showing that data about factual characteristics of work can be used to identify interrupting situations.
Design/methodology/approach
The authors provide a demonstration of using trace data from information and communications technology (ICT)-systems and machine learning to identify interrupting situations. They conduct a “simulation” of automated data collection by asking employees of two companies to provide information concerning situations and interruptions through weekly reports. They obtain information regarding four organizational elements: task, people, technology and structure, and employ classification trees to show that this data can be used to identify situations across which the level of interruptions differs.
Findings
The authors show that it is possible to identifying interrupting situations from trace data. During the eight-week observation period in Company A they identified seven and in Company B four different situations each having a different probability of occurrence of interruptions.
Originality/value
The authors extend employee-level interruption management to the system-level by using “task” as a bridging concept. Task is a core concept in both traditional interruption research and Leavitt's 1965 socio-technical model which allows us to connect other organizational elements (people, structure and technology) to interruptions.
Details
Keywords
Abstract
Purpose
The study aimsto analyze the main elements associated with the evolution of Brazilian agtechs from the initial conception of the business model to becoming companies in the scale-up stage.
Design/methodology/approach
The exploratory research was conducted based on data collected through in-depth interviews. The answers were analyzed quantitatively using descending hierarchical classification (DHC) and correspondence factor analysis (CFA) and qualitatively using content analysis.
Findings
Five main elements were identified as responsible for the evolution of the companies up to their entering the scale-up phase: (1) governance, (2) decisions inherent to resource allocation, (3) monitoring of strategic, tactical and operational activities, (4) fostering human capital development and (5) business model validation. Each element presents a set of performance indicators that show the scalability of these companies.
Practical implications
The model developed can help companies that have not yet advanced from the conception of the business model to the scalability of different sectors, in addition to agribusiness.
Social implications
Proposal of a model that presents the main elements that impact on scalability and respective indicators that contributed to the scalability process of Brazilian agtechs.
Originality/value
This study contributed to advancing the knowledge on the organizational life cycle (OLC) of agricultural startups, particularly regarding the factors responsible for their scalability.
Details
Keywords
Roy Langer and Suzanne C. Beckman
This paper discusses how netnography can be applied in order to conduct covert research on sensitive research topics. An analysis of a Danish internet message board on cosmetic…
Abstract
Purpose
This paper discusses how netnography can be applied in order to conduct covert research on sensitive research topics. An analysis of a Danish internet message board on cosmetic surgery illustrates suggestions concerning modifications of netnography guidelines.
Design/methodology/approach
Owing to the relevance of studying sensitive research topics – in particular when access to informants is difficult – netnography has been applied in an analysis of cross consumer online‐communication about cosmetic surgery on a Danish internet message board. Methodological stages and procedures including entreé, data collection, analysis and interpretation have been followed. In terms of research ethics and member checks, however, the suggested guidelines have been modified.
Findings
Empirical findings verify that consumers use internet message boards in order to exchange information and advice about cosmetic surgery. Especially the opportunity to masquerade and to cover their identities allows them to express attitudes, opinions, and experiences freely – and hence to study these in order gain deeper insights into consumption motives, concerns, and experiences.
Originality/value
The paper suggests that netnography is a suitable methodology for the study of sensitive research topics, enabling the researcher in an unobtrusive and covert way to gain deeper insights into consumers' opinions, motives, and concerns. Based on a discussion of netnography's position in between discourse analysis, content analysis and ethnography, it is argued for the legitimacy of covert research, including a revision of existing guidelines for research ethics with regard to informed consent when conducting netnography.
Details
Keywords
The purpose this paper is to review some of the statistical methods used in the field of social sciences.
Abstract
Purpose
The purpose this paper is to review some of the statistical methods used in the field of social sciences.
Design/methodology/approach
A review of some of the statistical methodologies used in areas like survey methodology, official statistics, sociology, psychology, political science, criminology, public policy, marketing research, demography, education and economics.
Findings
Several areas are presented such as parametric modeling, nonparametric modeling and multivariate methods. Focus is also given to time series modeling, analysis of categorical data and sampling issues and other useful techniques for the analysis of data in the social sciences. Indicative references are given for all the above methods along with some insights for the application of these techniques.
Originality/value
This paper reviews some statistical methods that are used in social sciences and the authors draw the attention of researchers on less popular methods. The purpose is not to give technical details and also not to refer to all the existing techniques or to all the possible areas of statistics. The focus is mainly on the applied aspect of the techniques and the authors give insights about techniques that can be used to answer problems in the abovementioned areas of research.
Details