Search results

1 – 10 of over 2000
Click here to view access options
Article
Publication date: 21 December 2020

Sudha Cheerkoot-Jalim and Kavi Kumar Khedo

This work shows the results of a systematic literature review on biomedical text mining. The purpose of this study is to identify the different text mining approaches used…

Abstract

Purpose

This work shows the results of a systematic literature review on biomedical text mining. The purpose of this study is to identify the different text mining approaches used in different application areas of the biomedical domain, the common tools used and the challenges of biomedical text mining as compared to generic text mining algorithms. This study will be of value to biomedical researchers by allowing them to correlate text mining approaches to specific biomedical application areas. Implications for future research are also discussed.

Design/methodology/approach

The review was conducted following the principles of the Kitchenham method. A number of research questions were first formulated, followed by the definition of the search strategy. The papers were then selected based on a list of assessment criteria. Each of the papers were analyzed and information relevant to the research questions were extracted.

Findings

It was found that researchers have mostly harnessed data sources such as electronic health records, biomedical literature, social media and health-related forums. The most common text mining technique was natural language processing using tools such as MetaMap and Unstructured Information Management Architecture, alongside the use of medical terminologies such as Unified Medical Language System. The main application area was the detection of adverse drug events. Challenges identified included the need to deal with huge amounts of text, the heterogeneity of the different data sources, the duality of meaning of words in biomedical text and the amount of noise introduced mainly from social media and health-related forums.

Originality/value

To the best of the authors’ knowledge, other reviews in this area have focused on either specific techniques, specific application areas or specific data sources. The results of this review will help researchers to correlate most relevant and recent advances in text mining approaches to specific biomedical application areas by providing an up-to-date and holistic view of work done in this research area. The use of emerging text mining techniques has great potential to spur the development of innovative applications, thus considerably impacting on the advancement of biomedical research.

Details

Journal of Knowledge Management, vol. 25 no. 3
Type: Research Article
ISSN: 1367-3270

Keywords

Click here to view access options
Article
Publication date: 31 May 2018

Antonio Usai, Marco Pironti, Monika Mital and Chiraz Aouina Mejri

The aim of this work is to increase awareness of the potential of the technique of text mining to discover knowledge and further promote research collaboration between…

Downloads
3271

Abstract

Purpose

The aim of this work is to increase awareness of the potential of the technique of text mining to discover knowledge and further promote research collaboration between knowledge management and the information technology communities. Since its emergence, text mining has involved multidisciplinary studies, focused primarily on database technology, Web-based collaborative writing, text analysis, machine learning and knowledge discovery. However, owing to the large amount of research in this field, it is becoming increasingly difficult to identify existing studies and therefore suggest new topics.

Design/methodology/approach

This article offers a systematic review of 85 academic outputs (articles and books) focused on knowledge discovery derived from the text mining technique. The systematic review is conducted by applying “text mining at the term level, in which knowledge discovery takes place on a more focused collection of words and phrases that are extracted from and label each document” (Feldman et al., 1998, p. 1).

Findings

The results revealed that the keywords extracted to be associated with the main labels, id est, knowledge discovery and text mining, can be categorized in two periods: from 1998 to 2009, the term knowledge and text were always used. From 2010 to 2017 in addition to these terms, sentiment analysis, review manipulation, microblogging data and knowledgeable users were the other terms frequently used. Besides this, it is possible to notice the technical, engineering nature of each term present in the first decade. Whereas, a diverse range of fields such as business, marketing and finance emerged from 2010 to 2017 owing to a greater interest in the online environment.

Originality/value

This is a first comprehensive systematic review on knowledge discovery and text mining through the use of a text mining technique at term level, which offers to reduce redundant research and to avoid the possibility of missing relevant publications.

Details

Journal of Knowledge Management, vol. 22 no. 7
Type: Research Article
ISSN: 1367-3270

Keywords

Click here to view access options
Article
Publication date: 9 October 2019

Francisco Villarroel Ordenes and Shunyuan Zhang

The purpose of this paper is to describe and position the state-of-the-art of text and image mining methods in business research. By providing a detailed conceptual and…

Downloads
2532

Abstract

Purpose

The purpose of this paper is to describe and position the state-of-the-art of text and image mining methods in business research. By providing a detailed conceptual and technical review of both methods, it aims to increase their utilization in service research.

Design/methodology/approach

On a first stage, the authors review business literature in marketing, operations and management concerning the use of text and image mining methods. On a second stage, the authors identify and analyze empirical papers that used text and image mining methods in services journals and premier business. Finally, avenues for further research in services are provided.

Findings

The manuscript identifies seven text mining methods and describes their approaches, processes, techniques and algorithms, involved in their implementation. Four of these methods are positioned similarly for image mining. There are 39 papers using text mining in service research, with a focus on measuring consumer sentiment, experiences, and service quality. Due to the nonexistent use of image mining service journals, the authors review their application in marketing and management, and suggest ideas for further research in services.

Research limitations/implications

This manuscript focuses on the different methods and their implementation in service research, but it does not offer a complete review of business literature using text and image mining methods.

Practical implications

The results have a number of implications for the discipline that are presented and discussed. The authors provide research directions using text and image mining methods in service priority areas such as artificial intelligence, frontline employees, transformative consumer research and customer experience.

Originality/value

The manuscript provides an introduction to text and image mining methods to service researchers and practitioners interested in the analysis of unstructured data. This paper provides several suggestions concerning the use of new sources of data (e.g. customer reviews, social media images, employee reviews and emails), measurement of new constructs (beyond sentiment and valence) and the use of more recent methods (e.g. deep learning).

Details

Journal of Service Management, vol. 30 no. 5
Type: Research Article
ISSN: 1757-5818

Keywords

Click here to view access options
Article
Publication date: 13 July 2020

Issam Tlemsani, Farhi Marir and Munir Majdalawieh

This paper revolves around the usage of data analytics in the Qur’an and Hadith through a new text mining technique to answer the main research question of whether the…

Abstract

Purpose

This paper revolves around the usage of data analytics in the Qur’an and Hadith through a new text mining technique to answer the main research question of whether the activities and the data flows of the Murabaha financing contract is compatible with Sharia law. The purpose of this paper is to provide a thorough and comprehensive database that will be used to examine existing practices in Islamic banks’ and improve compliancy with Islamic financial law (Sharia).

Design/methodology/approach

To design a Sharia-compliant Murabaha business process originated on text mining, the authors start by identifying the factors deemed necessary in their text mining techniques of both texts; using a four-step strategy to analyze those text mining analytics; then, they list the three basic approaches in text mining used for new knowledge discovery in databases: the co-occurrence approach based on the recursive co-occurrence algorithm; the machine learning or statistical-based; and the knowledge-based. They identify any variation and association between the Murabaha business processes produced using text mining against the one developed through data collection.

Findings

The main finding attained in this paper is to confirm the compatibility of all activities and the data flows in the Murabaha financing contract produced using data analytics of the Quran and Hadith texts against the Murabaha business process that was developed based on data collection. Another key finding is revealing some shortcomings regarding Islamic banks business process compliance with Sharia law.

Practical implications

Given Murabaha as the most popular mode of Islamic financing with more than 75% in total transactions, this research has managed to touch-base on an area that is interesting to the vast majority of those dealing with Islamic finance instruments. By reaching findings that could improve the existing Islamic Murabaha business process and concluding on Sharia compliance of the existing Murabaha business process, this research is quite relevant and could be used in practice as well as in influencing public policy. In fact, Islamic Sharia law experts, Islamic finance professionals and Islamic banks may find the results of this study very useful in improving at least one aspect of the Islamic finance transactions.

Originality/value

By using a novel, fresh text mining methods built on recursive occurrence of synonym words from the Qur’an and Hadith to enrich Islamic finance, this research study can claim to have been the first of its kind in using machine learning to mine the Quran, Hadith and in extracting valuable knowledge to support and consolidate the Islamic financial business processes and make them more compliant with the i.

Details

Journal of Islamic Accounting and Business Research, vol. 11 no. 9
Type: Research Article
ISSN: 1759-0817

Keywords

Click here to view access options
Article
Publication date: 28 January 2020

Mohamed Zaki and Janet R. McColl-Kennedy

The purpose of this paper is to offer a step-by-step text mining analysis roadmap (TMAR) for service researchers. The paper provides guidance on how to choose between…

Abstract

Purpose

The purpose of this paper is to offer a step-by-step text mining analysis roadmap (TMAR) for service researchers. The paper provides guidance on how to choose between alternative tools, using illustrative examples from a range of business contexts.

Design/methodology/approach

The authors provide a six-stage TMAR on how to use text mining methods in practice. At each stage, the authors provide a guiding question, articulate the aim, identify a range of methods and demonstrate how machine learning and linguistic techniques can be used in practice with illustrative examples drawn from business, from an array of data types, services and contexts.

Findings

At each of the six stages, this paper demonstrates useful insights that result from the text mining techniques to provide an in-depth understanding of the phenomenon and actionable insights for research and practice.

Originality/value

There is little research to guide scholars and practitioners on how to gain insights from the extensive “big data” that arises from the different data sources. In a first, this paper addresses this important gap highlighting the advantages of using text mining to gain useful insights for theory testing and practice in different service contexts.

Details

Journal of Services Marketing, vol. 34 no. 1
Type: Research Article
ISSN: 0887-6045

Keywords

Click here to view access options
Article
Publication date: 2 November 2021

Ririn Diar Astanti, Ivana Carissa Sutanto and The Jin Ai

This paper aims to propose a framework on complaint management system for quality management by applying the text mining method and potential failure identification that…

Abstract

Purpose

This paper aims to propose a framework on complaint management system for quality management by applying the text mining method and potential failure identification that can support organization learning (OL). Customer complaints in the form of email text is the input of the framework, while the most frequent complaints are visualized using a Pareto diagram. The company can learn from this Pareto diagram and take action to improve their process.

Design/methodology/approach

The first main part of the framework is creating a defect database from potential failure identification, which is the initial part of the failure mode and effect analysis technique. The second main part is the text mining of customer email complaints. The last part of the framework is matching the result of text mining with the defect database and presenting in the form of a Pareto diagram. After the framework is proposed, a case study is conducted to illustrate the applicability of the proposed method.

Findings

By using the defect database, the framework can interpret the customer email complaints into the list of most defect complained by customer using a Pareto diagram. The results of the Pareto diagram, based on the results of text mining of consumer complaints via email, can be used by a company to learn from complaint and to analyze the potential failure mode. This analysis helps company to take anticipatory action for avoiding potential failure mode happening in the future.

Originality/value

The framework on complaint management system for quality management by applying the text mining method and potential failure identification is proposed for the first time in this paper.

Details

The TQM Journal, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1754-2731

Keywords

Click here to view access options
Article
Publication date: 1 April 2021

Farshid Danesh, Meisam Dastani and Mohammad Ghorbani

The present article's primary purpose is the topic modeling of the global coronavirus publications in the last 50 years.

Downloads
2410

Abstract

Purpose

The present article's primary purpose is the topic modeling of the global coronavirus publications in the last 50 years.

Design/methodology/approach

The present study is applied research that has been conducted using text mining. The statistical population is the coronavirus publications that have been collected from the Web of Science Core Collection (1970–2020). The main keywords were extracted from the Medical Subject Heading browser to design the search strategy. Latent Dirichlet allocation and Python programming language were applied to analyze the data and implement the text mining algorithms of topic modeling.

Findings

The findings indicated that the SARS, science, protein, MERS, veterinary, cell, human, RNA, medicine and virology are the most important keywords in the global coronavirus publications. Also, eight important topics were identified in the global coronavirus publications by implementing the topic modeling algorithm. The highest number of publications were respectively on the following topics: “structure and proteomics,” “Cell signaling and immune response,” “clinical presentation and detection,” “Gene sequence and genomics,” “Diagnosis tests,” “vaccine and immune response and outbreak,” “Epidemiology and Transmission” and “gastrointestinal tissue.”

Originality/value

The originality of this article can be considered in three ways. First, text mining and Latent Dirichlet allocation were applied to analyzing coronavirus literature for the first time. Second, coronavirus is mentioned as a hot topic of research. Finally, in addition to the retrospective approaches to 50 years of data collection and analysis, the results can be exploited with prospective approaches to strategic planning and macro-policymaking.

Click here to view access options
Article
Publication date: 2 September 2019

Shenghua Zhou, S. Thomas Ng, Sang Hoon Lee, Frank J. Xu and Yifan Yang

In the architecture, engineering and construction (AEC) industry, technology developers have difficulties in fully understanding user needs due to the high domain…

Abstract

Purpose

In the architecture, engineering and construction (AEC) industry, technology developers have difficulties in fully understanding user needs due to the high domain knowledge threshold and the lack of effective and efficient methods to minimise information asymmetry between technology developers and AEC users. The paper aims to discuss this issue.

Design/methodology/approach

A synthetic approach combining domain knowledge and text mining techniques is proposed to help capture user needs, which is demonstrated using building information modelling (BIM) apps as a case. The synthetic approach includes the: collection and cleansing of BIM apps’ attribute data and users’ comments; incorporation of domain knowledge into the collected comments; performance of a sentiment analysis to distinguish positive and negative comments; exploration of the relationships between user sentiments and BIM apps’ attributes to unveil user preferences; and establishment of a topic model to identify problems frequently raised by users.

Findings

The results show that those BIM app categories with high user interest but low sentiments or supplies, such as “reality capture”, “interoperability” and “structural simulation and analysis”, should deserve greater efforts and attention from developers. BIM apps with continual updates and of small size are more preferred by users. Problems related to the “support for new Revit”, “import & export” and “external linkage” are most frequently complained by users.

Originality/value

The main contributions of this work include: the innovative application of text mining techniques to identify user needs to drive BIM apps development; and the development of a synthetic approach to orchestrating domain knowledge, text mining techniques (i.e. sentiment analysis and topic modelling) and statistical methods in order to help extract user needs for promoting the success of emerging technologies in the AEC industry.

Details

Engineering, Construction and Architectural Management, vol. 27 no. 2
Type: Research Article
ISSN: 0969-9988

Keywords

Click here to view access options
Article
Publication date: 6 February 2019

Mu-Chen Chen, Yu-Hsiang Hsiao, Kuo-Chien Chang and Ming-Ke Lin

Leisure and tourism activities have proliferated and become important parts of modern life, and the hotel industry plays a necessary role in the supply for and demand from…

Downloads
1133

Abstract

Purpose

Leisure and tourism activities have proliferated and become important parts of modern life, and the hotel industry plays a necessary role in the supply for and demand from consumers. The purpose of this paper is to develop guidelines for hotel service development by applying a service development approach integrating Kansei engineering and text mining.

Design/methodology/approach

The online reviews represent the voice of customers regarding the products and services. Consumers’ online comments might become a key factor for consumers choosing hotels when planning their tourism itinerary. With the framework of Kansei engineering, this paper adopts text mining to extract the sets of Kansei words and hotel service characteristics from the online contents as well as the relationships among Kansei words, service characteristics and these two sets. The relationships are generated by using link analysis, and then the guidelines for hotel service development are proposed based on the obtained relationships.

Findings

The results of the present research can provide the hotel industry a comprehensive understanding of hotels’ customers opinions, and can offer specific advice on how to differentiate one’s products and services from competitors’ in order to improve customer satisfaction and increase hotels’ performance in the end. Finally, this study finds out the service development guidelines to meet customers’ requirements which can provide suggestions for hotel managers. The implications both for academic and industry are also drawn based on the obtained results.

Originality/value

Now, in the internet era, consumers can comment on their hotel living experience directly through the internet. The large amount of user-generated content (UGC) provided by consumers also provides chances for the hospitality industry to understand consumers’ opinions through online review mining. The UGC with consumers’ opinions to hotel services can be continuously collected and analyzed by hoteliers. Therefore, this paper demonstrates how to apply the hybrid approach integrating Kansei engineering and online review mining to hotel service development.

Details

Data Technologies and Applications, vol. 53 no. 1
Type: Research Article
ISSN: 2514-9288

Keywords

Click here to view access options
Article
Publication date: 16 December 2019

Alberto Arenal, Claudio Feijoo, Ana Moreno, Cristina Armuña and Sergio Ramos

Academic research into entrepreneurship policy is particularly interesting due to the increasing relevance of the topic and since knowledge about the evolution of themes…

Abstract

Purpose

Academic research into entrepreneurship policy is particularly interesting due to the increasing relevance of the topic and since knowledge about the evolution of themes in this field is still rather limited. The purpose of this paper is to analyse the key concepts, topics, trends and shifts that have shaped the entrepreneurship policy research agenda during the period 1990–2016.

Design/methodology/approach

This paper uses text mining techniques, cluster analysis and complementary bibliographic data to examine the evolution of a corpus of 1,048 academic papers focused on entrepreneurship-related policies and published during the period 1990–2016 in ten relevant journals. In particular, the paper follows a standard text mining workflow: first, as text is unstructured, content requires a set of pre-processing tasks and then a stemming process. Then, the paper examines the most repeated concepts within the corpus, considering the whole period 1990–2016 and also in five-year terms. Finally, the paper conducts a k-means clustering to divide the collection of documents into coherent groups with similar content. The analyses in the paper also include geographical particularities considering three regional sub-corpora, distinguishing those articles authored in the European Union (EU), the USA and South and Eastern Asia, respectively.

Findings

Results of the analysis show that inclusion, employment and regulation-related papers have largely dominated the research in the field, evolving from an initial classical approach to the relationship between entrepreneurship and employment to a wider, multidisciplinary perspective, including the relevance of management, geographies and narrower topics such as agglomeration economics or internationalisation instead of the previous generic sectorial approaches. The text mining analysis also reveals how entrepreneurship policy research has gained increasing attention and has become both more open, with a growing cooperation among researchers from different affiliations, and more sophisticated, with concepts and themes that moved the research agenda forward, closer to the priorities of policy implementation.

Research limitations/implications

The paper identifies main trends and research gaps in the field of entrepreneurship policy providing actionable knowledge by presenting the spectrum of both over-explored and understudied research themes in the field. In practical terms the results of the text mining analysis can be interpreted as a compass to navigate the entrepreneurship policy research agenda.

Practical implications

The paper presents the heterogeneity of topics under research in the field, reinforcing the concept of entrepreneurship as a multidisciplinary and dynamic domain. Therefore, the definition and adoption of a certain policy agenda in entrepreneurship should consider multiple aspects (needs, objectives, stakeholders, expected outputs, etc.) to be comprehensive and aligned with its complexity. In addition, the paper shows how text mining techniques could be used to map the research activity in a particular field, contributing to the challenge of linking research and policy.

Originality/value

The exploratory nature of text mining allows us to obtain new knowledge and reveals hidden patterns from large quantities of documents/text data, representing an opportunity to complement other qualitative reviews. In this sense, the main value of this paper is not to advise on the future configuration of entrepreneurship policy as a research topic, but to unwrap the past by unveiling how key themes of the entrepreneurship policy research agenda have emerged, evolved and/or declined over time as a foundation on which to build further developments.

Details

Journal of Entrepreneurship and Public Policy, vol. 9 no. 1
Type: Research Article
ISSN: 2045-2101

Keywords

1 – 10 of over 2000