Search results

1 – 10 of 593

View access options

Article

Publication date: 3 August 2021

A qualitative and quantitative comparison between Web scraping and API methods for Twitter credibility analysis

Irvin Dongo, Yudith Cardinale, Ana Aguilera, Fabiola Martinez, Yuni Quintero, German Robayo and David Cabeza

This paper aims to perform an exhaustive revision of relevant and recent related studies, which reveals that both extraction methods are currently used to analyze credibility on…

HTML

PDF (268 KB)

Downloads

546

Abstract

Purpose

This paper aims to perform an exhaustive revision of relevant and recent related studies, which reveals that both extraction methods are currently used to analyze credibility on Twitter. Thus, there is clear evidence of the need of having different options to extract different data for this purpose. Nevertheless, none of these studies perform a comparative evaluation of both extraction techniques. Moreover, the authors extend a previous comparison, which uses a recent developed framework that offers both alternates of data extraction and implements a previously proposed credibility model, by adding a qualitative evaluation and a Twitter-Application Programming Interface (API) performance analysis from different locations.

Design/methodology/approach

As one of the most popular social platforms, Twitter has been the focus of recent research aimed at analyzing the credibility of the shared information. To do so, several proposals use either Twitter API or Web scraping to extract the data to perform the analysis. Qualitative and quantitative evaluations are performed to discover the advantages and disadvantages of both extraction methods.

Findings

The study demonstrates the differences in terms of accuracy and efficiency of both extraction methods and gives relevance to much more problems related to this area to pursue true transparency and legitimacy of information on the Web.

Originality/value

Results report that some Twitter attributes cannot be retrieved by Web scraping. Both methods produce identical credibility values when a robust normalization process is applied to the text (i.e. tweet). Moreover, concerning the time performance, Web scraping is faster than Twitter API and it is more flexible in terms of obtaining data; however, Web scraping is very sensitive to website changes. Additionally, the response time of the Twitter API is proportional to the distance from the central server at San Francisco.

Details

International Journal of Web Information Systems, vol. 17 no. 6

Type: Research Article

DOI:

ISSN: 1744-0084

Keywords

View access options

Article

Publication date: 12 November 2019

Web scraping for food price research

Judith Hillen

The purpose of this paper is to discuss web scraping as a method for extracting large amounts of data from online sources. The author wants to raise awareness of the method’s…

HTML

PDF (219 KB)

Downloads

1081

Abstract

Purpose

The purpose of this paper is to discuss web scraping as a method for extracting large amounts of data from online sources. The author wants to raise awareness of the method’s potential in the field of food price research, hoping to enable fellow researchers to apply this method.

Design/methodology/approach

The author explains the technical procedure of web scraping, reviews the existing literature, and identifies areas of application and limitations for food price research.

Findings

The author finds that web scraping is a promising method to collect customised, high-frequency data in real time, overcoming several limitations of currently used food price data sources. With today’s applications mostly focussing on (online) consumer prices, the scope of applications for web scraping broadens as more and more price data are published online.

Research limitations/implications

To better deal with the technical and legal challenges of web scraping and to exploit its scalability, joint data collection projects in the field of agricultural and food economics should be considered.

Originality/value

In agricultural and food economics, web scraping as a data collection technique has received little attention. This is one of the first articles to address this topic with particular focus on food price analysis.

Details

British Food Journal, vol. 121 no. 12

Type: Research Article

DOI:

ISSN: 0007-070X

Keywords

View access options

Article

Publication date: 14 August 2017

Leveraging cross-media analytics to detect events and mine opinions for emergency management

Wei Xu, Lingyu Liu and Wei Shang

Timely detection of emergency events and effective tracking of corresponding public opinions are critical in emergency management. As media are immediate sources of information on…

HTML

PDF (529 KB)

Downloads

780

Abstract

Purpose

Timely detection of emergency events and effective tracking of corresponding public opinions are critical in emergency management. As media are immediate sources of information on emergencies, the purpose of this paper is to propose cross-media analytics to detect and track emergency events and provide decision support for government and emergency management departments.

Design/methodology/approach

In this paper, a novel emergency event detection and opinion mining method is proposed for emergency management using cross-media analytics. In the proposed approach, an event detection module is constructed to discover emergency events based on cross-media analytics, and after the detected event is confirmed as an emergency event, an opinion mining module is used to analyze public sentiments and then generate public sentiment time series for early warning via a semantic expansion technique.

Findings

Empirical results indicate that a specific emergency can be detected and that public opinion can be tracked effectively and efficiently using cross-media analytics. In addition, the proposed system can be used for decision support and real-time response for government and emergency management departments.

Research limitations/implications

This paper takes full advantage of cross-media information and proposes novel emergency event detection and opinion mining methods for emergency management using cross-media analytics. The empirical analysis results illustrate the efficiency of the proposed method.

Practical implications

The proposed method can be applied for detection of emergency events and tracking of public opinions for emergency decision support and governmental real-time response.

Originality/value

This research work contributes to the design of a decision support system for emergency event detection and opinion mining. In the proposed approaches, emergency events are detected by leveraging cross-media analytics, and public sentiments are measured using an auto-expansion of the domain dictionary in the field of emergency management to eliminate the misclassification of the general dictionary and to make the quantization more accurate.

Details

Online Information Review, vol. 41 no. 4

Type: Research Article

DOI:

ISSN: 1468-4527

Keywords

View access options

Article

Publication date: 18 November 2022

A decade of medical tourism research: looking back to moving forward

Ediansyah, Mts Arief, Mohammad Hamsal and Sri Bramantoro Abdinagoro

This article aims to know the direction of current research based on the previous research in the last ten years (2012–2021).

HTML

PDF (1.9 MB)

Downloads

539

Abstract

Purpose

This article aims to know the direction of current research based on the previous research in the last ten years (2012–2021).

Design/methodology/approach

Text mining was integrated with a network and content analysis as part of the mix methodological approach. The scientific articles, on the other hand, were assembled on Litmaps through web scraping. This process selected 86 articles about medical tourism published between 2012 and 2021. This study employed AntConc, RStudio and Gephi tools for data analysis and visualization.

Findings

A total of 138 articles were identified through Litmaps using web scraping and 86 studies met the criteria. The trend of medical tourism research is a positive sign for tourism and health industries; this is the beginning to recognize the importance of elaborating on these two topics. Several researchers have frequently studied issues of destination, hospital, development, quality, stakeholders, surgery, service, economics and policy. Policymakers must establish a medical tourism ecosystem to accommodate all stakeholders in this industry. This study also recommends focusing on supply and institution for medical tourism future research.

Research limitations/implications

This literature review presents research trends on medical tourism in 2012–2021 based solely on articles available on the Litmaps search engine. If the time span is extended and the sources of articles are expanded there will be more literature available for analysis. The articles obtained are also only articles published in English due to the language limitations of the author.

Practical implications

Policymakers must establish a medical tourism ecosystem to accommodate all stakeholders in this industry. Stakeholders must work together to provide medical tourism package therefore people can get their health services while visiting available tourist areas.

Originality/value

The literary study of medical tourism over 10 years is considered the most recent systematic literature review.

Details

Journal of Hospitality and Tourism Insights, vol. 6 no. 5

Type: Research Article

DOI:

ISSN: 2514-9792

Keywords

View access options

Article

Publication date: 4 April 2016

Predicting online product sales via online reviews, sentiments, and promotion strategies: A big data architecture and neural network approach

Alain Yee Loong Chong, Boying Li, Eric W.T. Ngai, Eugene Ch'ng and Filbert Lee

The purpose of this paper is to investigate if online reviews (e.g. valence and volume), online promotional strategies (e.g. free delivery and discounts) and sentiments from user…

HTML

PDF (809 KB)

Downloads

10000

Abstract

Purpose

The purpose of this paper is to investigate if online reviews (e.g. valence and volume), online promotional strategies (e.g. free delivery and discounts) and sentiments from user reviews can help predict product sales.

Design/methodology/approach

The authors designed a big data architecture and deployed Node.js agents for scraping the Amazon.com pages using asynchronous input/output calls. The completed web crawling and scraping data sets were then preprocessed for sentimental and neural network analysis. The neural network was employed to examine which variables in the study are important predictors of product sales.

Findings

This study found that although online reviews, online promotional strategies and online sentiments can all predict product sales, some variables are more important predictors than others. The authors found that the interplay effects of these variables become more important variables than the individual variables themselves. For example, online volume interactions with sentiments and discounts are more important than the individual predictors of discounts, sentiments or online volume.

Originality/value

This study designed big data architecture, in combination with sentimental and neural network analysis that can facilitate future business research for predicting product sales in an online environment. This study also employed a predictive analytic approach (e.g. neural network) to examine the variables, and this approach is useful for future data analysis in a big data environment where prediction can have more practical implications than significance testing. This study also examined the interplay between online reviews, sentiments and promotional strategies, which up to now have mostly been examined individually in previous studies.

Details

International Journal of Operations & Production Management, vol. 36 no. 4

Type: Research Article

DOI:

ISSN: 0144-3577

Keywords

View access options

Article

Publication date: 28 January 2014

Consuming and producing linked open data: the case of OpenCourseWare

Nelson Piedra, Edmundo Tovar, Ricardo Colomo-Palacios, Jorge Lopez-Vargas and Janneth Alexandra Chicaiza

The aim of this paper is to present an initiative to apply the principles of Linked Data to enhance the search and discovery of OpenCourseWare (OCW) contents created and shared by…

HTML

PDF (423 KB)

Downloads

1252

Abstract

Purpose

The aim of this paper is to present an initiative to apply the principles of Linked Data to enhance the search and discovery of OpenCourseWare (OCW) contents created and shared by the universities.

Design/methodology/approach

This paper is a case study of how linked data technologies can be applied for the enhancement of open learning contents.

Findings

Results presented under the umbrella of OCW-Universia consortium, as the integration and access to content from different repositories OCW and the development of a query method to access these data, reveal that linked data would offer a solution to filter and select semantically those open educational contents, and automatically are linked to the linked open data cloud.

Originality/value

The new OCW-Universia integration with linked data adds new features to the initial framework including improved query mechanisms and interoperability.

Details

Program, vol. 48 no. 1

Type: Research Article

DOI:

ISSN: 0033-0337

Keywords

View access options

Article

Publication date: 1 February 2022

Leveraging knowledge discovery and knowledge visualization to define the “inner areas”: an application to an Italian province

Valentino Moretto, Gianluca Elia and Gianpaolo Ghiani

Starting from a critical analysis of the main criteria currently used to identify marginal areas, this paper aims to propose a new classification model of such territories by…

HTML

PDF (1.8 MB)

Downloads

386

Abstract

Purpose

Starting from a critical analysis of the main criteria currently used to identify marginal areas, this paper aims to propose a new classification model of such territories by leveraging knowledge discovery approaches and knowledge visualization techniques, which represent a fundamental pillar in the knowledge-based urban development process.

Design/methodology/approach

The methodology adopted in this study relies on the design science research, which includes five steps: problem identification, objective definition, solution design and development, demonstration and evaluation.

Findings

Results demonstrate how to exploit knowledge discovery and visualization to obtain multiple mappings of inner areas, in the aim to identify good practices and optimize resources to set up more effective territorial development strategies and plans. The proposed approach overcomes the traditional way adopted to map inner areas that uses a single indicator (i.e. the distance between a municipality and the nearest pole where it is possible to access to education, health and transportation services) and leverages seven groups of indicators that represent the distinguishing features of territories (territorial capital, social costs, citizenship, geo-demography, economy, innovation and sustainable development).

Research limitations/implications

The proposed model could be enriched by new variables, whose value can be collected by official sources and stakeholders engaged to provide both structured and unstructured data. Also, another enhancement could be the development of a cross-algorithms comparison that may reveal useful to suggest which algorithm can better suit the needs of policy makers or practitioners.

Practical implications

This study sets the ground for proposing a decision support tool that policy makers can use to classify in a new way the inner areas, thus overcoming the current approach and leveraging the distinguishing features of territories.

Originality/value

This study shows how the availability of distributed knowledge sources, the modern knowledge management techniques and the emerging digital technologies can provide new opportunities for the governance of a city or territory, thus revitalizing the domain of knowledge-based urban development.

Details

Journal of Knowledge Management, vol. 26 no. 10

Type: Research Article

DOI:

ISSN: 1367-3270

Keywords

View access options

Article

Publication date: 2 July 2018

Illicit Bitcoin transactions: challenges in getting to the who, what, when and where

Angela S.M. Irwin and Adam B. Turner

The purpose of this paper is to highlight the intelligence and investigatory challenges experienced by law enforcement agencies in discovering the identity of illicit Bitcoin…

HTML

PDF (648 KB)

Downloads

2426

Abstract

Purpose

The purpose of this paper is to highlight the intelligence and investigatory challenges experienced by law enforcement agencies in discovering the identity of illicit Bitcoin users and the transactions that they perform. This paper proposes solutions to assist law enforcement agencies in piecing together the disparate and complex technical, behavioural and criminological elements that make up cybercriminal offending.

Design/methodology/approach

A literature review was conducted to highlight the main law enforcement challenges and discussions and examine current discourse in the areas of anonymity and attribution. The paper also looked at other research and projects that aim to identify illicit transactions involving cryptocurrencies and the darknet.

Findings

An optimal solution would be one which has a predictive capability and a machine learning architecture which automatically collects and analyses data from the Bitcoin blockchain and other external data sources and applies search criteria matching, indexing and clustering to identify suspicious behaviours. The implementation of a machine learning architecture would help improve results over time and would be less manpower intensive. Cyber investigators would also receive intelligence in a format and language that they understand and it would allow for intelligence-led and predictive policing rather than reactive policing. The optimal solution would be one which allows for intelligence-led, predictive policing and enables and encourages information sharing between multiple stakeholders from the law enforcement, financial intelligence units, cyber security organisations and fintech industry. This would enable the creation of red flags and behaviour models and the provision of up-to-date intelligence on the threat landscape to form a viable intelligence product for law enforcement agencies so that they can more easily get to the who, what, when and where.

Originality/value

The development of a functional software architecture that, in theory, could be used to detected suspicious illicit transactions on the Bitcoin network.

Details

Journal of Money Laundering Control, vol. 21 no. 3

Type: Research Article

DOI:

ISSN: 1368-5201

Keywords

View access options

Article

Publication date: 12 November 2019

Reputational intelligence: innovating brand management through social media data

Ana-María Casado-Molina, Celia M.Q. Ramos, María-Mercedes Rojas-de-Gracia and José Ignacio Peláez Sánchez

Companies are currently facing the challenge of understanding how their business is affected by the large volume of opinions continually generated by their stakeholders in social…

HTML

PDF (316 KB)

Downloads

706

Abstract

Purpose

Companies are currently facing the challenge of understanding how their business is affected by the large volume of opinions continually generated by their stakeholders in social media regarding their intangible assets (experiences, emotions and attitudes). With this in mind, the purpose of this paper is to present an innovative management model, named E2AB, to measure and analyse reputational intangibles from digital ecosystems and their impacts on tangible assets.

Design/methodology/approach

The methodology applied was big data and business intelligence techniques. These methods were used in the computing process to obtain daily data from every asset guarantees that the model is validated with robust data. This model has been corroborated using data from the banking sector, specifically 402,383 net data inputs from the digital ecosystems.

Findings

This study illustrates the existence of a holistic influence of intangible assets over tangible assets. The findings demonstrate complex relationships between tangible and intangible assets, determined not only by the type of variable but also by its valence and intensity.

Practical implications

These findings may help chief communication officers and general managers a better understanding of how intangible assets extracted from online users’ opinions are related to their organisation’s tangible assets plus a chance to find out about their impact and how to manage them for a practical and agile decision making in real time.

Originality/value

It is a pioneering work in establishing a model, which demonstrates transversal and holistic relationships between relational intangible and tangible assets of firms from digital ecosystems, using business intelligence techniques.

Details

Industrial Management & Data Systems, vol. 120 no. 1

Type: Research Article

DOI:

ISSN: 0263-5577

Keywords

Open Access

Article

Publication date: 22 November 2022

Similarities and differences between absorptive capacity and appropriability: a bibliometric perspective

Sergio David Cuéllar, Maria Teresa Fernandez-Bajón and Felix de Moya-Anegón

This study aimed to examine the similarities and differences between the ability to analyze the environment and exploit new knowledge (absorptive capacity) and the skills to…

HTML

PDF (3.2 MB)

Downloads

1151

Abstract

Purpose

This study aimed to examine the similarities and differences between the ability to analyze the environment and exploit new knowledge (absorptive capacity) and the skills to generate value from innovation (appropriation). These fields have similar origins and are sometimes confused by practitioners and academics.

Design/methodology/approach

A review was conducted based on a full-text analysis of 681 and 431 papers on appropriation and absorptive capacity, respectively, from Scopus, Science Direct and Lens, using methodologies such as text mining, backward citation analysis, modularity clustering and latent Dirichlet allocation analysis.

Findings

In business disciplines, the fields are considered different; however, in other disciplines, it was found that some authors defined them quite similarly. The citation analysis results showed that appropriation was more relevant to absorptive capacity, or vice versa. From the dimension perspective, it was found that although appropriation was considered a relevant element for absorptive capacity, the last models did not include it. Finally, it was found that studies on both topics identified the importance of appropriation and absorptive capacity for innovation performance, knowledge management and technology transfer.

Originality/value

This is one of the first studies to examine in-depth the relationship between appropriation and absorptive capacity, bridging a gap in both fields.

Details

Benchmarking: An International Journal, vol. 31 no. 1

Type: Research Article

DOI:

ISSN: 1463-5771

Keywords

Access

Year

Content type

Article (593)

1 – 10 of 593