Search results
1 – 10 of 441This paper aims to give an overview of the history and evolution of commercial search engines. It traces the development of search engines from their early days to their current…
Abstract
Purpose
This paper aims to give an overview of the history and evolution of commercial search engines. It traces the development of search engines from their early days to their current form as complex technology-powered systems that offer a wide range of features and services.
Design/methodology/approach
In recent years, advancements in artificial intelligence (AI) technology have led to the development of AI-powered chat services. This study explores official announcements and releases of three major search engines, Google, Bing and Baidu, of AI-powered chat services.
Findings
Three major players in the search engine market, Google, Microsoft and Baidu started to integrate AI chat into their search results. Google has released Bard, later upgraded to Gemini, a LaMDA-powered conversational AI service. Microsoft has launched Bing Chat, renamed later to Copilot, a GPT-powered by OpenAI search engine. The largest search engine in China, Baidu, released a similar service called Ernie. There are also new AI-based search engines, which are briefly described.
Originality/value
This paper discusses the strengths and weaknesses of the traditional – algorithmic powered search engines and modern search with generative AI support, and the possibilities of merging them into one service. This study stresses the types of inquiries provided to search engines, users’ habits of using search engines and the technological advantage of search engine infrastructure.
Details
Keywords
Artur Strzelecki and Andrej Miklosik
The landscape of search engine usage has evolved since the last known data were used to calculate click-through rate (CTR) values. The objective was to provide a replicable method…
Abstract
Purpose
The landscape of search engine usage has evolved since the last known data were used to calculate click-through rate (CTR) values. The objective was to provide a replicable method for accessing data from the Google search engine using programmatic access and calculating CTR values from the retrieved data to show how the CTRs have changed since the last studies were published.
Design/methodology/approach
In this study, the authors present the estimated CTR values in organic search results based on actual clicks and impressions data, and establish a protocol for collecting this data using Google programmatic access. For this study, the authors collected data on 416,386 clicks, 31,648,226 impressions and 8,861,416 daily queries.
Findings
The results show that CTRs have decreased from previously reported values in both academic research and industry benchmarks. The estimates indicate that the top-ranked result in Google's organic search results features a CTR of 9.28%, followed by 5.82 and 3.11% for positions two and three, respectively. The authors also demonstrate that CTRs vary across various types of devices. On desktop devices, the CTR decreases steadily with each lower ranking position. On smartphones, the CTR starts high but decreases rapidly, with an unprecedented increase from position 13 onwards. Tablets have the lowest and most variable CTR values.
Practical implications
The theoretical implications include the generation of a current dataset on search engine results and user behavior, made available to the research community, creation of a unique methodology for generating new datasets and presenting the updated information on CTR trends. The managerial implications include the establishment of the need for businesses to focus on optimizing other forms of Google search results in addition to organic text results, and the possibility of application of this study's methodology to determine CTRs for their own websites.
Originality/value
This study provides a novel method to access real CTR data and estimates current CTRs for top organic Google search results, categorized by device.
Details
Keywords
Andreas Skalkos, Aggeliki Tsohou, Maria Karyda and Spyros Kokolakis
Search engines, the most popular online services, are associated with several concerns. Users are concerned about the unauthorized processing of their personal data, as well as…
Abstract
Purpose
Search engines, the most popular online services, are associated with several concerns. Users are concerned about the unauthorized processing of their personal data, as well as about search engines keeping track of their search preferences. Various search engines have been introduced to address these concerns, claiming that they protect users’ privacy. The authors call these search engines privacy-preserving search engines (PPSEs). This paper aims to investigate the factors that motivate search engine users to use PPSEs.
Design/methodology/approach
This study adopted protection motivation theory (PMT) and associated its constructs with subjective norms to build a comprehensive research model. The authors tested the research model using survey data from 830 search engine users worldwide.
Findings
The results confirm the interpretive power of PMT in privacy-related decision-making and show that users are more inclined to take protective measures when they consider that data abuse is a more severe risk and that they are more vulnerable to data abuse. Furthermore, the results highlight the importance of subjective norms in predicting and determining PPSE use. Because subjective norms refer to perceived social influences from important others to engage or refrain from protective behavior, the authors reveal that the recommendation from people that users consider important motivates them to take protective measures and use PPSE.
Research limitations/implications
Despite its interesting results, this research also has some limitations. First, because the survey was conducted online, the study environment was less controlled. Participants may have been disrupted or affected, for example, by the presence of others or background noise during the session. Second, some of the survey items could possibly be misinterpreted by the respondents in the study questionnaire, as they did not have access to clarifications that a researcher could possibly provide. Third, another limitation refers to the use of the Amazon Turk tool. According Paolacci and Chandler (2014) in comparison to the US population, the MTurk workers are more educated, younger and less religiously and politically diverse. Fourth, another limitation of this study could be that Actual Use of PPSE is self-reported by the participants. This could cause bias because it is argued that internet users’ statements may be in contrast with their actions in real life or in an experimental scenario (Berendt et al., 2005, Jensen et al., 2005); Moreover, some limitations of this study emerge from the use of PMT as the background theory of the study. PMT identifies the main factors that affect protection motivation, but other environmental and cognitive factors can also have a significant role in determining the way an individual’s attitude is formed. As Rogers (1975) argued, PMT as proposed does not attempt to specify all of the possible factors in a fear appeal that may affect persuasion, but rather a systematic exposition of a limited set of components and cognitive mediational processes that may account for a significant portion of the variance in acceptance by users. In addition, as Tanner et al. (1991) argue, the ‘PMT’s assumption that the subjects have not already developed a coping mechanism is one of its limitations. Finally, another limitation is that the sample does not include users from China, which is the second most populated country. Unfortunately, DuckDuckGo has been blocked in China, so it has not been feasible to include users from China in this study.
Practical implications
The proposed model and, specifically, the subjective norms construct proved to be successful in predicting PPSE use. This study demonstrates the need for PPSE to exhibit and advertise the technology and measures they use to protect users’ privacy. This will contribute to the effort to persuade internet users to use these tools.
Social implications
This study sought to explore the privacy attitudes of search engine users using PMT and its constructs’ association with subjective norms. It used the PMT to elucidate users’ perceptions that motivate them to privacy adoption behavior, as well as how these perceptions influence the type of search engine they use. This research is a first step toward gaining a better understanding of the processes that drive people’s motivation to, or not to, protect their privacy online by means of using PPSE. At the same time, this study contributes to search engine vendors by revealing that users’ need to be persuaded not only about their policy toward privacy but also by considering and implementing new strategies of diffusion that could enhance the use of the PPSE.
Originality/value
This research is a first step toward gaining a better understanding of the processes that drive people’s motivation to, or not to, protect their privacy online by means of using PPSEs.
Details
Keywords
Dinda Thalia Andariesta and Meditya Wasesa
This research presents machine learning models for predicting international tourist arrivals in Indonesia during the COVID-19 pandemic using multisource Internet data.
Abstract
Purpose
This research presents machine learning models for predicting international tourist arrivals in Indonesia during the COVID-19 pandemic using multisource Internet data.
Design/methodology/approach
To develop the prediction models, this research utilizes multisource Internet data from TripAdvisor travel forum and Google Trends. Temporal factors, posts and comments, search queries index and previous tourist arrivals records are set as predictors. Four sets of predictors and three distinct data compositions were utilized for training the machine learning models, namely artificial neural networks (ANNs), support vector regression (SVR) and random forest (RF). To evaluate the models, this research uses three accuracy metrics, namely root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE).
Findings
Prediction models trained using multisource Internet data predictors have better accuracy than those trained using single-source Internet data or other predictors. In addition, using more training sets that cover the phenomenon of interest, such as COVID-19, will enhance the prediction model's learning process and accuracy. The experiments show that the RF models have better prediction accuracy than the ANN and SVR models.
Originality/value
First, this study pioneers the practice of a multisource Internet data approach in predicting tourist arrivals amid the unprecedented COVID-19 pandemic. Second, the use of multisource Internet data to improve prediction performance is validated with real empirical data. Finally, this is one of the few papers to provide perspectives on the current dynamics of Indonesia's tourism demand.
Details
Keywords
Hui-Min Lai, Shin-Yuan Hung and David C. Yen
Seekers who visit professional virtual communities (PVCs) are usually motivated by knowledge-seeking, which is a complex cognitive process. How do seekers search for knowledge…
Abstract
Purpose
Seekers who visit professional virtual communities (PVCs) are usually motivated by knowledge-seeking, which is a complex cognitive process. How do seekers search for knowledge, and how is their search linked to prior knowledge or PVC situation factors? From the cognitive process and interactional psychology perspectives, this study investigated the three-way interactions between seekers’ expertise, task complexity, and perceptions of PVC features (i.e. knowledge quality and system quality) on knowledge-seeking strategies and resultant outcomes.
Design/methodology/approach
A field experiment was conducted with 119 seekers in a PVC using a 2 × 2 factorial design of seekers’ expertise (i.e. expert versus novice) and task complexity (i.e. low versus high).
Findings
The study reveals three significant insights: (1) For a high-complexity task, experts adopt an ask-directed searching strategy compared to novices, whereas novices adopt a browsing strategy; (2) For a high-complexity task, experts who perceive a high system quality are more likely than novices to adopt an ask-directed searching strategy; and (3) Task completion time and task quality are associated with the adoption of ask-directed searching strategies, whereas knowledge seekers’ satisfaction is more associated with the adoption of browsing strategy.
Originality/value
We draw on the perspectives of cognitive process and interactional psychology to explore potential two- and three-way interactions of seekers’ expertise, task complexity, and PVC features on the adoption of knowledge-seeking strategies in a PVC context. Our findings provide deep insights into seekers’ behavior in a PVC, given the popularity of the search for knowledge in PVCs.
Details
Keywords
Morteza Mohammadi Ostani, Jafar Ebadollah Amoughin and Mohadeseh Jalili Manaf
This study aims to adjust Thesis-type properties on Schema.org using metadata models and standards (MS) (Bibframe, electronic thesis and dissertations [ETD]-MS, Common European…
Abstract
Purpose
This study aims to adjust Thesis-type properties on Schema.org using metadata models and standards (MS) (Bibframe, electronic thesis and dissertations [ETD]-MS, Common European Research Information Format [CERIF] and Dublin Core [DC]) to enrich the Thesis-type properties for better description and processing on the Web.
Design/methodology/approach
This study is applied, descriptive analysis in nature and is based on content analysis in terms of method. The research population consisted of elements and attributes of the metadata model and standards (Bibframe, ETD-MS, CERIF and DC) and Thesis-type properties in the Schema.org. The data collection tool was a researcher-made checklist, and the data collection method was structured observation.
Findings
The results show that the 65 Thesis-type properties and the two levels of Thing and CreativeWork as its parents on Schema.org that corresponds to the elements and attributes of related models and standards. In addition, 12 properties are special to the Thesis type for better comprehensive description and processing, and 27 properties are added to the CreativeWork type.
Practical implications
Enrichment and expansion of Thesis-type properties on Schema.org is one of the practical applications of the present study, which have enabled more comprehensive description and processing and increased access points and visibility for ETDs in the environment Web and digital libraries.
Originality/value
This study has offered some new Thesis type properties and CreativeWork levels on Schema.org. To the best of the authors’ knowledge, this is the first time this issue is investigated.
Details
Keywords
Giovanni De Luca and Monica Rosciano
The tourist industry has to adopt a big data-driven foresight approach to enhance decision-making in a post-COVID international landscape still marked by significant uncertainty…
Abstract
Purpose
The tourist industry has to adopt a big data-driven foresight approach to enhance decision-making in a post-COVID international landscape still marked by significant uncertainty and in which some megatrends have the potential to reshape society in the next decades. This paper, considering the opportunity offered by the application of the quantitative analysis on internet new data sources, proposes a prediction method using Google Trends data based on an estimated transfer function model.
Design/methodology/approach
The paper uses the time-series methods to model and predict Google Trends data. A transfer function model is used to transform the prediction of Google Trends data into predictions of tourist arrivals. It predicts the United States tourism demand in Italy.
Findings
The results highlight the potential expressed by the use of big data-driven foresight approach. Applying a transfer function model on internet search data, timely forecasts of tourism flows are obtained. The two scenarios emerged can be used in tourism stakeholders’ decision-making process. In a future perspective, the methodological path could be applied to other tourism origin markets, to other internet search engine or other socioeconomic and environmental contexts.
Originality/value
The study raises awareness of foresight literacy in the tourism sector. Secondly, it complements the research on tourism demand forecasting by evaluating the performance of quantitative forecasting techniques on new data sources. Thirdly, it is the first paper that makes the United States arrival predictions in Italy. Finally, the findings provide immediate valuable information to tourism stakeholders that could be used to make decisions.
Details
Keywords
Pia Borlund, Nils Pharo and Ying-Hsang Liu
The PICCH research project contributes to opening a dialogue between cultural heritage archives and users. Hence, the users are identified and their information needs, the search…
Abstract
Purpose
The PICCH research project contributes to opening a dialogue between cultural heritage archives and users. Hence, the users are identified and their information needs, the search strategies they apply and the search challenges they experience are uncovered.
Design/methodology/approach
A combination of questionnaires and interviews is used for collection of data. Questionnaire data were collected from users of three different audiovisual archives. Semi-structured interviews were conducted with two user groups: (1) scholars searching information for research projects and (2) archivists who perform their own scholarly work and search information on behalf of others.
Findings
The questionnaire results show that the archive users mainly have an academic background. Hence, scholars and archivists constitute the target group for in-depth interviews. The interviews reveal that their information needs are multi-faceted and match the information need typology by Ingwersen. The scholars mainly apply collection-specific search strategies but have in common primarily doing keyword searching, which they typically plan in advance. The archivists do less planning owing to their knowledge of the collections. All interviewees demonstrate domain knowledge, archival intelligence and artefactual literacy in their use and mastering of the archives. The search challenges they experience can be characterised as search system complexity challenges, material challenges and metadata challenges.
Originality/value
The paper provides a rare insight into the complexity of the search situation of cultural heritage archives, and the users’ multi-facetted information needs and hence contributes to the dialogue between the archives and the users.
Details
Keywords
Nedra Ibrahim, Anja Habacha Chaibi and Henda Ben Ghézala
Given the magnitude of the literature, a researcher must be selective of research papers and publications in general. In other words, only papers that meet strict standards of…
Abstract
Purpose
Given the magnitude of the literature, a researcher must be selective of research papers and publications in general. In other words, only papers that meet strict standards of academic integrity and adhere to reliable and credible sources should be referenced. The purpose of this paper is to approach this issue from the prism of scientometrics according to the following research questions: Is it necessary to judge the quality of scientific production? How do we evaluate scientific production? What are the tools to be used in evaluation?
Design/methodology/approach
This paper presents a comparative study of scientometric evaluation practices and tools. A systematic literature review is conducted based on articles published in the field of scientometrics between 1951 and 2022. To analyze data, the authors performed three different aspects of analysis: usage analysis based on classification and comparison between the different scientific evaluation practices, type and level analysis based on classifying different scientometric indicators according to their types and application levels and similarity analysis based on studying the correlation between different quantitative metrics to identify similarity between them.
Findings
This comparative study leads to classify different scientific evaluation practices into externalist and internalist approaches. The authors categorized the different quantitative metrics according to their types (impact, production and composite indicators), their levels of application (micro, meso and macro) and their use (internalist and externalist). Moreover, the similarity analysis has revealed a high correlation between several scientometric indicators such as author h-index, author publications, citations and journal citations.
Originality/value
The interest in this study lies deeply in identifying the strengths and weaknesses of research groups and guides their actions. This evaluation contributes to the advancement of scientific research and to the motivation of researchers. Moreover, this paper can be applied as a complete in-depth guide to help new researchers select appropriate measurements to evaluate scientific production. The selection of evaluation measures is made according to their types, usage and levels of application. Furthermore, our analysis shows the similarity between the different indicators which can limit the overuse of similar measures.
Details
Keywords
Zhixue Liao, Xinyu Gou, Qiang Wei and Zhibin Xing
Online reviews serve as valuable sources of information, reflecting tourists’ attentions, preferences and sentiments. However, although the existing research has demonstrated that…
Abstract
Purpose
Online reviews serve as valuable sources of information, reflecting tourists’ attentions, preferences and sentiments. However, although the existing research has demonstrated that incorporating online review data can enhance the performance of tourism demand forecasting models, the reliability of online review data and consumers’ decision-making process have not been given adequate attention. To address the aforementioned problem, the purpose of this study is to forecast tourism demand using online review data derived from the analysis of review helpfulness.
Design/methodology/approach
The authors propose a novel “identification-first, forecasting-second” framework. This framework prioritizes the identification of helpful reviews through a comprehensive analysis of review helpfulness, followed by the integration of helpful online review data into the forecasting system. Using the SARIMAX model with helpful online review data sourced from TripAdvisor, this study forecasts tourist arrivals in Hong Kong during the period from August 2012 to June 2019. The SNAÏVE/SARIMA model was used as the benchmark model. Additionally, artificial intelligence models including long short-term memory, back propagation neural network, extreme learning machine and random forest models were used to assess the robustness of the results.
Findings
The results demonstrate that online review data are subject to noise and bias, which can adversely affect the accuracy of predictions when used directly. However, by identifying helpful online reviews beforehand and incorporating them into the forecasting process, a notable enhancement in predictive performance can be realized.
Originality/value
First, to the best of the authors’ knowledge, this study is one of the first to focus on the data issue of online reviews on tourism arrivals forecasting. Second, this study pioneers the integration of the consumer decision-making process into the domain of tourism demand forecasting, marking one of the earliest endeavors in this area. Third, this study makes a novel attempt to identify helpful online reviews based on reviews helpfulness analysis.
Details