Search results
1 – 10 of 623Koraljka Golub, Pawel Michal Ziolkowski and Goran Zlodi
The study aims to paint a representative picture of the current state of search interfaces of Swedish online museum collections, focussing on search functionalities with…
Abstract
Purpose
The study aims to paint a representative picture of the current state of search interfaces of Swedish online museum collections, focussing on search functionalities with particular reference to subject searching, as well as the use of controlled vocabularies, with the purpose of identifying which improvements of the search interfaces are needed to ensure high-quality information retrieval for the end user.
Design/methodology/approach
In the first step, a set of 21 search interface criteria was identified, based on related research and current standards in the domain of cultural heritage knowledge organization. Secondly, a complete set of Swedish museums that provide online access to their collections was identified, comprising nine cross-search services and 91 individual museums' websites. These 100 websites were each evaluated against the 21 criteria, between 1 July and 31 August 2020.
Findings
Although many standards and guidelines are in place to ensure quality-controlled subject indexing, which in turn support information retrieval of relevant resources (as individual or full search results), the study shows that they are not broadly implemented, resulting in information retrieval failures for the end user. The study also demonstrates a strong need for the implementation of controlled vocabularies in these museums.
Originality/value
This study is a rare piece of research which examines subject searching in online museums; the 21 search criteria and their use in the analysis of the complete set of online collections of a country represents a considerable and unique contribution to the fields of knowledge organization and information retrieval of cultural heritage. Its particular value lies in showing how the needs of end users, many of which are documented and reflected in international standards and guidelines, should be taken into account in designing search tools for these museums; especially so in subject searching, which is the most complex and yet the most common type of search. Much effort has been invested into digitizing cultural heritage collections, but access to them is hindered by poor search functionality. This study identifies which are the most important aspects to improve.
Details
Keywords
Wei Xiong, Ziyi Xiong and Tina Tian
The performance of behavioral targeting (BT) mainly relies on the effectiveness of user classification since advertisers always want to target their advertisements to the most…
Abstract
Purpose
The performance of behavioral targeting (BT) mainly relies on the effectiveness of user classification since advertisers always want to target their advertisements to the most relevant users. In this paper, the authors frame the BT as a user classification problem and describe a machine learning–based approach for solving it.
Design/methodology/approach
To perform such a study, two major research questions are investigated: the first question is how to represent a user’s online behavior. A good representation strategy should be able to effectively classify users based on their online activities. The second question is how different representation strategies affect the targeting performance. The authors propose three user behavior representation methods and compare them empirically using the area under the receiver operating characteristic curve (AUC) as a performance measure.
Findings
The experimental results indicate that ad campaign effectiveness can be significantly improved by combining user search queries, clicked URLs and clicked ads as a user profile. In addition, the authors also explore the temporal aspect of user behavior history by investigating the effect of history length on targeting performance. The authors note that an improvement of approximately 6.5% in AUC is achieved when user history is extended from 1 day to 14 days, which is substantial in targeting performance.
Originality/value
This paper confirms the effectiveness of BT on user classification and provides a validation of BT for Internet advertising.
Details
Keywords
Koraljka Golub, Xu Tan, Ying-Hsang Liu and Jukka Tyrkkö
This exploratory study aims to help contribute to the understanding of online information search behaviour of PhD students from different humanities fields, with a focus on…
Abstract
Purpose
This exploratory study aims to help contribute to the understanding of online information search behaviour of PhD students from different humanities fields, with a focus on subject searching.
Design/methodology/approach
The methodology is based on a semi-structured interview within which the participants are asked to conduct both a controlled search task and a free search task. The sample comprises eight PhD students in several humanities disciplines at Linnaeus University, a medium-sized Swedish university from 2020.
Findings
Most humanities PhD students in the study have received training in information searching, but it has been too basic. Most rely on web search engines like Google and Google Scholar for publications' search, and university's discovery system for known-item searching. As these systems do not rely on controlled vocabularies, the participants often struggle with too many retrieved documents that are not relevant. Most only rarely or never use disciplinary bibliographic databases. The controlled search task has shown some benefits of using controlled vocabularies in the disciplinary databases, but incomplete synonym or concept coverage as well as user unfriendly search interface present hindrances.
Originality/value
The paper illuminates an often-forgotten but pervasive challenge of subject searching, especially for humanities researchers. It demonstrates difficulties and shows how most PhD students have missed finding an important resource in their research. It calls for the need to reconsider training in information searching and the need to make use of controlled vocabularies implemented in various search systems with usable search and browse user interfaces.
Details
Keywords
Dinda Thalia Andariesta and Meditya Wasesa
This research presents machine learning models for predicting international tourist arrivals in Indonesia during the COVID-19 pandemic using multisource Internet data.
Abstract
Purpose
This research presents machine learning models for predicting international tourist arrivals in Indonesia during the COVID-19 pandemic using multisource Internet data.
Design/methodology/approach
To develop the prediction models, this research utilizes multisource Internet data from TripAdvisor travel forum and Google Trends. Temporal factors, posts and comments, search queries index and previous tourist arrivals records are set as predictors. Four sets of predictors and three distinct data compositions were utilized for training the machine learning models, namely artificial neural networks (ANNs), support vector regression (SVR) and random forest (RF). To evaluate the models, this research uses three accuracy metrics, namely root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE).
Findings
Prediction models trained using multisource Internet data predictors have better accuracy than those trained using single-source Internet data or other predictors. In addition, using more training sets that cover the phenomenon of interest, such as COVID-19, will enhance the prediction model's learning process and accuracy. The experiments show that the RF models have better prediction accuracy than the ANN and SVR models.
Originality/value
First, this study pioneers the practice of a multisource Internet data approach in predicting tourist arrivals amid the unprecedented COVID-19 pandemic. Second, the use of multisource Internet data to improve prediction performance is validated with real empirical data. Finally, this is one of the few papers to provide perspectives on the current dynamics of Indonesia's tourism demand.
Details
Keywords
Mari Vallez, Carlos Lopezosa and Rafael Pedraza-Jiménez
Universities play an important role in the promotion and implementation of the 2030 Agenda for Sustainable Development. This study aims to examine the visibility of information…
Abstract
Purpose
Universities play an important role in the promotion and implementation of the 2030 Agenda for Sustainable Development. This study aims to examine the visibility of information about the Sustainable Development Goals (SDGs) on the websites of Spanish and major international universities, by means of a quantitative and qualitative analysis with an online visibility management platform that makes use of big data technology.
Design/methodology/approach
The Web visibility of the universities studied in relation to the terms “SDG”, “Sustainable Development Goals” and “2030 Agenda” was determined using the SEMrush tool. Information was obtained on the number of web pages accessed and the queries formulated (query expansion). The content indexed by Google for these universities was compiled, and finally, the search engine optimization (SEO) factors applicable to the websites with the highest Web visibility were identified.
Findings
The universities analysed are content creators but do not have very high Web visibility in Web searches for information on the SDGs. Of the 98 universities analysed, only four feature prominently in search results.
Originality/value
Although research exists on the application of SEO to different areas, there have not, to date, been any studies examining the Web visibility of universities in relation to Web searches for information on the 2030 Agenda. The main contributions of this study are the global perspective it provides on the Web visibility of content produced by universities about the SDGs and the recommendations it offers for improving that visibility.
Details
Keywords
Kimmo Kettunen, Heikki Keskustalo, Sanna Kumpulainen, Tuula Pääkkönen and Juha Rautiainen
This study aims to identify user perception of different qualities of optical character recognition (OCR) in texts. The purpose of this paper is to study the effect of different…
Abstract
Purpose
This study aims to identify user perception of different qualities of optical character recognition (OCR) in texts. The purpose of this paper is to study the effect of different quality OCR on users' subjective perception through an interactive information retrieval task with a collection of one digitized historical Finnish newspaper.
Design/methodology/approach
This study is based on the simulated work task model used in interactive information retrieval. Thirty-two users made searches to an article collection of Finnish newspaper Uusi Suometar 1869–1918 which consists of ca. 1.45 million autosegmented articles. The article search database had two versions of each article with different quality OCR. Each user performed six pre-formulated and six self-formulated short queries and evaluated subjectively the top 10 results using a graded relevance scale of 0–3. Users were not informed about the OCR quality differences of the otherwise identical articles.
Findings
The main result of the study is that improved OCR quality affects subjective user perception of historical newspaper articles positively: higher relevance scores are given to better-quality texts.
Originality/value
To the best of the authors’ knowledge, this simulated interactive work task experiment is the first one showing empirically that users' subjective relevance assessments are affected by a change in the quality of an optically read text.
Details
Keywords
The author investigates whether investors’ online information demand measured by Google search query and the changes in the numbers of Wikipedia page view can explain and predict…
Abstract
Purpose
The author investigates whether investors’ online information demand measured by Google search query and the changes in the numbers of Wikipedia page view can explain and predict stock return, trading volume and volatility dynamics of companies listed on the Nigerian Stock Exchange.
Design/methodology/approach
The multiple regression model which encompasses both the univariate and multivariate regression framework was employed as the research methodology. As part of our pre-analysis, we test for multicollinearity and applied the Wu/Hausman specification test to detect whether endogeneity exist in the regression model.
Findings
We provide novel and robust evidence that Google searches neither explain the contemporaneous nor predict stock return, trading volume and volatility dynamics. Similarly, results also indicate that trading volume and volatility dynamics have no relationship with changes in the numbers of Wikipedia pages view related to stock activities.
Originality/value
This study opens new strand of empirical literature of “investors' attention” in the context of African stock markets as empirical evidence. No evidence from previous studies on investors' attention exist, whether in Google search query or Wikipedia page view, with respect to African stock markets, particularly the Nigerian stock market. This study seeks to bridge these knowledge gaps by examining these relations.
Details
Keywords
Dušan Mladenović, Anida Rajapakse, Nikola Kožuljević and Yupal Shukla
Given that online search visibility is influenced by various determinants, and that influence may vary across industries, this study aims in investigating the major predictors of…
Abstract
Purpose
Given that online search visibility is influenced by various determinants, and that influence may vary across industries, this study aims in investigating the major predictors of online search visibility in the context of blood banks.
Design/methodology/approach
To formalize the online visibility, the authors have found theoretical foundations in activity theory, while to quantify online visiblity the authors have used the search engine optimization (SEO) Index, ranking, and a number of visitors. The examined model includes ten hypotheses and was tested on data from 57 blood banks.
Findings
Results challenge shallow domain knowledge. The major predictors of online search visibility are Alternative Text Attribute (ALT) text, backlinks, robots, domain authority (DA) and bounce rate (BR). The issues are related to the number of backlinks, social score, and DA. Polarized utilization of SEO techniques is evident.
Practical implications
The methodology can be used to analyze the online search visibility of other industries or similar not-for-profit organizations. Findings in terms of individual predictors can be useful for marketers to better manage online search visibility.
Social implications
The acute blood donation problems may be to a certain degree level as the information flow between donors and blood banks will be facilitated.
Originality/value
This is the first study to analyze the blood bank context. The results provide invaluable inputs to marketers, managers, and policymakers.
Details