Search results

1 – 10 of 403
Article
Publication date: 5 June 2024

Azanzi Jiomekong and Sanju Tiwari

This paper aims to curate open research knowledge graph (ORKG) with papers related to ontology learning and define an approach using ORKG as a computer-assisted tool to organize…

Abstract

Purpose

This paper aims to curate open research knowledge graph (ORKG) with papers related to ontology learning and define an approach using ORKG as a computer-assisted tool to organize key-insights extracted from research papers.

Design/methodology/approach

Action research was used to explore, test and evaluate the use of the Open Research Knowledge Graph as a computer assistant tool for knowledge acquisition from scientific papers.

Findings

To extract, structure and describe research contributions, the granularity of information should be decided; to facilitate the comparison of scientific papers, one should design a common template that will be used to describe the state of the art of a domain.

Originality/value

This approach is currently used to document “food information engineering,” “tabular data to knowledge graph matching” and “question answering” research problems and the “neurosymbolic AI” domain. More than 200 papers are ingested in ORKG. From these papers, more than 800 contributions are documented and these contributions are used to build over 100 comparison tables. At the end of this work, we found that ORKG is a valuable tool that can reduce the working curve of state-of-the-art research.

Article
Publication date: 29 May 2024

Lino Gonzalez-Garcia, Gema González-Carreño, Ana María Rivas Machota and Juan Padilla Fernández-Vega

Knowledge graphs (KGs) are structured knowledge bases that represent real-world entities and are used in a variety of applications. Many of them are created and curated from a…

Abstract

Purpose

Knowledge graphs (KGs) are structured knowledge bases that represent real-world entities and are used in a variety of applications. Many of them are created and curated from a combination of automated and manual processes. Microdata embedded in Web pages for purposes of facilitating indexing and search engine optimization are a potential source to augment KGs under some assumptions of complementarity and quality that have not been thoroughly explored to date. In that direction, this paper aims to report results on a study that evaluates the potential of using microdata extracted from the Web to augment the large, open and manually curated Wikidata KG for the domain of touristic information. As large corpora of Web text is currently being leveraged via large language models (LLMs), these are used to compare the effectiveness of the microdata enhancement method.

Design/methodology/approach

The Schema.org taxonomy was used as the source to determine the annotation types to be collected. Here, the authors focused on tourism-related pages as a case study, selecting the relevant Schema.org concepts as point of departure. The large CommonCrawl resource was used to select those annotations from a large recent sample of the World Wide Web. The extracted annotations were processed and matched with Wikidata to estimate the degree to which microdata produced for SEO might become a valuable resource to complement KGs or vice versa. The Web pages themselves can also serve as a context to produce additional metadata elements using them as context in pipelines of an existing LLMs. That way, both the annotations and the contents itself can be used as sources.

Findings

The samples extracted revealed a concentration of metadata annotations in only a few of the relevant Schema.org attributes and also revealed the possible influence of authoring tools in a significant fraction of microdata produced. The analysis of the overlapping of attributes in the sample with those of Wikidata showed the potential of the technique, limited by the disbalance of the presence of attributes. The combination of those with the use of LLMs to produce additional annotations demonstrates the feasibility of the approach in the population of existing Wikidata locations. However, in both cases, the effectiveness appears to be lower in the cases of less content in the KG, which are arguably the most relevant when considering the scenario of an automated population approach.

Originality/value

The research reports novel empirical findings on the way touristic annotations with a SEO orientation are being produced in the wild and provides an assessment of their potential to complement KGs, or reuse information from those graphs. It also provides insights on the potential of using LLMs for the task.

Details

The Electronic Library , vol. 42 no. 3
Type: Research Article
ISSN: 0264-0473

Keywords

Article
Publication date: 19 July 2024

Kuoyi Lin, Xiaoyang Kan and Meilian Liu

This study develops and validates an innovative approach for extracting knowledge from online user reviews by integrating textual content and emojis. Recognizing the pivotal role…

Abstract

Purpose

This study develops and validates an innovative approach for extracting knowledge from online user reviews by integrating textual content and emojis. Recognizing the pivotal role emojis play in enhancing the expressiveness and emotional depth of digital communication, this study aims to address the significant gap in existing sentiment analysis models, which have largely overlooked the contribution of emojis in interpreting user preferences and sentiments. By constructing a comprehensive model that synergizes emotional and semantic information conveyed through emojis and text, this study seeks to provide a more nuanced understanding of user preferences, thereby enhancing the accuracy and depth of knowledge extraction from online reviews. The goal is to offer a robust framework that enables more effective and empathetic engagement with user-generated content on digital platforms, paving the way for improved service delivery, product development and customer satisfaction through informed insights into consumer behavior and sentiments.

Design/methodology/approach

This study uses a structured methodology to integrate and analyze text and emojis from online reviews for effective knowledge extraction, focusing on user preferences and sentiments. This methodology consists of four key stages. First, this study leverages high-frequency noun analysis to identify and extract product attributes mentioned in online user reviews. By focusing on nouns that appear frequently, the authors can systematically discern the primary features or aspects of products that users discuss, thereby providing a foundation for a more detailed sentiment and preference analysis. Second, a foundational sentiment dictionary is established that incorporates sentiment-bearing words, intensifiers and negation terms to analyze the textual part of the reviews. This dictionary is used to assign sentiment scores to phrases and sentences within reviews, allowing the quantification of textual sentiments based on the presence and combination of these predefined lexical items. Third, an emoticon sentiment dictionary is developed to address the emotional content conveyed through emojis. This dictionary categorizes emojis based on their associated sentiments, thus enabling the quantification of emotional expressions in reviews. The sentiment scores derived from the emojis are then integrated with those from the textual analysis. This integration considers the weights of text- and emoji-based emotions to compute a comprehensive attribute sentiment score that reflects a nuanced understanding of user sentiments and preferences. Finally, the authors conduct an empirical study to validate the effectiveness of the proposed methodology in mining user preferences from online reviews by applying the approach to a data set of online reviews and evaluating its ability to accurately identify product attributes and user sentiments. The validation process assessed the reliability and accuracy of the methodology in extracting meaningful insights from the complex interplay between text and emojis. This study offers a holistic and nuanced framework for knowledge extraction from online reviews, capturing both explicit and implicit sentiments expressed by users through text and emojis. By integrating these elements, this study seeks to provide a comprehensive understanding of user preferences, contributing to improved consumer insight and strategic decision-making for businesses and researchers.

Findings

The application of the proposed methodology for integrating emojis with text in online reviews yields significant findings that underscore the feasibility and value of extracting realistic user knowledge to gain insights from user-generated content. The analysis successfully captured consumer preferences, which are instrumental in informing service decisions and driving innovation. This achievement is largely attributed to the development and utilization of a comprehensive emotion-sentiment dictionary tailored to interpret the complex interplay between textual and emoji-based expressions in online reviews. By implementing a sentiment calculation model that intricately combines textual sentiment analysis with emoji sentiment analysis, this study was able to accurately determine the final attribute emotion for various product features discussed in the reviews. This model effectively characterized the emotional knowledge of online users and provided a nuanced understanding of their sentiments and preferences. The emotional knowledge extracted is not only quantifiable but also rich in context, offering deeper insights into consumer behavior and attitudes. Furthermore, a case analysis is conducted to rigorously test the validity of the proposed model in a real-world scenario. This practical examination revealed that the model is not only capable of accurately extracting and analyzing user preferences but is also adaptable to different contexts and product categories. The case analysis highlights the robustness and flexibility of the model, demonstrating its potential to enhance the precision of knowledge extraction processes significantly. Overall, the results confirm the effectiveness of the proposed approach in integrating text and emojis for comprehensive knowledge extraction from online reviews. The findings validate the model’s capability to offer actionable insights into consumer preferences, thereby supporting more informed and strategic decision-making by businesses. This study contributes to the broader field of sentiment analysis by showcasing the untapped potential of emojis as valuable indicators of user sentiments, opening new avenues for research and applications in digital marketing and consumer behavior analysis.

Originality/value

This study introduces a pioneering approach to extract knowledge from Web user interactions, notably through the integration of online reviews that incorporate both textual content and emoticons. This innovative methodology stands out because it holistically considers the dual channels of communication, text and emojis, to comprehensively mine Web user preferences. The key contribution of this study lies in its novel insights into the extraction of consumer preferences, advancing beyond traditional text-based analysis to embrace nuanced expressions conveyed through emoticons. The originality of this study is underpinned by its acknowledgment of emoticons as a significant and untapped source of sentiment and preference indicators in online reviews. By effectively merging emoticon analysis and emoji emotion scoring with textual sentiment analysis, this study enriches the understanding of Web user preferences and enhances the accuracy and depth of consumer preference insights. This dual-analysis approach represents a significant leap forward in sentiment analysis, setting a new standard for how digital communication can be leveraged to derive meaningful insights into consumer behavior. Furthermore, the results have practical implications to businesses and marketers. The insights gained from this integrated analytical approach offer a more granular and emotionally nuanced view of customer feedback, which can inform more effective marketing strategies, product development and customer service practices. By pioneering this comprehensive method of knowledge extraction, this study paves the way for future research and practice to interpret and respond more accurately to the complex landscape of online consumer expressions. This study’s originality and value lie in its innovative method of capturing and analyzing the rich tapestry of Web user communication, offering a ground-breaking perspective on consumer preference extraction that promises to enhance both academic research and practical applications in the digital era.

Details

Journal of Knowledge Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1367-3270

Keywords

Article
Publication date: 8 December 2022

Khurram Shahzad and Shakeel Ahmad Khan

This study aims to investigate the current practices being implemented against the dissemination of fake online news, identify the relationship of new media literacy (NML) with…

Abstract

Purpose

This study aims to investigate the current practices being implemented against the dissemination of fake online news, identify the relationship of new media literacy (NML) with fake news epidemic control and find out the challenges in identifying valid sources of information.

Design/methodology/approach

To accomplish constructed objectives of this study, a systematic literature review (SLR) was conducted. The authors carried out the “Preferred Reporting Items for the Systematic Review and Meta-analysis” guidelines as a research methodology. The data were retrieved from ten world’s leading digital databases and online tools. A total of 25 key studies published in impact factor (IF) journals were included for systematic review vis-à-vis standard approaches.

Findings

This study revealed trending practices to control fake news consisted of critical information literacy, civic education, new thinking patterns, fact-checkers, automatic fake news detection tools, employment of ethical norms and deep learning via neural networks. Results of the synthesized studies revealed that media literacy, web literacy, digital literation, social media literacy skills and NML assisted acted as frontline soldiers in combating the fake news war. The findings of this research also exhibited different challenges to control fake news perils.

Research limitations/implications

This study provides pertinent theoretical contributions in the body of existing knowledge through the addition of valuable literature by conducting in-depth systematic review of 25 IF articles on a need-based topic.

Practical implications

This scholarly contribution is fruitful and practically productive for the policymakers belonging to different spectrums to effectively control web-based fake news epidemic.

Social implications

This intellectual piece is a benchmark to address fake news calamities to save the social system and to educate citizens from harms of false online stories on social networking websites.

Originality/value

This study vivifies new vistas via a reinvigorated outlook to address fake news perils embedded in dynamic, rigorous and heuristic strategies for redefining a predetermined set of social values.

Details

Global Knowledge, Memory and Communication, vol. 73 no. 6/7
Type: Research Article
ISSN: 2514-9342

Keywords

Open Access
Article
Publication date: 5 August 2024

James Christopher Westland and Jian Mou

Internet search is a $120bn business that answers lists of search terms or keywords with relevant links to Internet webpages. Only a few companies have sufficient scale to compete…

Abstract

Purpose

Internet search is a $120bn business that answers lists of search terms or keywords with relevant links to Internet webpages. Only a few companies have sufficient scale to compete and thus economics of the process are paramount. This study aims to develop a detailed industry-specific modeling of the economics of internet search.

Design/methodology/approach

The current research develops a stochastic model of the process of Internet indexing, search and retrieval in order to predict expected costs and revenues of particular configurations and usages.

Findings

The models define behavior and economics of parameters that are not directly observable, where it is difficult to empirically determine the distributions and economics.

Originality/value

The model may be used to guide the economics of large search engine operations, including the advertising platforms that depend on them and largely fund them.

Details

Journal of Electronic Business & Digital Economics, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2754-4214

Keywords

Article
Publication date: 14 August 2024

Simon Knight, Isabella Bowdler, Heather Ford and Jianlong Zhou

Informational conflict and uncertainty are common features across a range of sources, topics and tasks. Search engines and their presentation of results via search engine results…

Abstract

Purpose

Informational conflict and uncertainty are common features across a range of sources, topics and tasks. Search engines and their presentation of results via search engine results pages (SERPs) often underpinned by knowledge graphs (KGs) are commonly used across tasks. Yet, it is not clear how search does, or could, represent the informational conflict that exists across and within returned results. The purpose of this paper is to review KG and SERP designs for representation of uncertainty or disagreement.

Design/methodology/approach

The authors address the aim through a systematic analysis of material regarding uncertainty and disagreement in KG and SERP contexts. Specifically, the authors focus on the material representation – user interface design features – that have been developed in the context of uncertainty and disagreement representation for KGs and SERPs.

Findings

Searches identified n = 136 items as relevant, with n = 4 sets of visual materials identified from these for analysis of their design features. Design elements were extracted against sets of design principles, highlighting tensions in the design of such features.

Originality/value

The authors conclude by highlighting two key challenges for interface design and recommending six design principles in representing uncertainty and conflict in SERPs. Given the important role technologies play in mediating information access and learning, addressing the representation of uncertainty and disagreement in the representation of information is crucial.

Details

Information and Learning Sciences, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2398-5348

Keywords

Article
Publication date: 8 August 2024

Chih-Ming Chen and Xian-Xu Chen

This study aims to develop an associative text analyzer (ATA) to support users in quickly grasping and interpreting the content of large amounts of text through text association…

Abstract

Purpose

This study aims to develop an associative text analyzer (ATA) to support users in quickly grasping and interpreting the content of large amounts of text through text association recommendations, facilitating the identification of the contextual relationships between people, events, organization and locations for digital humanities. Additionally, by providing text summaries, the tool allows users to link between distant and close readings, thereby enabling more efficient exploration of related texts.

Design/methodology/approach

To verify the effectiveness of this tool in supporting exploration of historical texts, this study uses a counterbalanced design to compare the use of the digital humanities platform for Mr. Lo Chia-Lun’s Writings (DHP-LCLW) with and without the ATA to assist in exploring different aspects of text. The study investigated whether there were significant differences in effectiveness for exploring textual contexts and technological acceptance as well as used semi-structured in-depth interviews to understand the research participants’ viewpoints and experiences with the ATA.

Findings

The results of the experiment revealed that the effectiveness of text exploration using the DHP-LCLW with and without the ATA varied significantly depending on the topic of the text being explored. The DHP-LCLW with the ATA was found to be more suitable for exploring historical texts, while the DHP-LCLW without the ATA was more suitable for exploring educational texts. The DHP-LCLW with the DHP-LCLW was found to be significantly more useful in terms of perceived usefulness than the DHP-LCLW without the ATA, indicating that the research participants believed the ATA was more effective in helping them efficiently grasp the related texts and topics during text exploration.

Practical implications

The study’s practical implications lie in the development of an ATA for digital humanities, offering a valuable tool for efficiently exploring historical texts. The ATA enhances users’ ability to grasp and interpret large volumes of text, facilitating contextual relationship identification. Its practical utility is evident in the improved effectiveness of text exploration, particularly for historical content, as indicated by users’ perceived usefulness.

Originality/value

This study proposes an ATA for digital humanities, enhancing text exploration by offering association recommendations and efficient linking between distant and close readings. The study contributes by providing a specialized tool and demonstrating its perceived usefulness in facilitating efficient exploration of related texts in digital humanities.

Details

Aslib Journal of Information Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2050-3806

Keywords

Article
Publication date: 25 October 2022

Victor Diogho Heuer de Carvalho and Ana Paula Cabral Seixas Costa

This article presents two Brazilian Portuguese corpora collected from different media concerning public security issues in a specific location. The primary motivation is…

Abstract

Purpose

This article presents two Brazilian Portuguese corpora collected from different media concerning public security issues in a specific location. The primary motivation is supporting analyses, so security authorities can make appropriate decisions about their actions.

Design/methodology/approach

The corpora were obtained through web scraping from a newspaper's website and tweets from a Brazilian metropolitan region. Natural language processing was applied considering: text cleaning, lemmatization, summarization, part-of-speech and dependencies parsing, named entities recognition, and topic modeling.

Findings

Several results were obtained based on the methodology used, highlighting some: an example of a summarization using an automated process; dependency parsing; the most common topics in each corpus; the forty named entities and the most common slogans were extracted, highlighting those linked to public security.

Research limitations/implications

Some critical tasks were identified for the research perspective, related to the applied methodology: the treatment of noise from obtaining news on their source websites, passing through textual elements quite present in social network posts such as abbreviations, emojis/emoticons, and even writing errors; the treatment of subjectivity, to eliminate noise from irony and sarcasm; the search for authentic news of issues within the target domain. All these tasks aim to improve the process to enable interested authorities to perform accurate analyses.

Practical implications

The corpora dedicated to the public security domain enable several analyses, such as mining public opinion on security actions in a given location; understanding criminals' behaviors reported in the news or even on social networks and drawing their attitudes timeline; detecting movements that may cause damage to public property and people welfare through texts from social networks; extracting the history and repercussions of police actions, crossing news with records on social networks; among many other possibilities.

Originality/value

The work on behalf of the corpora reported in this text represents one of the first initiatives to create textual bases in Portuguese, dedicated to Brazil's specific public security domain.

Details

Library Hi Tech, vol. 42 no. 4
Type: Research Article
ISSN: 0737-8831

Keywords

Open Access
Article
Publication date: 5 July 2024

Garret Murray, Malin Falkeling and Shang Gao

The purpose of this paper is to provide an overview of the trends and challenges relating to research into the human aspects of ransomware.

Abstract

Purpose

The purpose of this paper is to provide an overview of the trends and challenges relating to research into the human aspects of ransomware.

Design/methodology/approach

A systematic mapping study was carried out to investigate the trends in studies into the human aspects of ransomware, identify challenges encountered by researchers and propose directions for future research. For each of the identified papers from this study, the authors mapped the year of publication, the type of paper, research strategy and data generation method, types of participants included, theories incorporated and lastly, the authors mapped the challenges encountered by the researchers.

Findings

Fifty-nine papers published between 2006 and 2022 are included in the study. The findings indicate that literature on the human aspects of ransomware was scarce prior to 2016. The most-used participant groups in this area are students and cybersecurity professionals, and most studies rely on a survey strategy using the questionnaire to collect data. In addition, many papers did not use theories for their research, but from those that did, game theory was used most often. Furthermore, the most reported challenge is that being hit with ransomware is a sensitive topic, which results in individuals and organisations being reluctant to share their experiences.

Research limitations/implications

This mapping study reveals that the body of literature in the area of human aspects of ransomware has increased over the past couple of years. The findings highlight that being transparent about ransomware attacks, when possible, can help others. Moreover, senior management plays an important role in shaping the information security culture of an organisation, whether to have a culture of transparency or of secrecy.

Originality/value

This study is the first of its kind of systematic mapping studies contributing to the body of knowledge on the human aspects of ransomware.

Details

Information & Computer Security, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2056-4961

Keywords

Open Access
Article
Publication date: 12 July 2024

Trinidad Domínguez Vila, Lucía Rubio-Escuderos and Elisa Alén González

Information and communication technologies are being increasingly used across various sectors including the tourism industry. However, equitable access to online information…

Abstract

Purpose

Information and communication technologies are being increasingly used across various sectors including the tourism industry. However, equitable access to online information remains a significant challenge, especially for people with disabilities (PwD). There is a pressing need for research into the accessibility of the internet to promote social equality. This study aims to identify patterns in both the technical accessibility and the content information related to accessibility and disability that is available on the official websites of leading global tourist destinations.

Design/methodology/approach

A cluster analysis assessed the technical accessibility of the websites, while a principal component analysis evaluated the content information concerning accessibility and disability.

Findings

There has been a substantial improvement in the technical accessibility of tourism websites over that described in earlier studies. There have been no advances in content information on accessibility and disability, which continues to be very heterogeneous and dispersed.

Originality/value

This evaluation of the technical accessibility and content related to accessibility and disability on tourism websites provides a basis for developing strategies to eliminate barriers that PwD encounter in accessing tourism information. To augment the efficacy of big data inputs, it is imperative to homogenise variables associated with technical access and content information on accessibility. Such standardisation will improve the functionality of algorithms critical to the Internet of Things and artificial intelligence technologies. These enhancements are likely to spur innovations that bridge the inequality gap and promote environments where technology serves as a cornerstone of social inclusion and equality.

目的

信息和通信技术在包括旅游业在内的很多行业的应用越来越广泛。 互联网是游客不可或缺的工具, 但并非每个人(在本研究中为残疾人、PwD)都能以相同的方式获取可用信息。有必要对无障碍使用互联网进行研究, 以促进社会平等。本研究旨在识别全球主要旅游目的地官方网站的技术可及性以及网站内容上有关可及性和残疾信息的规律。

设计/方法/途径

聚类分析评估了网站的技术可及性, 主成成分分析评估了网站的可及性和残疾的相关内容信息。

研究结果

与早期研究中描述的相比, 旅游网站的技术可访问性有了实质性的改善。关于无障碍和残疾的内容信息没有任何改善, 仍然非常异质性和分散性。

原创性

本研究对旅游网站的技术可及性以及有关可及性和残障人士的内容信息的评估为制定以消除残疾人旅游所面临的障碍的未来战略奠定了基础。为了提高大数据输入的有效性, 技术可及性和可及性内容信息相关的变量必须标准化和同质化。这将提高关键算法的效率,以增加物联网和人工智能技术的功能。这些改进可以促进创新, 缩小不平等差距, 并营造让技术成为社会包容和平等基石的环境因素。

Objetivo

Las tecnologías de la información y la comunicación (TIC) se utilizan cada vez más en diversos sectores, incluido el turístico. Sin embargo, el acceso equitativo a la información online sigue siendo un reto importante, especialmente para las personas con discapacidad. Existe una necesidad acuciante de investigar la accesibilidad de Internet para promover la igualdad social. Este estudio identifica patrones en la accesibilidad técnica y en el contenido de la información sobre accesibilidad y discapacidad disponible en las páginas web oficiales de los principales destinos turísticos mundiales.

Diseño/metodología/enfoque

Un análisis de conglomerados evaluó la accesibilidad técnica y un análisis de componentes principales analizó el contenido de la información sobre accesibilidad y discapacidad en los sitios web.

Resultados

Se constata una mejora sustancial en la accesibilidad técnica de las páginas web de turismo con respecto a los resultados de estudios anteriores. No ha habido avances en el contenido de la información sobre accesibilidad y discapacidad, que sigue siendo muy heterogénea y dispersa.

Originalidad

Esta evaluación de la accesibilidad técnica y del contenido de la información relativo a la accesibilidad y la discapacidad en las páginas web turísticas proporciona una base para desarrollar estrategias que eliminen las barreras con las que se encuentran las personas con discapacidad para acceder a la información turística. Para mejorar la eficacia de las entradas de big data, es necesario estandarizar las variables relacionadas con la accesibilidad técnica y el contenido de la información sobre accesibilidad. Esta normalización mejorará la funcionalidad de los algoritmos fundamentales para el internet de las cosas y las tecnologías de inteligencia artificial. Es probable que estas mejoras impulsen innovaciones que reduzcan la brecha de la desigualdad y promuevan entornos en los que la tecnología sirva como piedra angular de la inclusión social e igualdad.

1 – 10 of 403