Search results

Article
Publication date: 19 March 2018

Hyo-Jung Oh, Dong-Hyun Won, Chonghyuck Kim, Sung-Hee Park and Yong Kim

Abstract

Purpose

The purpose of this paper is to describe the development of an algorithm for realizing web crawlers that automatically collect dynamically generated webpages from the deep web.

Design/methodology/approach

This study proposes and develops an algorithm that manages script commands as links, so that the web crawler can collect dynamically generated pages as if it were gathering static webpages. The algorithm is tested experimentally by using the proposed web crawler to collect deep webpages.
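The idea of managing script commands as links can be illustrated with a minimal sketch. This is not the authors' implementation: the in-memory site, the `fetch` stand-in for a script launcher, and the `goPage` script name are all hypothetical, and a real script launcher would also need the page state in which the script runs.

```python
import re
from collections import deque

# Hypothetical in-memory "site": each key is a link (an ordinary URL or a
# script command) and each value is the HTML that following it would produce.
FAKE_SITE = {
    "http://example.org/search?q=x": '<a href="javascript:goPage(2)">2</a> result-1',
    "javascript:goPage(2)": '<a href="javascript:goPage(3)">3</a> result-2',
    "javascript:goPage(3)": "result-3",
}

HREF_RE = re.compile(r'href="([^"]+)"')


def fetch(link):
    """Stand-in (assumption) for both an HTTP fetch and a script launcher."""
    return FAKE_SITE.get(link, "")


def crawl(seed):
    """Breadth-first crawl that queues script commands exactly like URLs."""
    seen = {seed}
    queue = deque([seed])
    pages = []
    while queue:
        link = queue.popleft()
        html = fetch(link)
        pages.append((link, html))
        # Every href target, script command or URL, goes into the same queue.
        for target in HREF_RE.findall(html):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return pages


pages = crawl("http://example.org/search?q=x")
```

Because the script commands sit in the same queue as URLs, the crawl reaches the second and third result pages instead of stopping at the first.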

Findings

The study finds that when a website provides its search results as script pages, an ordinary crawling process collects only the first page. The proposed algorithm, however, can collect the deep webpages in this case.

Research limitations/implications

To use a script as a link, a human must first analyze the web document. This study uses the web browser object provided by Microsoft Visual Studio as a script launcher, so it cannot collect deep webpages if the web browser object cannot launch the script, or if the web document contains script errors.

Practical implications

Research results indicate that the deep web is estimated to hold 450 to 550 times more information than the surface web, yet its documents are difficult to collect. The proposed algorithm helps enable deep web collection by running scripts.

Originality/value

This study presents a new method that uses script links rather than the keywords adopted in previous approaches. With the proposed algorithm, a script link can be used like an ordinary URL. The experiment shows that the scripts on each individual website must be analyzed before they can be employed as links.

Details

Data Technologies and Applications, vol. 52 no. 2
Type: Research Article
ISSN: 2514-9288

Article
Publication date: 5 January 2024

Dennis Mathaisel

Abstract

Purpose

This paper aims to review and critically assess the role that data visualizations played as communication media tools to help society during a worldwide crisis. This paper re-creates and analyzes several visualizations, critically and ethically assesses their strengths and limitations and provides a set of best practices that are informative, accurate, ethical and engaging at each stage in a reader’s interest.

Design/methodology/approach

The paper bases its methodology on the construct of “The Network Society” (Van Dijk, 2006; Castells, 2000, 2006) by creating a series of social networked visualizations, identifying the challenges and pitfalls associated with this communication approach and suggesting best practices in information communication technology. The case study is COVID-19.

Findings

This study found that visual data dashboards and interactive Web-based charts played a significant role in helping society understand COVID-19’s impact and make better-informed decisions about health and safety.

Research limitations/implications

Visual expositions of data do have strengths and weaknesses depending on how they are designed, how they communicate the story and how they are ethically deployed. Best practices are provided to help mitigate these limitations.

Practical implications

Visualizations are certainly not new, but the technology for rapidly developing and sharing them is. Visual expositions provide an effective medium for communicating complex information to a networked society.

Social implications

Visual expositions provide an effective medium for communicating complex information to a networked society.

Originality/value

This paper highlights the need, during a crisis, to understand complex data in a visual format and to communicate the information quickly, persuasively, effectively and ethically to a networked audience.

Details

Journal of Information, Communication and Ethics in Society, vol. 22 no. 1
Type: Research Article
ISSN: 1477-996X
