Search results

1 – 6 of 6
Article
Publication date: 4 April 2008

Hisashi Miyamori, Susumu Akamine, Yoshikiyo Kato, Ken Kaneiwa, Kaoru Sumi, Kentaro Inui and Sadao Kurohashi

The purpose of this paper is to describe evaluation data and a prototype system named WISDOM used for analyzing information credibility based on natural language processing.

Abstract

Purpose

The purpose of this paper is to describe evaluation data and a prototype system named WISDOM used for analyzing information credibility based on natural language processing.

Design/methodology/approach

The authors started the Information Credibility Criteria project in April, 2007, mainly to analyze the credibility of information (text) on the web. The project proposes to capture information credibility based on four criteria (content, sender, appearance, and social valuation) and aims to analyze and organize them logically using natural language processing based on predicate argument structure.

Findings

The evaluation data described in this paper were developed as learning and verifying data for these various analysis modules and are composed of manually‐annotated data based on each evaluation criteria about several pre‐selected topics such as current events and medical issues. The prototype system WISDOM was developed to provide information credibility from different perspectives.

Orginality/value

Users will be able to find credible information more reliably by browsing information using different evaluation criteria and conditions provided by the system.

Details

Internet Research, vol. 18 no. 2
Type: Research Article
ISSN: 1066-2243

Keywords

Article
Publication date: 4 April 2008

Yoshikiyo Kato, Sadao Kurohashi and Kentaro Inui

To develop a method for classifying information sender of web documents, which constitutes an important part of information credibility analysis.

597

Abstract

Purpose

To develop a method for classifying information sender of web documents, which constitutes an important part of information credibility analysis.

Design/methodology/approach

Machine learning approach was employed. About 2,000 human‐annotated web documents were prepared for training and evaluation. The classification model was based on support vector machine, and the features used for the classification included the title and URL of documents, as well as information of the top page.

Findings

With relatively small set of features, the proposed method achieved over 50 per cent accuracy.

Research limitations/implications

Some of the information sender categories were found to be more difficult to classify. This is due to the subjective nature of the categories, and further refinement of the categories is needed.

Practical implications

When combined with opinion/sentiment analysis techniques, information sender classification allows more profound analysis based on interactions between opinions and senders. Such analysis forms a basis of information credibility analysis.

Originality/value

This study formulated the problem of information sender classification. It proposed a method which achieves moderate performance. It also identified some of the issues related to information sender classification.

Details

Internet Research, vol. 18 no. 2
Type: Research Article
ISSN: 1066-2243

Keywords

Content available
Article
Publication date: 4 April 2008

Yoshikiyo Kato, Sadao Kurohashi and Kentaro Inui

1273

Abstract

Details

Internet Research, vol. 18 no. 2
Type: Research Article
ISSN: 1066-2243

Article
Publication date: 4 April 2008

Yasuhiko Watanabe, Ryo Nishimura and Yoshihiro Okada

This paper aims to report a QA system that can answer how‐type questions based on confirmed knowledge acquired from mails, posted to a mailing list. It aims to propose a method of…

Abstract

Purpose

This paper aims to report a QA system that can answer how‐type questions based on confirmed knowledge acquired from mails, posted to a mailing list. It aims to propose a method of detecting incorrect information in mails posted to a mailing list (ML) by using mails that ML participants submitted for correcting incorrect information in previous mails.

Design/methodology/approach

The paper discusses a problem of acquiring knowledge from natural language documents, then proposes a method to give these mails three kinds of confirmation labels, positive, negative, and other, depending on their credibility.

Findings

The paper shows a QA system based on the confirmed knowledge. It finds mail questions that are similar to the user's question and gives answers and their confirmation labels to the user. By using the confirmation labels, the user can easily choose the information that can solve his or her problem.

Originality/value

The study describes a method of detecting incorrect information in mails posted to a mailing list and acquiring confirmed knowledge from them.

Details

Internet Research, vol. 18 no. 2
Type: Research Article
ISSN: 1066-2243

Keywords

Article
Publication date: 4 April 2008

Yayoi Hirose and Noboru Sonehara

The purpose of this paper is to propose a new direction for managing information‐credibility risk in the current information and communications technology (ICT) era, where ICT has…

1934

Abstract

Purpose

The purpose of this paper is to propose a new direction for managing information‐credibility risk in the current information and communications technology (ICT) era, where ICT has had both positive and negative effects on contemporary society.

Design/methodology/approach

The paper takes a practical and inductive approach to study the Kyoto avian influenza panic and countermeasures taken in 2004.

Findings

The paper identifies factors which led to enormous damage through harmful rumors and proposes new perspectives for devising countermeasures, such as increasing consumer confidence in an agency as a source of information and effective management of knowledge transfer from experts to non‐experts.

Practical implications

The study gains a better understanding of both technological and social factors that enable or detract from effective nationwide management of information‐credibility risk. Many related ICT projects have been based on either human resource systems or advanced technology. It considers the integration of both factors from three perspectives.

Originality/value

This is a new perspective for examining the transfer of knowledge from experts to consumers in terms of practical solutions, in contrast to the many existing knowledge‐related articles that have mainly focused on knowledge management among experts.

Details

Internet Research, vol. 18 no. 2
Type: Research Article
ISSN: 1066-2243

Keywords

Article
Publication date: 4 April 2008

Robert Malouf and Tony Mullen

To evaluate and extend, existing natural language processing techniques into the domain of informal online political discussions.

3126

Abstract

Purpose

To evaluate and extend, existing natural language processing techniques into the domain of informal online political discussions.

Design/methodology/approach

A database of postings from a US political discussion site was collected, along with self‐reported political orientation data for the users. A variety of sentiment analysis, text classification, and social network analysis methods were applied to the postings and evaluated against the users' self‐descriptions.

Findings

Purely text‐based methods performed poorly, but could be improved using techniques which took into account the users' position in the online community.

Research limitations/implications

The techniques we applied here are fairly simple, and more sophisticated learning algorithms may yield better results for text‐based classification.

Practical implications

This work suggests that social network analysis is an important tool for performing natural language processing tasks with informal web texts.

Originality/value

This research extends sentiment analysis to a new subject domain (US politics) and a new text genre (informal online discusssions).

Details

Internet Research, vol. 18 no. 2
Type: Research Article
ISSN: 1066-2243

Keywords

1 – 6 of 6