Search results
1 – 10 of over 6000Baoyao Zhou, Siu Cheung Hui and Alvis C. M. Fong
With the explosive growth of information available on the World Wide Web, it has become much more difficult to access relevant information from the Web. One possible approach to…
Abstract
With the explosive growth of information available on the World Wide Web, it has become much more difficult to access relevant information from the Web. One possible approach to solve this problem is web personalization. In this paper, we propose a novel WUL (Web Usage Lattice) based mining approach for mining association access pattern rules for personalized web recommendations. The proposed approach aims to mine a reduced set of effective association pattern rules for enhancing the online performance of web recommendations. We have incorporated the proposed approach into a personalized web recommender system known as AWARS. The performance of the proposed approach is evaluated based on the efficiency and the quality. In the efficiency evaluation, we measure the number of generated rules and the runtime for online recommendations. In the quality evaluation, we measure the quality of the recommendation service based on precision, satisfactory and applicability. This paper will discuss the proposed WUL‐based mining approach, and give the performance of the proposed approach in comparison with the Apriori‐based algorithms.
Details
Keywords
Stefan Strohmeier and Franca Piazza
Numerous research questions in e-HRM research are directly related to the usage of diverse information systems by HR professionals, line managers, employees, and/or applicants…
Abstract
Numerous research questions in e-HRM research are directly related to the usage of diverse information systems by HR professionals, line managers, employees, and/or applicants. Since they are regularly based on Internet technologies, information systems in e-HRM automatically store detailed usage data in log files of web servers. Subsumed as “web mining,” such data are frequently used as inputs for innovative data analysis in e-commerce practice. Though also promising in empirical e-HRM research, web mining is neither discussed nor applied in this area at present. Our chapter therefore aims at a methodological evaluation of web mining as an e-HRM research approach. After introducing web mining as a possible approach in e-HRM research, we examine its applicability by discussing available data, feasible methods, coverable topics, and confirmable privacy. Subsequently, we classify the approach methodologically by examining major issues. Our evaluation reveals that “web mining” constitutes a promising additional research approach that enables research to answer numerous relevant questions related to the actual usage of information systems in e-HRM.
The focus of this paper is a survey of web‐mining research related to areas of societal benefit. The article aims to focus particularly on web mining which may benefit societal…
Abstract
Purpose
The focus of this paper is a survey of web‐mining research related to areas of societal benefit. The article aims to focus particularly on web mining which may benefit societal areas by extracting new knowledge, providing support for decision making and empowering the effective management of societal issues.
Design/methodology/approach
E‐commerce and e‐business are two fields that have been empowered by web mining, having many applications for increasing online sales and doing intelligent business. Have areas of social interest also been empowered by web mining applications? What are the current ongoing research and trends in e‐services fields such as e‐learning, e‐government, e‐politics and e‐democracy? What other areas of social interest can benefit from web mining? This work will try to provide the answers by reviewing the literature for the applications and methods applied to the above fields.
Findings
There is a growing interest in applications of web mining that are of social interest. This reveals that one of the current trends of web mining is toward the connection between intelligent web services and societal benefit applications, which denotes the need for interdisciplinary collaboration between researchers from various fields.
Originality/value
On the one hand, this work presents to the web‐mining community an overview of research opportunities in societal benefit areas. On the other hand, it presents to web researchers from various disciplines an approach for improving their web studies by considering web mining as a powerful research tool.
Details
Keywords
Richard S. Segall and Qingyu Zhang
The purpose of this paper is to illustrate the usefulness and results of applying web mining as extensions of data mining.
Abstract
Purpose
The purpose of this paper is to illustrate the usefulness and results of applying web mining as extensions of data mining.
Design/methodology/approach
Web mining is performed using three selected software to databases related to customer survey, marketing campaign data, and web site usage. The three selected software are PolyAnalyst® of Megaputer Intelligence, Inc., SPSS Clementine®, and ClickTracks by Web Analytics.
Findings
This paper discusses and compares the web mining technologies used by the selected software as applied to text, web, and click stream data.
Research limitations/implications
The limitations include the availability of databases and software to perform the web mining. The implications include that this methodology can be extended to other databases.
Practical implications
The methodology used in this paper could be representative of that used for managers to manage their relationships with customers, their marketing campaigns, and their web site activities.
Originality/value
PolyAnalyst is applied to analyze text data of actual written hotel comments. SPSS Clementine is applied to customer web data collected in response to several different marketing campaigns, including age, gender, and income. ClickTracks is applied to click‐stream data for Bob's Fruit web site to generate click fraud report, search report with revenues, pay‐per‐click, and search keywords for all visitors.
Details
Keywords
Guillermo Navarro‐Arribas and Vicenç Torra
The purpose of this paper is to anonymize web server log files used in e‐commerce web mining processes.
Abstract
Purpose
The purpose of this paper is to anonymize web server log files used in e‐commerce web mining processes.
Design/methodology/approach
The paper has applied statistical disclosure control (SDC) techniques to achieve its goal. More precisely, it has introduced the micro‐aggregation of web access logs.
Findings
The experiments show that the proposed technique provides good results in general, but it is especially outstanding when dealing with relatively small websites.
Research limitations/implications
As in all SDC techniques there is always a trade‐off between privacy and utility or, in other words, between disclosure risk and information loss. In this proposal, it has borne this issue in mind, providing k‐anonymity, while preserving acceptable information accuracy.
Practical implications
Web server logs are valuable information used nowadays for user profiling and general data‐mining analysis of a website in e‐commerce and e‐services. This proposal allows anonymizing such logs, so they can be safely outsourced to other companies for marketing purposes, stored for further analysis, or made publicly available, without risking customer privacy.
Originality/value
Current solutions to the problem presented here are very poor and scarce. They are normally reduced to the elimination of sensitive information from query strings of URLs in general. Moreover, to its knowledge, the use of SDC techniques has never been applied to the anonymization of web logs.
Details
Keywords
Chetna Choudhary, Deepti Mehrotra and Avinash K. Shrivastava
As the number of web applications is increasing day by day web mining acts as an important tool to extract useful information from weblogs and analyse them according to the…
Abstract
Purpose
As the number of web applications is increasing day by day web mining acts as an important tool to extract useful information from weblogs and analyse them according to the attributes and predict the usage of a website. The main aim of this paper is to inspect how process mining can be used to predict the web usability of hotel booking sites based on the number of users on each page, and the time of stay of each user. Through this paper, the authors analyse the web usability of a website through process mining by finding the web usability metrics. This work proposes an approach to finding the usage of a website using the attributes available in the weblog which predicts the actual footfall on a website.
Design/methodology/approach
PROM (Process Mining tool) is used for the analysis of the event log of a hotel booking site. In this work, authors have used a case study to apply the PROM (process mining tool) to pre-process the event log dataset for analysis to discover better-structured process maps than without pre-processing.
Findings
This article first provided an overview of process mining, then focused on web mining and later discussed process mining techniques. It also described different target languages: system nets (i.e. Petri nets with an initial and a final state), inductive miner and heuristic miner, graphs showing the change in behaviour of the dataset and predicting the outcome, that is the webpage having the maximum number of hits.
Originality/value
In this work, a case study has been used to apply the PROM (process mining tool) to pre-process the event log dataset for analysis to discover better-structured process maps than without pre-processing.
Details
Keywords
Rong Gu, Miaoliang Zhu, Liying Zhao and Ningning Zhang
Behaviour in virtual learning environments (VLE), including travel, gaze, manipulate, gesture and conversation, offer considerable information about the user's implicit interest…
Abstract
Purpose
Behaviour in virtual learning environments (VLE), including travel, gaze, manipulate, gesture and conversation, offer considerable information about the user's implicit interest. The purpose of this study is to find an approach for user interest mining via behaviour analysis in a VLE.
Design/methodology/approach
According to research in psychology, any interaction in a VLE has implications for the user's implicit interest. In order to mine a user's implicit interest, an explicit interaction‐interest model needs to be established. This paper presents findings from the concept classification of behaviour in a VLE. Based on this classification, the paper proposes a hierarchical interaction model. In this model the relation between interaction and user interest can be described and used to improve system performance.
Findings
In the experimental prototype the authors found that user‐implicit interest could be mined via stages of web mining, i.e. capture the user's original gesture signal, data pre‐process, pattern discovery, interaction goal and interest mining. The mined user's interest information can be used to update the state of local interest, leading to a reduction in network traffic and promotion of better system performance.
Originality/value
This is an original study using behaviour analysis for interest mining in e‐learning. Research on interest mining in e‐learning focused on content mining or search engine and usage mining in web courses. The paper provides valuable clues regarding user interest mining in a VLE, in which the context is different from usual web courses. The research output can be implemented widely, including online learning, and especially in the VLE.
Details
Keywords
Qingyu Zhang and Richard S. Segall
The purpose of this paper is to review and compare selected software for data mining, text mining (TM), and web mining that are not available as free open‐source software.
Abstract
Purpose
The purpose of this paper is to review and compare selected software for data mining, text mining (TM), and web mining that are not available as free open‐source software.
Design/methodology/approach
Selected softwares are compared with their common and unique features. The software for data mining are SAS® Enterprise Miner™, Megaputer PolyAnalyst® 5.0, NeuralWare Predict®, and BioDiscovery GeneSight®. The software for TM are CompareSuite, SAS® Text Miner, TextAnalyst, VisualText, Megaputer PolyAnalyst® 5.0, and WordStat. The software for web mining are Megaputer PolyAnalyst®, SPSS Clementine®, ClickTracks, and QL2.
Findings
This paper discusses and compares the existing features, characteristics, and algorithms of selected software for data mining, TM, and web mining, respectively. These softwares are also applied to available data sets.
Research limitations/implications
The limitations are the inclusion of selected software and datasets rather than considering the entire realm of these. This review could be used as a framework for comparing other data, text, and web mining software.
Practical implications
This paper can be helpful for an organization or individual when choosing proper software to meet their mining needs.
Originality/value
Each of the software selected for this research has its own unique characteristics, properties, and algorithms. No other paper compares these selected softwares both visually and descriptively for all the three types of data, text, and web mining.
Details
Keywords
Advanced analytics‐driven data analyses allow enterprises to have a complete or “360 degrees” view of their operations and customers. The insight that they gain from such analyses…
Abstract
Purpose
Advanced analytics‐driven data analyses allow enterprises to have a complete or “360 degrees” view of their operations and customers. The insight that they gain from such analyses is then used to direct, optimize, and automate their decision making to successfully achieve their organizational goals. Data, text, and web mining technologies are some of the key contributors to making advanced analytics possible. This paper aims to investigate these three mining technologies in terms of how they are used and the issues that are related to their effective implementation and management within the broader context of predictive or advanced analytics.
Design/methodology/approach
A range of recently published research literature on business intelligence (BI); predictive analytics; and data, text and web mining is reviewed to explore their current state, issues and challenges learned from their practice.
Findings
The findings are reported in two parts. The first part discusses a framework for BI using the data, text, and web mining technologies for advanced analytics; and the second part identifies and discusses the opportunities and challenges the business managers dealing with these technologies face for gaining competitive advantages for their businesses.
Originality/value
The study findings are intended to assist business managers to effectively understand the issues and emerging technologies behind advanced analytics implementation.
Details
Keywords
Tim France, Dave Yen, Jyun‐Cheng Wang and Chia‐Ming Chang
In recent years, the World Wide Web (WWW) has become incredibly popular in homes and offices alike. Consumers need to search for relevant information to help solve purchasing…
Abstract
In recent years, the World Wide Web (WWW) has become incredibly popular in homes and offices alike. Consumers need to search for relevant information to help solve purchasing problems on various Web sites. Although there is no question that great numbers of WWW users will continue using search engines for information retrieval, consumers still hesitate before making a final decision, often because only rough and limited information about the products is made available. Consequently, consumers need the help of data mining in order to help them make informed decisions. Herein we propose a new approach to integrating a search engine with data mining in an effort to help support customer‐oriented information search action. This approach also illustrates how to reduce the consumer’s information search perplexity.
Details