Search results

1 – 10 of over 12000
Article
Publication date: 5 June 2009

Bruno Feres de Souza, Carlos Soares and André C.P.L.F. de Carvalho

The purpose of this paper is to investigate the applicability of meta‐learning to the problem of algorithm recommendation for gene expression data classification.

Abstract

Purpose

The purpose of this paper is to investigate the applicability of meta‐learning to the problem of algorithm recommendation for gene expression data classification.

Design/methodology/approach

Meta‐learning was used to provide a preference order of machine learning algorithms, based on their expected performances. Two approaches were considered for such: k‐nearest neighbors and support vector machine‐based ranking methods. They were applied to a set of 49 publicly available microarray datasets. The evaluation of the methods followed standard procedures suggested in the meta‐learning literature.

Findings

Empirical evidences show that both ranking methods produce more interesting suggestions for gene expression data classification than the baseline method. Although the rankings are more accurate, a significant difference in the performances of the top classifiers was not observed.

Practical implications

As the experiments conducted in this paper suggest, the use of meta‐learning approaches can provide an efficient data driven way to select algorithms for gene expression data classification.

Originality/value

This paper reports contributions to the areas of meta‐learning and gene expression data analysis. Regarding the former, it supports the claim that meta‐learning can be suitably applied to problems of a specific domain, expanding its current practice. To the latter, it introduces a cost effective approach to better deal with classification tasks.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 2 no. 2
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 24 April 2018

Abhishek Kumar Singh, Naresh Kumar Nagwani and Sudhakar Pandey

Recently, with a high volume of users and user’s content in Community Question Answering (CQA) sites, the quality of answers provided by users has raised a big concern. Finding…

Abstract

Purpose

Recently, with a high volume of users and user’s content in Community Question Answering (CQA) sites, the quality of answers provided by users has raised a big concern. Finding the expert users can be a method to address this problem, which aims to find the suitable users (answerers) who can provide high-quality relevant answers. The purpose of this paper is to find the expert users for the newly posted questions of the CQA sites.

Design/methodology/approach

In this paper, a new algorithm, RANKuser, is proposed for identifying the expert users of CQA sites. The proposed RANKuser algorithm consists of three major stages. In the first stage, folksonomy relation between users, tags, and queries is established. User profile attributes, namely, reputation, tags, and badges, are also considered in folksonomy. In the second stage, expertise scores of the user are calculated based on reputation, badges, and tags. Finally, in the third stage, the expert users are identified by extracting top N users based on expertise score.

Findings

In this work, with the help of proposed ranking algorithm, expert users are identified for newly posted questions. In this paper, comparison of proposed user ranking algorithm (RANKuser) is also performed with other existing ranking algorithms, namely, ML-KNN, rankSVM, LDA, STM CQARank, and EV-based model using performance parameters such as hamming loss, accuracy, average precision, one error, F-measure, and normalized discounted cumulative gain. The proposed ranking method is also compared to the original ranking of CQA sites using the paired t-test. The experimental results demonstrate the effectiveness of the proposed RANKuser algorithm in comparison with the existing ranking algorithms.

Originality/value

This paper proposes and implements a new algorithm for expert user identification in CQA sites. By utilizing the folksonomy in CQA sites and information of user profile, this algorithm identifies the experts.

Details

Data Technologies and Applications, vol. 52 no. 3
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 29 June 2021

Xin Pan, Hanqi Wen, Ziwei Wang, Jie Song and Xing Lin Feng

Digital healthcare has become one of the most important Internet applications in the recent years, and digital platforms have been acting as interfaces between the patients and…

Abstract

Purpose

Digital healthcare has become one of the most important Internet applications in the recent years, and digital platforms have been acting as interfaces between the patients and physicians. Although these technologies enhance patient convenience, they create new challenges in platform management. For instance, on physician rating websites, information overload negatively influences patients' decision-making in relation to selecting a physician. This scenario calls for an automated mechanism to provide real-time rankings of physicians. Motivated by an online healthcare platform, this study develops a method to deliver physician ranking on platforms by considering patients' browse behaviors and the capacities of service resources.

Design/methodology/approach

The authors use a probabilistic model for explicitly capturing the browse behaviors of patients. Since the large volume of information in digital systems makes it intractable to solve the dynamic ranking problem, we design a ranking with value approximation algorithm that combines a greedy ranking policy and the value function approximation methods.

Findings

The authors found that the approximation methods are quite effective in dealing with the ranking optimization on the digital healthcare system, and it is mainly because the authors incorporate the patient behaviors and patient availability in the model.

Originality/value

To the best of the authors’ knowledge, this is one of the first studies to present solutions to the dynamic physician ranking problem. The ranking algorithms can also help platforms improve system and operational performance.

Details

Internet Research, vol. 31 no. 6
Type: Research Article
ISSN: 1066-2243

Keywords

Article
Publication date: 30 June 2020

Yin-Ju Chen and Jian-Ming Lo

Decision-making is always an issue that managers have to deal with. Keenly observing to different preferences of the targets provides useful information for decision-makers who do…

Abstract

Purpose

Decision-making is always an issue that managers have to deal with. Keenly observing to different preferences of the targets provides useful information for decision-makers who do not require too much information to make decisions. The main purpose is to avoid decision-makers in a dilemma because of too much or opaque information. Based on problem-oriented, this research aims to help decision-makers to develop a macro-vision strategy that fits the needs of different clusters of customers in terms of their favorite restaurants. This research also focuses on providing the rules to rank data sets for decision-makers to make choices for their favorite restaurant.

Design/methodology/approach

When the decision-makers need to rethink a new strategic planning, they have to think about whether they want to retain or rebuild their relationship with the old consumers or continue to care for new customers. Furthermore, many of the lecturers show that the relative concept will be more effective than the absolute one. Therefore, based on rough set theory, this research proposes an algorithm of related concepts and sends questionnaires to verify the efficiency of the algorithm.

Findings

By feeding the relative order of calculating the ranking rules, we find that it will be more efficient to deal with the faced problems.

Originality/value

The algorithm proposed in this research is applied to the ranking data of food. This research proves that the algorithm is practical and has the potential to reveal important patterns in the data set.

Details

Data Technologies and Applications, vol. 55 no. 2
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 7 November 2016

Amir Hosein Keyhanipour, Behzad Moshiri, Maryam Piroozmand, Farhad Oroumchian and Ali Moeini

Learning to rank algorithms inherently faces many challenges. The most important challenges could be listed as high-dimensionality of the training data, the dynamic nature of Web…

Abstract

Purpose

Learning to rank algorithms inherently faces many challenges. The most important challenges could be listed as high-dimensionality of the training data, the dynamic nature of Web information resources and lack of click-through data. High dimensionality of the training data affects effectiveness and efficiency of learning algorithms. Besides, most of learning to rank benchmark datasets do not include click-through data as a very rich source of information about the search behavior of users while dealing with the ranked lists of search results. To deal with these limitations, this paper aims to introduce a novel learning to rank algorithm by using a set of complex click-through features in a reinforcement learning (RL) model. These features are calculated from the existing click-through information in the data set or even from data sets without any explicit click-through information.

Design/methodology/approach

The proposed ranking algorithm (QRC-Rank) applies RL techniques on a set of calculated click-through features. QRC-Rank is as a two-steps process. In the first step, Transformation phase, a compact benchmark data set is created which contains a set of click-through features. These feature are calculated from the original click-through information available in the data set and constitute a compact representation of click-through information. To find most effective click-through feature, a number of scenarios are investigated. The second phase is Model-Generation, in which a RL model is built to rank the documents. This model is created by applying temporal difference learning methods such as Q-Learning and SARSA.

Findings

The proposed learning to rank method, QRC-rank, is evaluated on WCL2R and LETOR4.0 data sets. Experimental results demonstrate that QRC-Rank outperforms the state-of-the-art learning to rank methods such as SVMRank, RankBoost, ListNet and AdaRank based on the precision and normalized discount cumulative gain evaluation criteria. The use of the click-through features calculated from the training data set is a major contributor to the performance of the system.

Originality/value

In this paper, we have demonstrated the viability of the proposed features that provide a compact representation for the click through data in a learning to rank application. These compact click-through features are calculated from the original features of the learning to rank benchmark data set. In addition, a Markov Decision Process model is proposed for the learning to rank problem using RL, including the sets of states, actions, rewarding strategy and the transition function.

Details

International Journal of Web Information Systems, vol. 12 no. 4
Type: Research Article
ISSN: 1744-0084

Keywords

Article
Publication date: 8 September 2022

Amir Hosein Keyhanipour and Farhad Oroumchian

User feedback inferred from the user's search-time behavior could improve the learning to rank (L2R) algorithms. Click models (CMs) present probabilistic frameworks for describing…

Abstract

Purpose

User feedback inferred from the user's search-time behavior could improve the learning to rank (L2R) algorithms. Click models (CMs) present probabilistic frameworks for describing and predicting the user's clicks during search sessions. Most of these CMs are based on common assumptions such as Attractiveness, Examination and User Satisfaction. CMs usually consider the Attractiveness and Examination as pre- and post-estimators of the actual relevance. They also assume that User Satisfaction is a function of the actual relevance. This paper extends the authors' previous work by building a reinforcement learning (RL) model to predict the relevance. The Attractiveness, Examination and User Satisfaction are estimated using a limited number of the features of the utilized benchmark data set and then they are incorporated in the construction of an RL agent. The proposed RL model learns to predict the relevance label of documents with respect to a given query more effectively than the baseline RL models for those data sets.

Design/methodology/approach

In this paper, User Satisfaction is used as an indication of the relevance level of a query to a document. User Satisfaction itself is estimated through Attractiveness and Examination, and in turn, Attractiveness and Examination are calculated by the random forest algorithm. In this process, only a small subset of top information retrieval (IR) features are used, which are selected based on their mean average precision and normalized discounted cumulative gain values. Based on the authors' observations, the multiplication of the Attractiveness and Examination values of a given query–document pair closely approximates the User Satisfaction and hence the relevance level. Besides, an RL model is designed in such a way that the current state of the RL agent is determined by discretization of the estimated Attractiveness and Examination values. In this way, each query–document pair would be mapped into a specific state based on its Attractiveness and Examination values. Then, based on the reward function, the RL agent would try to choose an action (relevance label) which maximizes the received reward in its current state. Using temporal difference (TD) learning algorithms, such as Q-learning and SARSA, the learning agent gradually learns to identify an appropriate relevance label in each state. The reward that is used in the RL agent is proportional to the difference between the User Satisfaction and the selected action.

Findings

Experimental results on MSLR-WEB10K and WCL2R benchmark data sets demonstrate that the proposed algorithm, named as SeaRank, outperforms baseline algorithms. Improvement is more noticeable in top-ranked results, which usually receive more attention from users.

Originality/value

This research provides a mapping from IR features to the CM features and thereafter utilizes these newly generated features to build an RL model. This RL model is proposed with the definition of the states, actions and reward function. By applying TD learning algorithms, such as the Q-learning and SARSA, within several learning episodes, the RL agent would be able to learn how to choose the most appropriate relevance label for a given pair of query–document.

Details

Data Technologies and Applications, vol. 57 no. 4
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 7 June 2021

Amir Hosein Keyhanipour and Farhad Oroumchian

Incorporating users’ behavior patterns could help in the ranking process. Different click models (CMs) are introduced to model the sophisticated search-time behavior of users…

Abstract

Purpose

Incorporating users’ behavior patterns could help in the ranking process. Different click models (CMs) are introduced to model the sophisticated search-time behavior of users among which commonly used the triple of attractiveness, examination and satisfaction. Inspired by this fact and considering the psychological definitions of these concepts, this paper aims to propose a novel learning to rank by redefining these concepts. The attractiveness and examination factors could be calculated using a limited subset of information retrieval (IR) features by the random forest algorithm, and then they are combined with each other to predicate the satisfaction factor which is considered as the relevance level.

Design/methodology/approach

The attractiveness and examination factors of a given document are usually considered as its perceived relevance and the fast scan of its snippet, respectively. Here, attractiveness and examination factors are regarded as the click-count and the investigation rate, respectively. Also, the satisfaction of a document is supposed to be the same as its relevance level for a given query. This idea is supported by the strong correlation between attractiveness-satisfaction and the examination-satisfaction. Applying random forest algorithm, the attractiveness and examination factors are calculated using a very limited set of the primitive features of query-document pairs. Then, by using the ordered weighted averaging operator, these factors are aggregated to estimate the satisfaction.

Findings

Experimental results on MSLR-WEB10K and WCL2R data sets show the superiority of this algorithm over the state-of-the-art ranking algorithms in terms of P@n and NDCG criteria. The enhancement is more noticeable in top-ranked items which are reviewed more by the users.

Originality/value

This paper proposes a novel learning to rank based on the redefinition of major building blocks of the CMs which are the attractiveness, examination and satisfactory. It proposes a method to use a very limited number of selected IR features to estimate the attractiveness and examination factors and then combines these factors to predicate the satisfactory which is regarded as the relevance level of a document with respect to a given query.

Details

International Journal of Web Information Systems, vol. 17 no. 4
Type: Research Article
ISSN: 1744-0084

Keywords

Article
Publication date: 14 October 2020

Haihua Chen, Yunhan Yang, Wei Lu and Jiangping Chen

Citation contexts have been found useful in many scenarios. However, existing context-based recommendations ignored the importance of diversity in reducing the redundant issues…

Abstract

Purpose

Citation contexts have been found useful in many scenarios. However, existing context-based recommendations ignored the importance of diversity in reducing the redundant issues and thus cannot cover the broad range of user interests. To address this gap, the paper aims to propose a novelty task that can recommend a set of diverse citation contexts extracted from a list of citing articles. This will assist users in understanding how other scholars have cited an article and deciding which articles they should cite in their own writing.

Design/methodology/approach

This research combines three semantic distance algorithms and three diversification re-ranking algorithms for the diversifying recommendation based on the CiteSeerX data set and then evaluates the generated citation context lists by applying a user case study on 30 articles.

Findings

Results show that a diversification strategy that combined “word2vec” and “Integer Linear Programming” leads to better reading experience for participants than other diversification strategies, such as CiteSeerX using a list sorted by citation counts.

Practical implications

This diversifying recommendation task is valuable for developing better systems in information retrieval, automatic academic recommendations and summarization.

Originality/value

The originality of the research lies in the proposal of a novelty task that can recommend a diversification context list describing how other scholars cited an article, thereby making citing decisions easier. A novel mixed approach is explored to generate the most efficient diversifying strategy. Besides, rather than traditional information retrieval evaluation, a user evaluation framework is introduced to reflect user information needs more objectively.

Article
Publication date: 5 January 2018

Tehmina Amjad, Ali Daud and Naif Radi Aljohani

This study reviews the methods found in the literature for the ranking of authors, identifies the pros and cons of these methods, discusses and compares these methods. The purpose…

1452

Abstract

Purpose

This study reviews the methods found in the literature for the ranking of authors, identifies the pros and cons of these methods, discusses and compares these methods. The purpose of this paper is to study is to find the challenges and future directions of ranking of academic objects, especially authors, for future researchers.

Design/methodology/approach

This study reviews the methods found in the literature for the ranking of authors, classifies them into subcategories by studying and analyzing their way of achieving the objectives, discusses and compares them. The data sets used in the literature and the evaluation measures applicable in the domain are also presented.

Findings

The survey identifies the challenges involved in the field of ranking of authors and future directions.

Originality/value

To the best of the knowledge, this is the first survey that studies the author ranking problem in detail and classifies them according to their key functionalities, features and way of achieving the objective according to the requirement of the problem.

Details

Library Hi Tech, vol. 36 no. 1
Type: Research Article
ISSN: 0737-8831

Keywords

Article
Publication date: 8 April 2014

Fahimeh Ramezani and Jie Lu

In any organization there are main goals, with lots of projects designed to achieve these goals. It is important for any organization to determine how much these projects affect…

1998

Abstract

Purpose

In any organization there are main goals, with lots of projects designed to achieve these goals. It is important for any organization to determine how much these projects affect the achievement of these goals. The purpose of this paper is to develop a fuzzy multiple attribute-based group decision-support system (FMAGDSS) to evaluate projects’ performance in promoting the organization's goals utilizing simple additive weighting (SAW) algorithm and technique for order of preference by similarity to ideal solution (TOPSIS) algorithm. The proposed FMAGDSS deals with choosing the most appropriate fuzzy ranking algorithm for solving a given fuzzy multi attribute decision making (FMADM) problem with both qualitative and quantitative criteria (attributes), and uncertain judgments of decision makers.

Design/methodology/approach

In this paper, a FMAGDSS model is designed to determine scores and ranks of every project in promoting the organization's goals. In the first step of FMAGDSS model, all projects are assessed by experts based on evaluation criteria and the organization's goals. The proposed FMAGDSS model will then choose the most appropriate fuzzy ranking method to solve the given FMADM problem. Finally, a sensitivity analysis system is developed to assess the reliability of the decision-making process and provide an opportunity to analyze the impacts of “criteria weights” and “projects” performance’ on evaluating projects in achieving the organizations’ goals, and to assess the reliability of the decision-making process. In addition, a software prototype has been developed on the basis of FMAGDSS model that can be applied to solve every FMADM problem that needs to rank alternatives according to certain attributes.

Findings

The result of this study simplifies and accelerates the evaluation process. The proposed system not only helps organizations to choose the most efficient projects for sustainable development, but also helps them to assess the reliability of the decision-making process, and decrease the uncertainty in final decision caused by uncertain judgment of decision makers.

Research limitations/implications

Future studies are suggested to expand this system to evaluate and rank the project proposals. To achieve this goal, the efficiency of the projects in line with organization's goals, should be predicted.

Originality/value

This study contributes to the relevant literature by proposing a FMAGDSS model to evaluate projects in promoting organization's goals. The proposed FMAGDSS has ability to choose the most appropriate fuzzy ranking algorithm to solve a given FMADM problem based on the type and the number of attributes and alternatives, considering the least computation and time consumption for ranking alternatives.

Details

Journal of Enterprise Information Management, vol. 27 no. 3
Type: Research Article
ISSN: 1741-0398

Keywords

1 – 10 of over 12000