Search results

1 – 10 of 104
To view the access options for this content please click here
Article
Publication date: 3 June 2019

Hongqi Han, Yongsheng Yu, Lijun Wang, Xiaorui Zhai, Yaxin Ran and Jingpeng Han

The aim of this study is to present a novel approach based on semantic fingerprinting and a clustering algorithm called density-based spatial clustering of applications…

Abstract

Purpose

The aim of this study is to present a novel approach based on semantic fingerprinting and a clustering algorithm called density-based spatial clustering of applications with noise (DBSCAN), which can be used to convert investor records into 128-bit semantic fingerprints. Inventor disambiguation is a method used to discover a unique set of underlying inventors and map a set of patents to their corresponding inventors. Resolving the ambiguities between inventors is necessary to improve the quality of the patent database and to ensure accurate entity-level analysis. Most existing methods are based on machine learning and, while they often show good performance, this comes at the cost of time, computational power and storage space.

Design/methodology/approach

Using DBSCAN, the meta and textual data in inventor records are converted into 128-bit semantic fingerprints. However, rather than using a string comparison or cosine similarity to calculate the distance between pair-wise fingerprint records, a binary number comparison function was used in DBSCAN. DBSCAN then clusters the inventor records based on this distance to disambiguate inventor names.

Findings

Experiments conducted on the PatentsView campaign database of the United States Patent and Trademark Office show that this method disambiguates inventor names with recall greater than 99 per cent in less time and with substantially smaller storage requirement.

Research limitations/implications

A better semantic fingerprint algorithm and a better distance function may improve precision. Setting of different clustering parameters for each block or other clustering algorithms will be considered to improve the accuracy of the disambiguation results even further.

Originality/value

Compared with the existing methods, the proposed method does not rely on feature selection and complex feature comparison computation. Most importantly, running time and storage requirements are drastically reduced.

Details

The Electronic Library , vol. 37 no. 2
Type: Research Article
ISSN: 0264-0473

Keywords

To view the access options for this content please click here
Article
Publication date: 8 June 2015

Taiseer Abdalla Elfadil Eisa, Naomie Salim and Salha Alzahrani

– The purpose of this paper is to analyse the state-of-the-art techniques used to detect plagiarism in terms of their limitations, features, taxonomies and processes.

Abstract

Purpose

The purpose of this paper is to analyse the state-of-the-art techniques used to detect plagiarism in terms of their limitations, features, taxonomies and processes.

Design/methodology/approach

The method used to execute this study consisted of a comprehensive search for relevant literature via six online database repositories namely; IEEE xplore, ACM Digital Library, ScienceDirect, EI Compendex, Web of Science and Springer using search strings obtained from the subject of discussion.

Findings

The findings revealed that existing plagiarism detection techniques require further enhancements as existing techniques are incapable of efficiently detecting plagiarised ideas, figures, tables, formulas and scanned documents.

Originality/value

The contribution of this study lies in its ability to have exposed the current trends in plagiarism detection researches and identify areas where further improvements are required so as to complement the performances of existing techniques.

Details

Online Information Review, vol. 39 no. 3
Type: Research Article
ISSN: 1468-4527

Keywords

To view the access options for this content please click here
Article
Publication date: 1 February 2005

Ronald van Eijk, Alfons Salden, Johan de Heer, Arjan Peddemors, Petri Määttä and Ville Haataja

The increasing number of context aware services, which depend on various multimodal sensing, processing and actuating techniques, technologies and formats ask for a…

Abstract

The increasing number of context aware services, which depend on various multimodal sensing, processing and actuating techniques, technologies and formats ask for a physical framework that is able to handle their heterogeneity. Thereto, we propose a context model bridging the semantic gaps between context aware services. In addition we propose a simple system architecture of Distribution Servers and Transformation Servers that bridge semantic gaps among context aware services. Applying our framework we solve the heterogeneity problem existing for location services. Location is typically a form of context where heterogeneity is a problem.

Details

International Journal of Pervasive Computing and Communications, vol. 1 no. 1
Type: Research Article
ISSN: 1742-7371

Keywords

To view the access options for this content please click here
Article
Publication date: 27 July 2018

Evangelia Triperina, Georgios Bardis, Cleo Sgouropoulou, Ioannis Xydas, Olivier Terraz and Georgios Miaoulis

The purpose of this paper is to introduce a novel framework for visual-aided ontology-based multidimensional ranking and to demonstrate a case study in the academic domain.

Abstract

Purpose

The purpose of this paper is to introduce a novel framework for visual-aided ontology-based multidimensional ranking and to demonstrate a case study in the academic domain.

Design/methodology/approach

The paper presents a method for adapting semantic web technologies on multiple criteria decision-making algorithms to endow to them dynamic characteristics. It also showcases the enhancement of the decision-making process by visual analytics.

Findings

The semantic enhanced ranking method enables the reproducibility and transparency of ranking results, while the visual representation of this information further benefits decision makers into making well-informed and insightful deductions about the problem.

Research limitations/implications

This approach is suitable for application domains that are ranked on the basis of multiple criteria.

Originality/value

The discussed approach provides a dynamic ranking methodology, instead of focusing only on one application field, or one multiple criteria decision-making method. It proposes a framework that allows integration of multidimensional, domain-specific information and produces complex ranking results in both textual and visual form.

Details

Data Technologies and Applications, vol. 52 no. 3
Type: Research Article
ISSN: 2514-9288

Keywords

To view the access options for this content please click here
Article
Publication date: 26 March 2021

Azra Nazir, Roohie Naaz Mir and Shaima Qureshi

Natural languages have a fundamental quality of suppleness that makes it possible to present a single idea in plenty of different ways. This feature is often exploited in…

Abstract

Purpose

Natural languages have a fundamental quality of suppleness that makes it possible to present a single idea in plenty of different ways. This feature is often exploited in the academic world, leading to the theft of work referred to as plagiarism. Many approaches have been put forward to detect such cases based on various text features and grammatical structures of languages. However, there is a huge scope of improvement for detecting intelligent plagiarism.

Design/methodology/approach

To realize this, the paper introduces a hybrid model to detect intelligent plagiarism by breaking the entire process into three stages: (1) clustering, (2) vector formulation in each cluster based on semantic roles, normalization and similarity index calculation and (3) Summary generation using encoder-decoder. An effective weighing scheme has been introduced to select terms used to build vectors based on K-means, which is calculated on the synonym set for the said term. If the value calculated in the last stage lies above a predefined threshold, only then the next semantic argument is analyzed. When the similarity score for two documents is beyond the threshold, a short summary for plagiarized documents is created.

Findings

Experimental results show that this method is able to detect connotation and concealment used in idea plagiarism besides detecting literal plagiarism.

Originality/value

The proposed model can help academics stay updated by providing summaries of relevant articles. It would eliminate the practice of plagiarism infesting the academic community at an unprecedented pace. The model will also accelerate the process of reviewing academic documents, aiding in the speedy publishing of research articles.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 14 no. 3
Type: Research Article
ISSN: 1756-378X

Keywords

Content available
Article
Publication date: 30 June 2020

Xiaodong Zhang, Ping Li, Xiaoning Ma and Yanjun Liu

The operating wagon records were produced from distinct railway information systems, which resulted in the wagon routing record with the same oriental destination (OD) was…

Abstract

Purpose

The operating wagon records were produced from distinct railway information systems, which resulted in the wagon routing record with the same oriental destination (OD) was different. This phenomenon has brought considerable difficulties to the railway wagon flow forecast. Some were because of poor data quality, which misled the actual prediction, while others were because of the existence of another actual wagon routings. This paper aims at finding all the wagon routing locus patterns from the history records, and thus puts forward an intelligent recognition method for the actual routing locus pattern of railway wagon flow based on SST algorithm.

Design/methodology/approach

Based on the big data of railway wagon flow records, the routing metadata model is constructed, and the historical data and real-time data are fused to improve the reliability of the path forecast results in the work of railway wagon flow forecast. Based on the division of spatial characteristics and the reduction of dimension in the distributary station, the improved Simhash algorithm is used to calculate the routing fingerprint. Combined with Squared Error Adjacency Matrix Clustering algorithm and Tarjan algorithm, the fingerprint similarity is calculated, the spatial characteristics are clustering and identified, the routing locus mode is formed and then the intelligent recognition of the actual wagon flow routing locus is realized.

Findings

This paper puts forward a more realistic method of railway wagon routing pattern recognition algorithm. The problem of traditional railway wagon routing planning is converted into the routing locus pattern recognition problem, and the wagon routing pattern of all OD streams is excavated from the historical data results. The analysis is carried out from three aspects: routing metadata, routing locus fingerprint and routing locus pattern. Then, the intelligent recognition SST-based algorithm of railway wagon routing locus pattern is proposed, which combines the history data and instant data to improve the reliability of the wagon routing selection result. Finally, railway wagon routing locus could be found out accurately, and the case study tests the validity of the algorithm.

Practical implications

Before the forecasting work of railway wagon flow, it needs to know how many kinds of wagon routing locus exist in a certain OD. Mining all the OD routing locus patterns from the railway wagon operating records is helpful to forecast the future routing combined with the wagon characteristics. The work of this paper is the basis of the railway wagon routing forecast.

Originality/value

As the basis of the railway wagon routing forecast, this research not only improves the accuracy and efficiency for the railway wagon routing forecast but also provides the further support of decision-making for the railway freight transportation organization.

Details

Smart and Resilient Transport, vol. 2 no. 1
Type: Research Article
ISSN: 2632-0487

Keywords

To view the access options for this content please click here
Article
Publication date: 7 June 2013

Younghee Noh

This study seeks to examine the concepts of context, context‐awareness, and context‐awareness technology needed for applying context‐awareness technology to the…

Abstract

Purpose

This study seeks to examine the concepts of context, context‐awareness, and context‐awareness technology needed for applying context‐awareness technology to the next‐generation of digital libraries, and proposed context‐aware services that can be applied to any situation by illustrating some library contexts.

Design/methodology/approach

The paper investigated both theoretical research and case analysis studies before suggesting a service model for context‐awareness‐based libraries by examining the context, context‐awareness, and context‐awareness technology in depth.

Findings

This paper derived possible library services which could be provided if context‐awareness services are implemented by examining and analyzing case studies and systems constructed in other fields. A library‐applied context‐aware system could recognize users entering the library and provide optimal services tailored to each situation for both new and existing users. In addition, the context‐awareness‐based library could provide context‐awareness‐based reference services, context‐awareness‐based loan services, and cater to other user needs in the stacks, research space, and a variety of other information spaces. The context‐awareness‐based library could also recognize users in need of emergency assistance by detecting the user's behavior, movement path, and temperature, etc. Comfort or climate‐control services could provide the user with control of the temperature, humidity, illumination and other environmental elements to fit the circumstances of users, books, and instruments through context‐aware technology.

Practical implications

Next‐generation digital libraries apply new concepts such as semantic retrieval, real‐time web, cloud computing, mobile web, linked data, and context‐awareness. Context‐awareness‐based libraries can provide applied context‐awareness access service, reactive space according to the user's access, applied context‐awareness lobbies, applied context‐awareness reference services, and applied context‐awareness safety services, context‐awareness‐based comfort services and so on.

Originality/value

Real instances of libraries applying context‐aware technology are few, according to the investigative results of this study. The study finds that the next‐generation digital library using context‐awareness technology can provide the best possible service for the convenience of its users.

Content available
Book part
Publication date: 9 November 2020

Abstract

Details

Aesthetics and Style in Strategy
Type: Book
ISBN: 978-1-80043-236-9

To view the access options for this content please click here
Article
Publication date: 18 September 2018

Xueling Liang and Yong Chen

Internet of Things (IoT) has been widely adopted in the commercial sector. Although IoT enables traditional libraries to convert into digital ones, the implementation of…

Abstract

Purpose

Internet of Things (IoT) has been widely adopted in the commercial sector. Although IoT enables traditional libraries to convert into digital ones, the implementation of the IoT in libraries is slow. The purpose of this paper is to report on the current state of research on applications of IoT in libraries, describe challenges that IoT applications face in libraries and discus directions of adopting IoT in libraries in the future.

Design/methodology/approach

To conduct this research, the literature of IoT and its application in libraries were reviewed by examining existing literature in Institute of Electrical and Electronics Engineers (IEEE) Xplore.

Findings

The literature review finds that radio-frequency identification has been adopted by digital libraries. The slow implementation of IoT is caused by security and privacy issues, lack of standards and the lack of financial, technological and organizational resources. This study provides a prospective for the application of IoT in libraries; the technologies of IoT have the potential in betterment of library services.

Research limitations/implications

The limitation of this study is that only IEEE Xplore is included. Other database should be explored in future research.

Originality/value

The application of IoT in libraries is an emerging issue; a systematic and extensive review of recent research on applications of IoT in libraries is unavailable. This paper presents an overview of IoT in libraries, findings and potential research opportunities.

Details

Library Hi Tech, vol. 38 no. 1
Type: Research Article
ISSN: 0737-8831

Keywords

To view the access options for this content please click here
Article
Publication date: 27 September 2011

Marios Poulos, Nikolaos Korfiatis and George Bokos

This paper aims to present the semantic content identifier (SCI), a permanent identifier, computed through a linear‐time onion‐peeling algorithm that enables the…

Abstract

Purpose

This paper aims to present the semantic content identifier (SCI), a permanent identifier, computed through a linear‐time onion‐peeling algorithm that enables the extraction of semantic features from a text, and the integration of this information within the permanent identifier.

Design/methodology/approach

The authors employ SCI to propose a mechanism for simultaneously checking the authenticity and degrees of similarity between different information objects, and present an empirical investigation of the method. A management scenario for the control of the authentication process and the detection of the degree of violation of documents is proposed.

Findings

Such a mechanism could be adopted as a component of libraries' strategy for the protection of the copyrights for documents published on the web.

Practical implications

The use of the proposed numeric code can be utilised efficiently as a constituent part of the digital object identifier (DOI) system, making its computation more efficient and meaningful.

Originality/value

The identifier proposed in the paper can result in a more efficient index for identifying and retrieving objects in a digital library, as well as online repositories and commercial applications that can handle information retrieval requests more effectively.

1 – 10 of 104