Search results
1 – 10 of over 1000Jenq-Muh Hsu, Jui-Yang Chang and Chih-Hung Wang
Named Data Networking (NDN) is a content-centric network differing from the traditional IP-based network. It adopts the name prefix to identify, query and route the information…
Abstract
Purpose
Named Data Networking (NDN) is a content-centric network differing from the traditional IP-based network. It adopts the name prefix to identify, query and route the information content instead of IP-based addressing and routing. NDN provides a convenient way to access the content without knowing the originated location of the requested information. However, the length of name prefix varies. It is not like the fixed-length IP addresses that makes handling queries or searching the requested information in NDN easier. An efficient name lookup mechanism of name prefix will efficiently increase the performance of prefix identifying, name searching and content retrieving. Therefore, this paper aims to propose a partial name prefix merging and shortening scheme for enhancing the efficiency of name lookup in NDN.
Design/methodology/approach
To reduce the work involved in name prefix identifying, querying, storing and routing, this work adopts a cyclic redundancy check-based encoding scheme to shorten the variable length of the name prefix into a proper and fixed length of encoded numerical information. In fact, the structure of a name prefix is presented in a combination of word segments with the slash symbol. The shortening procedure of name prefix can also be applied to adjacent word segments forming fixed-length encoded data for further efficiently matching the name prefix for name lookup in NDN.
Findings
The experimental results show that the shorter length of encoded name prefix can effectively reduce the access time of name lookup and increasingly retrieve the corresponding named content in NDN. Through partial merging and shortening of name prefix, the length of encoded prefix name may be larger than the whole encoding of name prefix. It retains the information differences from different parts of various name prefixes. Thus, it can avoid collision problems with the same encoded information from various name prefixes.
Originality/value
From the experimental results, it is observed that partial merging and shortening of name prefix is useful for name look up in NDN. It can increase the efficiency of name prefix matching and retrieving in NDN. It can also save memory space to store the name prefix in an NDN node.
Details
Keywords
Abdelhalim Saadi and Hacene Belhadef
The purpose of this paper is to present a system based on deep neural networks to extract particular entities from natural language text, knowing that a massive amount of textual…
Abstract
Purpose
The purpose of this paper is to present a system based on deep neural networks to extract particular entities from natural language text, knowing that a massive amount of textual information is electronically available at present. Notably, a large amount of electronic text data indicates great difficulty in finding or extracting relevant information from them.
Design/methodology/approach
This study presents an original system to extract Arabic-named entities by combining a deep neural network-based part-of-speech tagger and a neural network-based named entity extractor. Firstly, the system extracts the grammatical classes of the words with high precision depending on the context of the word. This module plays the role of the disambiguation process. Then, a second module is used to extract the named entities.
Findings
Using deep neural networks in natural language processing, requires tuning many hyperparameters, which is a time-consuming process. To deal with this problem, applying statistical methods like the Taguchi method is much requested. In this study, the system is successfully applied to the Arabic-named entities recognition, where accuracy of 96.81 per cent was reported, which is better than the state-of-the-art results.
Research limitations/implications
The system is designed and trained for the Arabic language, but the architecture can be used for other languages.
Practical implications
Information extraction systems are developed for different applications, such as analysing newspaper articles and databases for commercial, political and social objectives. Information extraction systems also can be built over an information retrieval (IR) system. The IR system eliminates irrelevant documents and paragraphs.
Originality/value
The proposed system can be regarded as the first attempt to use double deep neural networks to increase the accuracy. It also can be built over an IR system. The IR system eliminates irrelevant documents and paragraphs. This process reduces the mass number of documents from which the authors wish to extract the relevant information using an information extraction system.
Details
Keywords
Turkka Näppilä, Katja Moilanen and Timo Niemi
The purpose of this paper is to introduce an expressive query language, called relational XML query language (RXQL), capable of dealing with heterogeneous Extensible Markup…
Abstract
Purpose
The purpose of this paper is to introduce an expressive query language, called relational XML query language (RXQL), capable of dealing with heterogeneous Extensible Markup Language (XML) documents in data‐centric applications. In RXQL, data harmonization (i.e. the removal of heterogeneous factors from XML data) is integrated with typical data‐centric features (e.g. grouping, ordering, and aggregation).
Design/methodology/approach
RXQL is based on the XML relation representation, developed in the authors' previous work. This is a novel approach to unambiguously represent semistructured data relationally, which makes it possible in RXQL to manipulate XML data in a tuple‐oriented way, while XML data are typically manipulated in a path‐oriented way.
Findings
The user is able to describe the result of an RXQL query straightforwardly based on non‐XML syntax. The analysis of this description, through the mechanism developed in this paper, affords the automatic construction of the query result. This feature increases significantly the declarativeness of RXQL compared to the path‐oriented XML languages where the user needs to control the construction of the result extensively.
Practical implications
The authors' formal specification of the construction of the query result can be considered as an abstract implementation of RXQL.
Originality/value
RXQL is a declarative query language capable of integrating data harmonization seamlessly with other data‐centric features in the manipulation of heterogeneous XML data. So far, these kinds of XML query languages have been missing. Obviously, the expressive power of RXQL can be achieved by computationally complete XML languages, such as XQuery. However, these are not actual query languages, and the query formulation in them usually presupposes programming skills that are beyond the ordinary end‐user.
Details
Keywords
Bilal Hawashin, Shadi Alzubi, Tarek Kanan and Ayman Mansour
This paper aims to propose a new efficient semantic recommender method for Arabic content.
Abstract
Purpose
This paper aims to propose a new efficient semantic recommender method for Arabic content.
Design/methodology/approach
Three semantic similarities were proposed to be integrated with the recommender system to improve its ability to recommend based on the semantic aspect. The proposed similarities are CHI-based semantic similarity, singular value decomposition (SVD)-based semantic similarity and Arabic WordNet-based semantic similarity. These similarities were compared with the existing similarities used by recommender systems from the literature.
Findings
Experiments show that the proposed semantic method using CHI-based similarity and using SVD-based similarity are more efficient than the existing methods on Arabic text in term of accuracy and execution time.
Originality/value
Although many previous works proposed recommender system methods for English text, very few works concentrated on Arabic Text. The field of Arabic Recommender Systems is largely understudied in the literature. Aside from this, there is a vital need to consider the semantic relationships behind user preferences to improve the accuracy of the recommendations. The contributions of this work are the following. First, as many recommender methods were proposed for English text and have never been tested on Arabic text, this work compares the performance of these widely used methods on Arabic text. Second, it proposes a novel semantic recommender method for Arabic text. As this method uses semantic similarity, three novel base semantic similarities were proposed and evaluated. Third, this work would direct the attention to more studies in this understudied topic in the literature.
Details
Keywords
Kinh Nguyen, Tharam S. Dillon and Erik Danielsen
This article proposes the concept of web clientserver event together with its associated taxonomy which yields a formal specification for such an event. The concept, in…
Abstract
This article proposes the concept of web clientserver event together with its associated taxonomy which yields a formal specification for such an event. The concept, in conjunction with the concept of atomic use case (reviewed in the article), is then used as a key element for a model‐driven approach to web information system development. The outcome is a new method for web information systems development that reduces the complex web‐based hypermedia navigation behaviour to a much simpler event‐driven behaviour. On the strength of that realized simplicity, the method provides (i) a set of platform‐independent models that completely characterizes the application, and (ii) a well‐defined process to map the combined model to any chosen platform‐dependent implementation.
Details
Keywords
Daniel Hofer, Markus Jäger, Aya Khaled Youssef Sayed Mohamed and Josef Küng
For aiding computer security experts in their study, log files are a crucial piece of information. Especially the time domain is very important for us because in most cases…
Abstract
Purpose
For aiding computer security experts in their study, log files are a crucial piece of information. Especially the time domain is very important for us because in most cases, timestamps are the only linking points between events caused by attackers, faulty systems or simple errors and their corresponding entries in log files. With the idea of storing and analyzing this log information in graph databases, we need a suitable model to store and connect timestamps and their events. This paper aims to find and evaluate different approaches how to store timestamps in graph databases and their individual benefits and drawbacks.
Design/methodology/approach
We analyse three different approaches, how timestamp information can be represented and stored in graph databases. For checking the models, we set up four typical questions that are important for log file analysis and tested them for each of the models. During the evaluation, we used the performance and other properties as metrics, how suitable each of the models is for representing the log files’ timestamp information. In the last part, we try to improve one promising looking model.
Findings
We come to the conclusion, that the simplest model with the least graph database-specific concepts in use is also the one yielding the simplest and fastest queries.
Research limitations/implications
Limitations to this research are that only one graph database was studied and also improvements to the query engine might change future results.
Originality/value
In the study, we addressed the issue of storing timestamps in graph databases in a meaningful, practical and efficient way. The results can be used as a pattern for similar scenarios and applications.
Details
Keywords
Edgar R. Weippl and A Min Tjoa
Privacy is a requirement that has not received the required attention in most e‐learning platforms. Based on the results of a survey we identified weaknesses of e‐learning…
Abstract
Privacy is a requirement that has not received the required attention in most e‐learning platforms. Based on the results of a survey we identified weaknesses of e‐learning platforms and describe improvements we made in Moodle as a proof‐of‐concept.
Details
Keywords
The purpose of this paper is to present a concept of the protocol for public registries based on blockchain. New database protocol aims to use the benefits of blockchain…
Abstract
Purpose
The purpose of this paper is to present a concept of the protocol for public registries based on blockchain. New database protocol aims to use the benefits of blockchain technologies and ensure their interoperability.
Design/methodology/approach
This paper is framed with design science research (DSR). The primary method is exaptation, i.e. adoption of solutions from other fields. The research is looking into existing technologies which are applied here as elements of the protocol: Name-Value Storage (NVS), Berkley DB, RAID protocol, among others. The choice of NVS as a reference technology for creating a database over blockchain is based on the analysis and comparison with two other similar technologies Bigchain and Amazon QLDB.
Findings
The proposed mechanism allows creating a standard database over a bundle of distributed ledgers. It ensures a blockchain agnostic approach and uses the benefits of various blockchain technologies in one ecosystem. In this scheme, blockchains play the role of journal storages (immutable log), whereas the overlaid database is the indexed storage. The distinctive feature of such a system is that in blockchain, users can perform peer-to-peer transactions directly in the ledger using blockchain native mechanism of user access management with public-key cryptography (blockchain does not require to administrate its database).
Originality/value
This paper presents a new method of creating a public peer-to-peer database across a bundle of distributed ledgers.
Details
Keywords
Business searchers of all types are asked at times to identify a company's corporate family structure. Searches of this type involve identifying the parent company and any other…
Abstract
Business searchers of all types are asked at times to identify a company's corporate family structure. Searches of this type involve identifying the parent company and any other subordinate firms. Knowing which databases to search on Dialog and how to search them effectively is the focus of this paper.
The first volume of the Isis cumulative bibliography, based on the critical bibliographies published annually in Isis, will be a kind of biobibliography. It will contain all…
Abstract
The first volume of the Isis cumulative bibliography, based on the critical bibliographies published annually in Isis, will be a kind of biobibliography. It will contain all entries relating to men of science or to personalities of interest to the historian of science that have appeared in the annual critical bibliographies, forming a personal name index. The names will be in alphabetical order.