Search results
1 – 10 of 322Muhammad Arhami, Anita Desiani, Munawar and Raisah Hayati
Purpose – The purpose of this research is to study the ecological developments that are growing rapidly and are complemented by technological developments that make ecology a…
Abstract
Purpose – The purpose of this research is to study the ecological developments that are growing rapidly and are complemented by technological developments that make ecology a discipline which is able to collaborate, integrate, and use data for the development of science.
Design/Methodology/Approach – The method involves integration, analysis, and conclusion, drawing knowledge dissemination from heterogeneous ecological data that make the ecological research so complex requiring an approach to simplify the problem.
Findings – The data involved in ecology are very complex and diverse and spread from various sources, which are not mutually integrated so that a structured arrangement is required through the arrangement of computer-based data management.
Research Limitations/Implications – Eco-informatics is one of the options to manage the data, settings, and transform it into information and knowledge.
Details
Keywords
This paper aims to investigate the use of crowdsourcing in the enhancement of an ontology of taxonomic knowledge. The paper proposes a conceptual architecture for the…
Abstract
Purpose
This paper aims to investigate the use of crowdsourcing in the enhancement of an ontology of taxonomic knowledge. The paper proposes a conceptual architecture for the incorporation of crowdsourcing into the creation of ontologies.
Design/methodology/approach
The research adopted the design science research approach characterised by cycles of “build” and “evaluate” until a refined artefact was established.
Findings
Data from a case of a fruit fly platform demonstrates that online crowds can contribute to ontology enhancement if engaged in a structured manner that feeds into a defined ontology model.
Research limitations/implications
The research contributes an architecture to the crowdsourcing body knowledge. The research also makes a methodological contribution for the development of ontologies using crowdsourcing.
Practical implications
Creating ontologies is a demanding task and most ontologies are not exhaustive on the targeted domain knowledge. The proposed architecture provides a guiding structure for the engagement of online crowds in the creation and enhancement of domain ontologies. The research uses a case of taxonomic knowledge ontology.
Originality/value
Crowdsourcing for creation and enhancement of ontologies by non-experts is novel and presents opportunity to build and refine ontologies for different domains by engaging online crowds. The process of ontology creation is also prone to errors and engaging crowds presents opportunity for corrections and enhancements.
Details
Keywords
Ana Maria de Carvalho Moura, Fabio Porto, Vania Vidal, Regis Pires Magalhães, Macedo Maia, Maira Poltosi and Daniele Palazzi
The purpose of this paper is to present a four-level architecture that aims at integrating, publishing and retrieving ecological data making use of linked data (LD). It allows…
Abstract
Purpose
The purpose of this paper is to present a four-level architecture that aims at integrating, publishing and retrieving ecological data making use of linked data (LD). It allows scientists to explore taxonomical, spatial and temporal ecological information, access trophic chain relations between species and complement this information with other data sets published on the Web of data. The development of ecological information repositories is a crucial step to organize and catalog natural reserves. However, they present some challenges regarding their effectiveness to provide a shared and global view of biodiversity data, such as data heterogeneity, lack of metadata standardization and data interoperability. LD rose as an interesting technology to solve some of these challenges.
Design/methodology/approach
Ecological data, which is produced and collected from different media resources, is stored in distinct relational databases and published as RDF triples, using a relational-Resource Description Format mapping language. An application ontology reflects a global view of these datasets and share with them the same vocabulary. Scientists specify their data views by selecting their objects of interest in a friendly way. A data view is internally represented as an algebraic scientific workflow that applies data transformation operations to integrate data sources.
Findings
Despite of years of investment, data integration continues offering scientists challenges in obtaining consolidated data views of a large number of heterogeneous scientific data sources. The semantic integration approach presented in this paper simplifies this process both in terms of mappings and query answering through data views.
Social implications
This work provides knowledge about the Guanabara Bay ecosystem, as well as to be a source of answers to the anthropic and climatic impacts on the bay ecosystem. Additionally, this work will enable evaluating the adequacy of actions that are being taken to clean up Guanabara Bay, regarding the marine ecology.
Originality/value
Mapping complexity is traded by the process of generating the exported ontology. The approach reduces the problem of integration to that of mappings between homogeneous ontologies. As a byproduct, data views are easily rewritten into queries over data sources. The architecture is general and although applied to the ecological context, it can be extended to other domains.
Details
Keywords
Yannis Tzitzikas, Carlo Allocca, Chryssoula Bekiari, Yannis Marketakis, Pavlos Fafalios, Martin Doerr, Nikos Minadakis, Theodore Patkos and Leonardo Candela
Marine species data are scattered across a series of heterogeneous repositories and information systems. There is no repository that can claim to have all marine species data…
Abstract
Purpose
Marine species data are scattered across a series of heterogeneous repositories and information systems. There is no repository that can claim to have all marine species data. Moreover, information on marine species are made available through different formats and protocols. The purpose of this paper is to provide models and methods that allow integrating such information either for publishing it, browsing it or querying it. Aiming at providing a valid and reliable knowledge ground for enabling semantic interoperability of marine species data, in this paper the authors motivate a top level ontology, called MarineTLO and discuss its use for creating MarineTLO-based warehouses.
Design/methodology/approach
In this paper the authors introduce a set of motivating scenarios that highlight the need of having a top level ontology. Afterwards the authors describe the main data sources (Fisheries Linked Open Data, ECOSCOPE, WoRMS, FishBase and DBpedia) that will be used as a basis for constructing the MarineTLO.
Findings
The paper discusses about the exploitation of MarineTLO for the construction of a warehouse. Furthermore a series of uses of the MarineTLO-based warehouse is being reported.
Originality/value
In this paper the authors described the design of a top level ontology for the marine domain able to satisfy the need for maintaining integrated sets of facts about marine species and thus assisting ongoing research on biodiversity. Apart from the ontology the authors also elaborated with the mappings that are required for building integrated warehouses.
Details
Keywords
Abstract
Details
Keywords
Shubhada Prashant Nagarkar and Rajendra Kumbhar
The purpose of this paper was to analyse text mining (TM) literature indexed in the Web of Science (WoS) under the “Information Science Library Science” subcategory. More…
Abstract
Purpose
The purpose of this paper was to analyse text mining (TM) literature indexed in the Web of Science (WoS) under the “Information Science Library Science” subcategory. More specifically, it analyses the chronological growth of TM literature, and the major countries, institutions, departments and individuals contributing to TM literature. Collaboration in TM research is also analysed.
Design/methodology/approach
Bibliographic and citation data required for this research were retrieved from the WoS database. TM being a multidisciplinary field, the search was restricted to “Information Science Library Science” subcategory in the WoS. A comprehensive query statement covering all synonyms of “text mining” was prepared using the Boolean operator “OR”. Microsoft Excel and HistCite software were used for data analysis. Pajek and VoSviewer were used for data visualization.
Findings
It was found that USA is the major producer of TM research literature, and the highest number of papers were published in the Journal of The American Medical Informatics. Columbia University ranked first both in number of articles and citations received in the top ten institutes publishing TM literature. It was also observed that six of the top ten subdivisions of institutions are either from medicine or medical informatics or biomedical information. H.C. Chen and C. Friedman were seen to be the most prolific authors.
Research limitations/implications
The paper analyses articles on TM published during 1999-2013 in WoS under the subcategory Information Science Library Science’.
Originality/value
The paper is based on empirical data exclusively gathered for this research.
Details
Keywords
Mike Thelwall and Kayvan Kousha
A number of subject-orientated and general websites have emerged to host academic resources. The purpose of this paper is to evaluate the uptake of such services in order to…
Abstract
Purpose
A number of subject-orientated and general websites have emerged to host academic resources. The purpose of this paper is to evaluate the uptake of such services in order to decide which depositing strategies are effective and should be encouraged.
Design/methodology/approach
This paper evaluates the views and shares of resources in the generic repository Figshare by subject category and resource type.
Findings
Figshare use and common resource types vary substantially by subject category but resources can be highly viewed even in subjects with few members. More active subject areas do not tend to have more viewed or shared resources.
Research limitations/implications
The view counts and share counts analysed may reflect author accesses or may be spammed.
Practical implications
Limited uptake of Figshare within a subject area should not be a barrier to its use. Several highly successful innovative uses for Figshare show that it can reach beyond a purely academic audience.
Originality/value
This is the first analysis of the uptake and use of a generic academic resource sharing repository.
Details
Keywords
Constanze Curdt and Dirk Hoffmeister
Research data management (RDM) comprises all processes, which ensure that research data are well-organized, documented, stored, backed up, accessible, and reusable. RDM systems…
Abstract
Purpose
Research data management (RDM) comprises all processes, which ensure that research data are well-organized, documented, stored, backed up, accessible, and reusable. RDM systems form the technical framework. The purpose of this paper is to present the design and implementation of a RDM system for an interdisciplinary, collaborative, long-term research project with focus on Soil-Vegetation-Atmosphere data.
Design/methodology/approach
The presented RDM system is based on a three-tier (client-server) architecture. This includes a file-based data storage, a database-based metadata storage, and a self-designed user-friendly web-interface. The system is designed in cooperation with the local computing centre, where it is also hosted. A self-designed interoperable, project-specific metadata schema ensures the accurate documentation of all data.
Findings
A RDM system has to be designed and implemented according to requirements of the project participants. General challenges and problems of RDM should be considered. Thus, a close cooperation with the scientists obtains the acceptance and usage of the system.
Originality/value
This paper provides evidence that the implementation of a RDM system in the provided and maintained infrastructure of a computing centre offers many advantages. Consequently, the designed system is independent of the project funding. In addition, access and re-use of all involved project data is ensured. A transferability of the presented approach to another interdisciplinary research project was already successful. Furthermore, the designed metadata schema can be expanded according to changing project requirements.
Details