Search results
1 – 10 of over 39000Xiaoming Zhang, Huilin Chen, Yanqin Ruan, Dongyu Pan and Chongchong Zhao
With the rapid development of materials informatics and the Semantic Web, the semantic-driven solution has emerged to improve traditional query technology, which is hard to…
Abstract
Purpose
With the rapid development of materials informatics and the Semantic Web, the semantic-driven solution has emerged to improve traditional query technology, which is hard to discover implicit knowledge from materials data. However, it is a nontrivial thing for materials scientists to construct a semantic query, and the query results are usually presented in RDF/XML format which is not convenient for users to understand. This paper aims to propose an approach to construct semantic query and visualize the query results for metallic materials domain.
Design/methodology/approach
The authors design a query builder to generate SPARQL query statements automatically based on domain ontology and query conditions inputted by users. Moreover, a semantic visualization model is defined based on the materials science tetrahedron to support the visualization of query results in an intuitive, dynamic and interactive way.
Findings
Based on the Semantic Web technology, the authors design an automatic semantic query builder to help domain experts write the normative semantic query statements quickly and simply, as well as a prototype (named MatViz) is developed to visually show query results, which could help experts discover implicit knowledge from materials data. Moreover, the experiments demonstrate that the proposed system in this paper can rapidly and effectively return visualized query results over the metallic materials data set.
Originality/value
This paper mainly discusses an approach to support semantic query and visualization of metallic materials data. The implementation of MatViz will be a meaningful work for the research of metal materials data integration.
Details
Keywords
Chongchong Zhao, Chao Dong and Xiaoming Zhang
The integration and retrieval of the vast data have attracted sufficient attention, thus the W3C workgroup releases R2RML to standardize the transformation from relational data to…
Abstract
Purpose
The integration and retrieval of the vast data have attracted sufficient attention, thus the W3C workgroup releases R2RML to standardize the transformation from relational data to semantic-aware data. However, it only provides a data transform mechanism to resource description framework (RDF). The generation of mapping alignments still needs manual work or other algorithms. Therefore, the purpose of this paper is to propose a domain-oriented automatic mapping method and an application of the R2RML standard.
Design/methodology/approach
In this paper, materials science is focussed to show an example of domain-oriented mapping. source field concept and M3B2 (Metal Materials Mapping Background Base) knowledge bases are established to support the auto-recommending algorithm. As for the generation of RDF files, the idea is to generate the triples and the links, respectively. The links of the triples follow the object-subject relationship, and the links of the object properties can be achieved by the range individuals and the trail path.
Findings
Consequently based on the previous work, the authors proposed Engine for Metal Materials Mapping Background Base (EM3B2), a semantic integration engine for materials science. EM3B2 not only offers friendly graphical interfaces, but also provides auto-recommending mapping based on materials knowledge to enable users to avoid vast manually work. The experimental result indicates that EM3B2 supplies accurate mapping. Moreover, the running time of E3MB2 is also competitive as classical methods.
Originality/value
This paper proposed EM3B2 semantic integration engine, which contributes to the relational database-to-RDF mapping by the application of W3C R2RML standard and the domain-oriented mapping.
Details
Keywords
Xiaoming Zhang, Kai Li, Chongchong Zhao and Dongyu Pan
With the increasing spread of ontologies in various domains, units have gradually become an essential part of ontologies and units ontologies have been developed to offer a better…
Abstract
Purpose
With the increasing spread of ontologies in various domains, units have gradually become an essential part of ontologies and units ontologies have been developed to offer a better expression ability for the practical usage. From the perspectives of architecture, comparison and reuse, the purpose of this paper is to provide a comprehensive survey on four mainstream units ontologies: quantity-unit-dimension-type, quantities, units, dimensions and values, ontology of units of measure and units ontology (UO) of the open biomedical ontologies, in order to address well the state of the art and the reuse strategies of the UO.
Design/methodology/approach
An architecture of units ontologies is presented, in which the relations between key factors (i.e. units of measure, quantity and dimension) are discussed. The criteria for comparing units ontologies are developed from the perspectives of organizational structure, pattern design and application scenario. Then, the authors compare four typical units ontologies based on the proposed comparison criteria. Furthermore, how to reuse these units ontologies is discussed in materials science domain by utilizing two reuse strategies of partial reference and complete reference.
Findings
Units ontologies have attracted high attention in the scientific domain. Based on the comparison of four popular units ontologies, this paper finds that different units ontologies have different design features from the perspectives of basis structure, units conversion and axioms design; a UO is better to be applied to the application areas that satisfy its design features; and many challenges remain to be done in the future research of the UO.
Originality/value
This paper makes an extensive review on units ontologies, by defining the comparison criteria and discussing the reuse strategies in the materials domain. Based on this investigation, guidelines are summarized for the selection and reuse of units ontologies.
Details
Keywords
Xintong Zhao, Jane Greenberg, Vanessa Meschke, Eric Toberer and Xiaohua Hu
The output of academic literature has increased significantly due to digital technology, presenting researchers with a challenge across every discipline, including materials…
Abstract
Purpose
The output of academic literature has increased significantly due to digital technology, presenting researchers with a challenge across every discipline, including materials science, as it is impossible to manually read and extract knowledge from millions of published literature. The purpose of this study is to address this challenge by exploring knowledge extraction in materials science, as applied to digital scholarship. An overriding goal is to help inform readers about the status knowledge extraction in materials science.
Design/methodology/approach
The authors conducted a two-part analysis, comparing knowledge extraction methods applied materials science scholarship, across a sample of 22 articles; followed by a comparison of HIVE-4-MAT, an ontology-based knowledge extraction and MatScholar, a named entity recognition (NER) application. This paper covers contextual background, and a review of three tiers of knowledge extraction (ontology-based, NER and relation extraction), followed by the research goals and approach.
Findings
The results indicate three key needs for researchers to consider for advancing knowledge extraction: the need for materials science focused corpora; the need for researchers to define the scope of the research being pursued, and the need to understand the tradeoffs among different knowledge extraction methods. This paper also points to future material science research potential with relation extraction and increased availability of ontologies.
Originality/value
To the best of the authors’ knowledge, there are very few studies examining knowledge extraction in materials science. This work makes an important contribution to this underexplored research area.
Details
Keywords
Satanu Ghosh and Kun Lu
The purpose of this paper is to present a preliminary work on extracting band gap information of materials from academic papers. With increasing demand for renewable energy, band…
Abstract
Purpose
The purpose of this paper is to present a preliminary work on extracting band gap information of materials from academic papers. With increasing demand for renewable energy, band gap information will help material scientists design and implement novel photovoltaic (PV) cells.
Design/methodology/approach
The authors collected 1.44 million titles and abstracts of scholarly articles related to materials science, and then filtered the collection to 11,939 articles that potentially contain relevant information about materials and their band gap values. ChemDataExtractor was extended to extract information about PV materials and their band gap information. Evaluation was performed on randomly sampled information records of 415 papers.
Findings
The findings of this study show that the current system is able to correctly extract information for 51.32% articles, with partially correct extraction for 36.62% articles and incorrect for 12.04%. The authors have also identified the errors belonging to three main categories pertaining to chemical entity identification, band gap information and interdependency resolution. Future work will focus on addressing these errors to improve the performance of the system.
Originality/value
The authors did not find any literature to date on band gap information extraction from academic text using automated methods. This work is unique and original. Band gap information is of importance to materials scientists in applications such as solar cells, light emitting diodes and laser diodes.
Details
Keywords
Michael Trachtengerts, Adilbek Erkimbaev, Vladimir Zitserman and Georgii Kobzev
The purpose of this paper is to reveal main advantages of digital libraries in comparison with technology of common database for data-oriented fields of modern science. As an…
Abstract
Purpose
The purpose of this paper is to reveal main advantages of digital libraries in comparison with technology of common database for data-oriented fields of modern science. As an example, the subject domain “nanomaterials and nanotechnologies” with new features due to evolution of concepts and objects is presented.
Design/methodology/approach
An analysis of the information system ABCD as a basis for science-oriented digital library was fulfilled. Also, a survey of peculiarities of data in fast developing fields of science was prepared.
Findings
The results of this paper showed that functional capacities of ABCD satisfy requirements for complex collections and archives of scientific documents. Based on the ABCD tools and this concept, the digital library for storage and systematization of data and documents on nanomaterials and nanotechnologies for the power engineering was constructed. The library combines opportunities of bibliographic, full text and factual information systems.
Originality/value
This paper gives the foundation for creation of a library that combines services of bibliographic, full text and factual (numerical) information systems. Some analyses of ABCD tools were made before elsewhere, but they did not point on data peculiarities of complexly organized domains: semi-structured data, multitude formats (text, image and tables), interconnection of content with external sources located on other servers or in the Web.
Details
Keywords
This paper aims to look at the pros and cons of Microsoft Academic Search (MAS) from a bibliometric perspective.
Abstract
Purpose
This paper aims to look at the pros and cons of Microsoft Academic Search (MAS) from a bibliometric perspective.
Design/methodology/approach
This paper describes the major content and software features of MAS, and its shortcomings.
Findings
The paper recommends some further enhancements, and the use of care and caution when interpreting the metrics produced by cited reference enhanced databases, especially those created on the basis of the idea of autonomous citation indexing.
Originality/value
The paper reveals pros and cons of MAS. A free bibliometric service is a project of great interest to those interested in metrics‐based research performance evaluation.
Details
Keywords
Renze Zhou, Zhiguo Xing, Haidou Wang, Zhongyu Piao, Yanfei Huang, Weiling Guo and Runbo Ma
With the development of deep learning-based analytical techniques, increased research has focused on fatigue data analysis methods based on deep learning, which are gaining in…
Abstract
Purpose
With the development of deep learning-based analytical techniques, increased research has focused on fatigue data analysis methods based on deep learning, which are gaining in popularity. However, the application of deep neural networks in the material science domain is mainly inhibited by data availability. In this paper, to overcome the difficulty of multifactor fatigue life prediction with small data sets,
Design/methodology/approach
A multiple neural network ensemble (MNNE) is used, and an MNNE with a general and flexible explicit function is developed to accurately quantify the complicated relationships hidden in multivariable data sets. Moreover, a variational autoencoder-based data generator is trained with small sample sets to expand the size of the training data set. A comparative study involving the proposed method and traditional models is performed. In addition, a filtering rule based on the R2 score is proposed and applied in the training process of the MNNE, and this approach has a beneficial effect on the prediction accuracy and generalization ability.
Findings
A comparative study involving the proposed method and traditional models is performed. The comparative experiment confirms that the use of hybrid data can improve the accuracy and generalization ability of the deep neural network and that the MNNE outperforms support vector machines, multilayer perceptron and deep neural network models based on the goodness of fit and robustness in the small sample case.
Practical implications
The experimental results imply that the proposed algorithm is a sophisticated and promising multivariate method for predicting the contact fatigue life of a coating when data availability is limited.
Originality/value
A data generated model based on variational autoencoder was used to make up lack of data. An MNNE method was proposed to apply in the small data case of fatigue life prediction.
Details
Keywords
Binbin Zhang, Prakhar Jaiswal, Rahul Rai, Paul Guerrier and George Baggs
Part quality inspection is playing a critical role in the metal additive manufacturing (AM) industry. It produces a part quality analysis report which can be adopted to further…
Abstract
Purpose
Part quality inspection is playing a critical role in the metal additive manufacturing (AM) industry. It produces a part quality analysis report which can be adopted to further improve the overall part quality. However, the part quality inspection process puts heavy reliance on the engineer’s background and experience. This manual process suffers from both low efficiency and potential errors and, therefore, cannot meet the requirement of real-time detection. The purpose of this paper is to look into a deep neural network, Convolutional Neural Network (CNN), towards a robust method for online monitoring of AM parts.
Design/methodology/approach
The proposed online monitoring method relies on a deep CNN that takes a real metal AM part’s images as inputs and the part quality categories as network outputs. The authors validate the efficacy of the proposed methodology by recognizing the “beautiful-weld” category from material CoCrMo top surface images. The images of “beautiful-weld” parts that show even hatch lines and appropriate overlaps indicate a good quality of an AM part.
Findings
The classification accuracy of the developed method using limited information of a small local block of an image is 82 per cent. The classification accuracy using the full image and the ensemble of model outputs is 100 per cent.
Originality/value
A real-world data set of high resolution images of ASTM F75 I CoCrMo-based three-dimensional printed parts (Top surface images with magnification 63×) annotated with categories labels. Development of a CNN-based classification model for the supervised learning task of recognizing a “beautiful-weld” AM parts. The classification accuracy using the full image and the ensemble of model outputs is 100 per cent.
Details