Search results

1 – 10 of over 1000
Open Access
Article
Publication date: 14 August 2017

Xiu Susie Fang, Quan Z. Sheng, Xianzhi Wang, Anne H.H. Ngu and Yihong Zhang

This paper aims to propose a system for generating actionable knowledge from Big Data and use this system to construct a comprehensive knowledge base (KB), called GrandBase.

2049

Abstract

Purpose

This paper aims to propose a system for generating actionable knowledge from Big Data and use this system to construct a comprehensive knowledge base (KB), called GrandBase.

Design/methodology/approach

In particular, this study extracts new predicates from four types of data sources, namely, Web texts, Document Object Model (DOM) trees, existing KBs and query stream to augment the ontology of the existing KB (i.e. Freebase). In addition, a graph-based approach to conduct better truth discovery for multi-valued predicates is also proposed.

Findings

Empirical studies demonstrate the effectiveness of the approaches presented in this study and the potential of GrandBase. The future research directions regarding GrandBase construction and extension has also been discussed.

Originality/value

To revolutionize our modern society by using the wisdom of Big Data, considerable KBs have been constructed to feed the massive knowledge-driven applications with Resource Description Framework triples. The important challenges for KB construction include extracting information from large-scale, possibly conflicting and different-structured data sources (i.e. the knowledge extraction problem) and reconciling the conflicts that reside in the sources (i.e. the truth discovery problem). Tremendous research efforts have been contributed on both problems. However, the existing KBs are far from being comprehensive and accurate: first, existing knowledge extraction systems retrieve data from limited types of Web sources; second, existing truth discovery approaches commonly assume each predicate has only one true value. In this paper, the focus is on the problem of generating actionable knowledge from Big Data. A system is proposed, which consists of two phases, namely, knowledge extraction and truth discovery, to construct a broader KB, called GrandBase.

Details

PSU Research Review, vol. 1 no. 2
Type: Research Article
ISSN: 2399-1747

Keywords

Content available
Article
Publication date: 12 April 2022

Subhamoy Dhua, Kshitiz Kumar, Vijay Singh Sharanagat and Prabhat K. Nema

The amount of food wasted every year is 1.3 billion metric tonne (MT), out of which 0.5 billion MT is contributed by the fruits processing industries. The waste includes…

1224

Abstract

Purpose

The amount of food wasted every year is 1.3 billion metric tonne (MT), out of which 0.5 billion MT is contributed by the fruits processing industries. The waste includes by-products such as peels, pomace and seeds and is a good source of bioactive compounds like phenolic compounds, flavonoids, pectin lipids and dietary fibres. Hence, the purpose of the present study is to review the novel extraction techniques used for the extraction of the bio active compounds from food waste for the selection of suitable extraction method.

Design/methodology/approach

Novel extraction techniques such as ultrasound-assisted extraction, microwave-assisted extraction, enzyme-assisted extraction, supercritical fluid extraction, pulsed electric field extraction and pressurized liquid extraction have emerged to overcome the drawbacks and constraints of conventional extraction techniques. Hence, this study is focussed on novel extraction techniques, their limitations and optimization for the extraction of bioactive compounds from fruit and vegetable waste.

Findings

This study presents a comprehensive review on the novel extraction processes that have been adopted for the extraction of bioactive compounds from food waste. This paper also summarizes bioactive compounds' optimum extraction condition from various food waste using novel extraction techniques.

Research limitations/implications

Food waste is rich in bioactive compounds, and its efficient extraction may add value to the food processing industries. Hence, compressive analysis is needed to overcome the problem associated with the extraction and selection of suitable extraction techniques.

Social implications

Selection of a suitable extraction method will not only add value to food waste but also reduce waste dumping and the cost of bioactive compounds.

Originality/value

This paper presents the research progress on the extraction of bioactive active compounds from food waste using novel extraction techniques.

Details

Nutrition & Food Science , vol. 52 no. 8
Type: Research Article
ISSN: 0034-6659

Keywords

Open Access
Article
Publication date: 6 March 2017

Zhuoxuan Jiang, Chunyan Miao and Xiaoming Li

Recent years have witnessed the rapid development of massive open online courses (MOOCs). With more and more courses being produced by instructors and being participated by…

2120

Abstract

Purpose

Recent years have witnessed the rapid development of massive open online courses (MOOCs). With more and more courses being produced by instructors and being participated by learners all over the world, unprecedented massive educational resources are aggregated. The educational resources include videos, subtitles, lecture notes, quizzes, etc., on the teaching side, and forum contents, Wiki, log of learning behavior, log of homework, etc., on the learning side. However, the data are both unstructured and diverse. To facilitate knowledge management and mining on MOOCs, extracting keywords from the resources is important. This paper aims to adapt the state-of-the-art techniques to MOOC settings and evaluate the effectiveness on real data. In terms of practice, this paper also tries to answer the questions for the first time that to what extend can the MOOC resources support keyword extraction models, and how many human efforts are required to make the models work well.

Design/methodology/approach

Based on which side generates the data, i.e instructors or learners, the data are classified to teaching resources and learning resources, respectively. The approach used on teaching resources is based on machine learning models with labels, while the approach used on learning resources is based on graph model without labels.

Findings

From the teaching resources, the methods used by the authors can accurately extract keywords with only 10 per cent labeled data. The authors find a characteristic of the data that the resources of various forms, e.g. subtitles and PPTs, should be separately considered because they have the different model ability. From the learning resources, the keywords extracted from MOOC forums are not as domain-specific as those extracted from teaching resources, but they can reflect the topics which are lively discussed in forums. Then instructors can get feedback from the indication. The authors implement two applications with the extracted keywords: generating concept map and generating learning path. The visual demos show they have the potential to improve learning efficiency when they are integrated into a real MOOC platform.

Research limitations/implications

Conducting keyword extraction on MOOC resources is quite difficult because teaching resources are hard to be obtained due to copyrights. Also, getting labeled data is tough because usually expertise of the corresponding domain is required.

Practical implications

The experiment results support that MOOC resources are good enough for building models of keyword extraction, and an acceptable balance between human efforts and model accuracy can be achieved.

Originality/value

This paper presents a pioneer study on keyword extraction on MOOC resources and obtains some new findings.

Details

International Journal of Crowd Science, vol. 1 no. 1
Type: Research Article
ISSN: 2398-7294

Keywords

Open Access
Article
Publication date: 21 July 2020

Prajowal Manandhar, Prashanth Reddy Marpu and Zeyar Aung

We make use of the Volunteered Geographic Information (VGI) data to extract the total extent of the roads using remote sensing images. VGI data is often provided only as vector…

1242

Abstract

We make use of the Volunteered Geographic Information (VGI) data to extract the total extent of the roads using remote sensing images. VGI data is often provided only as vector data represented by lines and not as full extent. Also, high geolocation accuracy is not guaranteed and it is common to observe misalignment with the target road segments by several pixels on the images. In this work, we use the prior information provided by the VGI and extract the full road extent even if there is significant mis-registration between the VGI and the image. The method consists of image segmentation and traversal of multiple agents along available VGI information. First, we perform image segmentation, and then we traverse through the fragmented road segments using autonomous agents to obtain a complete road map in a semi-automatic way once the seed-points are defined. The road center-line in the VGI guides the process and allows us to discover and extract the full extent of the road network based on the image data. The results demonstrate the validity and good performance of the proposed method for road extraction that reflects the actual road width despite the presence of disturbances such as shadows, cars and trees which shows the efficiency of the fusion of the VGI and satellite images.

Details

Applied Computing and Informatics, vol. 17 no. 1
Type: Research Article
ISSN: 2634-1964

Keywords

Open Access
Article
Publication date: 4 August 2022

Mohd Hanafi Azman Ong, Norazlina Mohd Yasin and Nur Syafikah Ibrahim

Measuring internal response of online learning is seen as fundamental to absorptive capacity which stimulates knowledge assimilation. However, the evaluation of practice and…

Abstract

Purpose

Measuring internal response of online learning is seen as fundamental to absorptive capacity which stimulates knowledge assimilation. However, the evaluation of practice and research of validated instruments that could effectively measure online learning response behavior is limited. Thus, in this study, a new instrument was designed based on literature to determine the structural variables that exist in the online learning response behavior.

Design/methodology/approach

A structured survey was designed and distributed to 410 Malaysian students enrolled in higher-education institutions. The questionnaire has 38 items, all of which were scored using a seven-point likert scale. To begin with, exploratory factor analysis with three types of extraction methods (i.e. principal component, principal axis factoring and maximum likelihood) was used as the method for comparing the outcomes of each extraction method's grouping variables by constantly using a varimax rotation method. In the second phase, reliability analysis was performed to determine the reliability level of the grouping variables, and finally, correlation analysis was performed to determine the discriminant nomological validity of the grouping variables.

Findings

The findings revealed that nine grouping variables were retrieved, with all items having a good value of factor loading and communalities, as well as an adequate degree of reliability. These extracted variables have good discriminant and nomological validity, as evidenced by correlation analysis, which confirmed that the directions of relationships among the extracted dimensions follow the expected theory (i.e. positive direction) and the correlation coefficient is less than 0.70.

Research limitations/implications

This study proposes a comprehensive set of questionnaires that measure the student's online learning response behavior. These questionnaires have been developed on the basis of an extensive literature review and have undergone a rigorous process of validity and reliability for the purpose of enhancing students' online learning response behavior.

Originality/value

This study's findings will aid academic practitioners in assessing the online learning response behavior of students, as well as enhancing the questionnaire's boost factor when administered in an online learning environment.

Details

Asian Association of Open Universities Journal, vol. 17 no. 2
Type: Research Article
ISSN: 1858-3431

Keywords

Open Access
Article
Publication date: 31 July 2020

Omar Alqaryouti, Nur Siyam, Azza Abdel Monem and Khaled Shaalan

Digital resources such as smart applications reviews and online feedback information are important sources to seek customers’ feedback and input. This paper aims to help…

7041

Abstract

Digital resources such as smart applications reviews and online feedback information are important sources to seek customers’ feedback and input. This paper aims to help government entities gain insights on the needs and expectations of their customers. Towards this end, we propose an aspect-based sentiment analysis hybrid approach that integrates domain lexicons and rules to analyse the entities smart apps reviews. The proposed model aims to extract the important aspects from the reviews and classify the corresponding sentiments. This approach adopts language processing techniques, rules, and lexicons to address several sentiment analysis challenges, and produce summarized results. According to the reported results, the aspect extraction accuracy improves significantly when the implicit aspects are considered. Also, the integrated classification model outperforms the lexicon-based baseline and the other rules combinations by 5% in terms of Accuracy on average. Also, when using the same dataset, the proposed approach outperforms machine learning approaches that uses support vector machine (SVM). However, using these lexicons and rules as input features to the SVM model has achieved higher accuracy than other SVM models.

Details

Applied Computing and Informatics, vol. 20 no. 1/2
Type: Research Article
ISSN: 2634-1964

Keywords

Open Access
Article
Publication date: 12 June 2017

Lichao Zhu, Hangzhou Yang and Zhijun Yan

The purpose of this paper is to develop a new method to extract medical temporal information from online health communities.

Abstract

Purpose

The purpose of this paper is to develop a new method to extract medical temporal information from online health communities.

Design/methodology/approach

The authors trained a conditional random-filed model for the extraction of temporal expressions. The temporal relation identification is considered as a classification task and several support vector machine classifiers are built in the proposed method. For the model training, the authors extracted some high-level semantic features including co-reference relationship of medical concepts and the semantic similarity among words.

Findings

For the extraction of TIMEX, the authors find that well-formatted expressions are easy to recognize, and the main challenge is the relative TIMEX such as “three days after onset”. It also shows the same difficulty for normalization of absolute date or well-formatted duration, whereas frequency is easier to be normalized. For the identification of DocTimeRel, the result is fairly well, and the relation is difficult to identify when it involves a relative TIMEX or a hypothetical concept.

Originality/value

The authors proposed a new method to extract temporal information from the online clinical data and evaluated the usefulness of different level of syntactic features in this task.

Details

International Journal of Crowd Science, vol. 1 no. 2
Type: Research Article
ISSN: 2398-7294

Keywords

Open Access
Article
Publication date: 13 December 2022

Chau Thi Ngoc Pham, Hung Ngoc Phan, Thao Thanh Hoang, Tien Thi Thuy Dao and Huong Mai Bui

The health and environmental hazards associated with synthetic dyes have led to a revival of natural dyes that are non-toxic, environmentally benign and coupled with various…

1181

Abstract

Purpose

The health and environmental hazards associated with synthetic dyes have led to a revival of natural dyes that are non-toxic, environmentally benign and coupled with various functions. The study aims to investigate and develop the potentiality of a popular herb called Chromolaena odorata (C. odorata) as a sustainable and stable dyestuff in textiles.

Design/methodology/approach

Natural colorant extracted from C. odorata leaves is used to dye the worsted fabric, which is one of the premier end-use of wool in fashion, via the padding method associated with pre-, simultaneous and post-mordanting with chitosan, tannic acid and copper sulfate pentahydrate. The effects of extraction, dyeing and mordanting processes on fabric’s color strength K/S and color difference ΔECMC are investigated via International Commission on Illumination’s L*a*b* color space, Fourier transform infrared spectroscopy, scanning electron microscope, color fastness to washing, rubbing, perspiration and light.

Findings

The results obtained indicate extraction with ethanol 90% with a solid/liquid ratio of 1:5 within 1 h, and coloration with a liquor ratio of 1:5 (pH 5) within 2 h under padding pressure of 0.3 MPa are the most effective for coloring worsted fabric.

Practical implications

The C. odorata’s application as a highly effective dyestuff possessing good colorimetric effectiveness has expanded this herb's economic potential, contributing partly to economic growth and adding value to wool in global supply chain.

Originality/value

C. odorata dyestuff has prevailed over other natural colorants because of its impressive color fastness against washing, rubbing, perspiration and especially color stability for pH change.

Details

Research Journal of Textile and Apparel, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1560-6074

Keywords

Open Access
Article
Publication date: 30 July 2019

Zhizhou Wu, Yiming Zhang, Guishan Tan and Jia Hu

Traffic density is one of the most important parameters to consider in the traffic operation field. Owing to limited data sources, traditional methods cannot extract traffic…

1410

Abstract

Purpose

Traffic density is one of the most important parameters to consider in the traffic operation field. Owing to limited data sources, traditional methods cannot extract traffic density directly. In the vehicular ad hoc network (VANET) environment, the vehicle-to-vehicle (V2V) and vehicle-to-infrastructure (V2I) interaction technologies create better conditions for collecting the whole time-space and refined traffic data, which provides a new approach to solving this problem.

Design/methodology/approach

On that basis, a real-time traffic density extraction method has been proposed, including lane density, segment density and network density. Meanwhile, using SUMO and OMNet++ as traffic simulator and network simulator, respectively, the Veins framework as middleware and the two-way coupling VANET simulation platform was constructed.

Findings

Based on the simulation platform, a simulated intersection in Shanghai was developed to investigate the adaptability of the model.

Originality/value

Most research studies use separate simulation methods, importing trace data obtained by using from the simulation software to the communication simulation software. In this paper, the tight coupling simulation method is applied. Using real-time data and history data, the research focuses on the establishment and validation of the traffic density extraction model.

Details

Journal of Intelligent and Connected Vehicles, vol. 2 no. 1
Type: Research Article
ISSN: 2399-9802

Keywords

Open Access
Article
Publication date: 14 August 2020

Paramita Ray and Amlan Chakrabarti

Social networks have changed the communication patterns significantly. Information available from different social networking sites can be well utilized for the analysis of users…

6405

Abstract

Social networks have changed the communication patterns significantly. Information available from different social networking sites can be well utilized for the analysis of users opinion. Hence, the organizations would benefit through the development of a platform, which can analyze public sentiments in the social media about their products and services to provide a value addition in their business process. Over the last few years, deep learning is very popular in the areas of image classification, speech recognition, etc. However, research on the use of deep learning method in sentiment analysis is limited. It has been observed that in some cases the existing machine learning methods for sentiment analysis fail to extract some implicit aspects and might not be very useful. Therefore, we propose a deep learning approach for aspect extraction from text and analysis of users sentiment corresponding to the aspect. A seven layer deep convolutional neural network (CNN) is used to tag each aspect in the opinionated sentences. We have combined deep learning approach with a set of rule-based approach to improve the performance of aspect extraction method as well as sentiment scoring method. We have also tried to improve the existing rule-based approach of aspect extraction by aspect categorization with a predefined set of aspect categories using clustering method and compared our proposed method with some of the state-of-the-art methods. It has been observed that the overall accuracy of our proposed method is 0.87 while that of the other state-of-the-art methods like modified rule-based method and CNN are 0.75 and 0.80 respectively. The overall accuracy of our proposed method shows an increment of 7–12% from that of the state-of-the-art methods.

Details

Applied Computing and Informatics, vol. 18 no. 1/2
Type: Research Article
ISSN: 2634-1964

Keywords

1 – 10 of over 1000