Search results

Open Access

Article

Publication date: 14 August 2017

GrandBase: generating actionable knowledge from Big Data

Xiu Susie Fang, Quan Z. Sheng, Xianzhi Wang, Anne H.H. Ngu and Yihong Zhang

Downloads

2049

Abstract

Purpose

This paper aims to propose a system for generating actionable knowledge from Big Data and use this system to construct a comprehensive knowledge base (KB), called GrandBase.

Design/methodology/approach

In particular, this study extracts new predicates from four types of data sources, namely, Web texts, Document Object Model (DOM) trees, existing KBs and query streams, to augment the ontology of an existing KB (i.e. Freebase). In addition, a graph-based approach that conducts better truth discovery for multi-valued predicates is proposed.
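
The abstract does not describe the graph-based method in detail, so the following is not that method. As a rough illustration of why multi-valued predicates need different handling from single-valued ones, here is a minimal Python sketch of a threshold-voting baseline that keeps every value claimed by a sufficient share of sources instead of picking a single winner. The function name, the threshold and the sample claims are all assumptions made for illustration.

```python
from collections import Counter

def multi_value_vote(claims, min_support=0.5):
    """Return every value asserted by at least `min_support` of the sources.

    `claims` maps a source name to the set of values it asserts for one
    (entity, predicate) pair. This is a plain voting baseline, not the
    graph-based approach proposed in the paper.
    """
    if not claims:
        return set()
    # Count how many sources assert each value (one vote per source per value).
    counts = Counter(value for values in claims.values() for value in set(values))
    needed = min_support * len(claims)
    return {value for value, votes in counts.items() if votes >= needed}

# Hypothetical claims about the authors of one book.
claims = {
    "source_a": {"Alice", "Bob"},
    "source_b": {"Alice", "Bob"},
    "source_c": {"Alice"},
    "source_d": {"Alice", "Carol"},
}
accepted = multi_value_vote(claims)
```

A single-truth method would keep only the top-voted value here, whereas the threshold keeps every value backed by at least half of the sources.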

Findings

Empirical studies demonstrate the effectiveness of the approaches presented in this study and the potential of GrandBase. Future research directions for GrandBase construction and extension are also discussed.

Originality/value

To revolutionize modern society by using the wisdom of Big Data, numerous KBs have been constructed to feed massive knowledge-driven applications with Resource Description Framework triples. The important challenges for KB construction include extracting information from large-scale, possibly conflicting and differently structured data sources (i.e. the knowledge extraction problem) and reconciling the conflicts that reside in the sources (i.e. the truth discovery problem). Considerable research effort has been devoted to both problems. However, the existing KBs are far from comprehensive and accurate: first, existing knowledge extraction systems retrieve data from limited types of Web sources; second, existing truth discovery approaches commonly assume each predicate has only one true value. This paper focuses on the problem of generating actionable knowledge from Big Data. A system consisting of two phases, namely, knowledge extraction and truth discovery, is proposed to construct a broader KB, called GrandBase.

Details

PSU Research Review, vol. 1 no. 2

Type: Research Article

ISSN: 2399-1747

Content available

Article

Publication date: 12 April 2022

Bioactive compounds and its optimization from food waste: review on novel extraction techniques

Subhamoy Dhua, Kshitiz Kumar, Vijay Singh Sharanagat and Prabhat K. Nema

Downloads

1224

Abstract

Purpose

The amount of food wasted every year is 1.3 billion metric tonnes (MT), of which 0.5 billion MT is contributed by the fruit processing industries. The waste includes by-products such as peels, pomace and seeds and is a good source of bioactive compounds such as phenolic compounds, flavonoids, pectin, lipids and dietary fibres. Hence, the purpose of the present study is to review the novel techniques used to extract bioactive compounds from food waste, to support the selection of a suitable extraction method.

Design/methodology/approach

Novel extraction techniques such as ultrasound-assisted extraction, microwave-assisted extraction, enzyme-assisted extraction, supercritical fluid extraction, pulsed electric field extraction and pressurized liquid extraction have emerged to overcome the drawbacks and constraints of conventional extraction techniques. Hence, this study is focussed on novel extraction techniques, their limitations and optimization for the extraction of bioactive compounds from fruit and vegetable waste.

Findings

This study presents a comprehensive review of the novel extraction processes that have been adopted for the extraction of bioactive compounds from food waste. This paper also summarizes the optimum extraction conditions for bioactive compounds from various food wastes using novel extraction techniques.

Research limitations/implications

Food waste is rich in bioactive compounds, and their efficient extraction may add value to the food processing industries. Hence, comprehensive analysis is needed to overcome the problems associated with extraction and with the selection of suitable extraction techniques.

Social implications

Selection of a suitable extraction method will not only add value to food waste but also reduce waste dumping and the cost of bioactive compounds.

Originality/value

This paper presents the research progress on the extraction of bioactive compounds from food waste using novel extraction techniques.

Details

Nutrition & Food Science, vol. 52 no. 8

Type: Research Article

ISSN: 0034-6659

Open Access

Article

Publication date: 6 March 2017

Application of keyword extraction on MOOC resources

Zhuoxuan Jiang, Chunyan Miao and Xiaoming Li

Downloads

2120

Abstract

Purpose

Recent years have witnessed the rapid development of massive open online courses (MOOCs). With more and more courses being produced by instructors and taken by learners all over the world, an unprecedented volume of educational resources has been aggregated. The educational resources include videos, subtitles, lecture notes, quizzes, etc., on the teaching side, and forum contents, Wiki, log of learning behavior, log of homework, etc., on the learning side. However, the data are both unstructured and diverse. To facilitate knowledge management and mining on MOOCs, extracting keywords from the resources is important. This paper aims to adapt state-of-the-art techniques to MOOC settings and evaluate their effectiveness on real data. In terms of practice, this paper also tries to answer, for the first time, to what extent MOOC resources can support keyword extraction models and how much human effort is required to make the models work well.

Design/methodology/approach

Based on which side generates the data, i.e. instructors or learners, the data are classified into teaching resources and learning resources, respectively. The approach used on teaching resources is based on machine learning models with labels, while the approach used on learning resources is based on a graph model without labels.
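
The abstract does not name the graph model used on learning resources. As one common label-free option, the minimal Python sketch below ranks words with a TextRank-style PageRank over a word co-occurrence graph. The algorithm choice, the function name, the window size and the sample tokens are all assumptions for illustration, not the authors' method.

```python
from collections import defaultdict

def rank_words(tokens, window=2, damping=0.85, iterations=50):
    """Rank words by PageRank over an undirected co-occurrence graph."""
    # Link each word to the distinct words that follow it within `window`.
    neighbours = defaultdict(set)
    for i, word in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != word:
                neighbours[word].add(other)
                neighbours[other].add(word)
    # Iterate the PageRank update a fixed number of times.
    scores = {word: 1.0 for word in neighbours}
    for _ in range(iterations):
        scores = {
            word: (1 - damping)
            + damping * sum(scores[n] / len(neighbours[n]) for n in neighbours[word])
            for word in neighbours
        }
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical, already tokenized forum text.
forum_tokens = ["gradient", "descent", "gradient", "loss",
                "gradient", "step", "gradient", "rate"]
ranking = rank_words(forum_tokens, window=1)
```

No labeled keywords are needed: words that co-occur with many other words accumulate the highest scores.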

Findings

From the teaching resources, the methods used by the authors can accurately extract keywords with only 10 per cent labeled data. The authors find that resources of different forms, e.g. subtitles and PPTs, should be considered separately because they have different modeling ability. From the learning resources, the keywords extracted from MOOC forums are not as domain-specific as those extracted from teaching resources, but they reflect the topics that are actively discussed in forums, which gives instructors feedback. The authors implement two applications with the extracted keywords: generating a concept map and generating a learning path. The visual demos show they have the potential to improve learning efficiency when they are integrated into a real MOOC platform.

Research limitations/implications

Conducting keyword extraction on MOOC resources is quite difficult because teaching resources are hard to obtain owing to copyright. Also, getting labeled data is hard because it usually requires expertise in the corresponding domain.

Practical implications

The experimental results indicate that MOOC resources are good enough for building keyword extraction models, and that an acceptable balance between human effort and model accuracy can be achieved.

Originality/value

This paper presents a pioneering study on keyword extraction on MOOC resources and obtains some new findings.

Details

International Journal of Crowd Science, vol. 1 no. 1

Type: Research Article

ISSN: 2398-7294

Open Access

Article

Publication date: 21 July 2020

Segmentation based traversing-agent approach for road width extraction from satellite images using volunteered geographic information

Prajowal Manandhar, Prashanth Reddy Marpu and Zeyar Aung

Downloads

1242

Abstract

We make use of Volunteered Geographic Information (VGI) data to extract the total extent of roads from remote sensing images. VGI data is often provided only as vector data represented by lines and not as the full road extent. Also, high geolocation accuracy is not guaranteed, and it is common to observe misalignment with the target road segments by several pixels on the images. In this work, we use the prior information provided by the VGI and extract the full road extent even if there is significant mis-registration between the VGI and the image. The method consists of image segmentation and traversal of multiple agents along the available VGI information. First, we perform image segmentation, and then we traverse the fragmented road segments using autonomous agents to obtain a complete road map in a semi-automatic way once the seed points are defined. The road center-line in the VGI guides the process and allows us to discover and extract the full extent of the road network based on the image data. The results demonstrate the validity and good performance of the proposed method for road extraction that reflects the actual road width despite the presence of disturbances such as shadows, cars and trees, which shows the efficiency of fusing the VGI and satellite images.

Details

Applied Computing and Informatics, vol. 17 no. 1

Type: Research Article

ISSN: 2634-1964

Open Access

Article

Publication date: 4 August 2022

Structural variable validation of an Online Learning Response Behavior (OLRB) instrument: A comparison analysis of three extraction methods of Exploratory Factor Analysis

Mohd Hanafi Azman Ong, Norazlina Mohd Yasin and Nur Syafikah Ibrahim

Downloads

573

Abstract

Purpose

Measuring internal response of online learning is seen as fundamental to absorptive capacity which stimulates knowledge assimilation. However, the evaluation of practice and research of validated instruments that could effectively measure online learning response behavior is limited. Thus, in this study, a new instrument was designed based on literature to determine the structural variables that exist in the online learning response behavior.

Design/methodology/approach

A structured survey was designed and distributed to 410 Malaysian students enrolled in higher-education institutions. The questionnaire has 38 items, all of which were scored using a seven-point Likert scale. First, exploratory factor analysis with three extraction methods (i.e. principal component, principal axis factoring and maximum likelihood) was used to compare the grouping variables produced by each extraction method, with varimax rotation applied throughout. In the second phase, reliability analysis was performed to determine the reliability level of the grouping variables, and finally, correlation analysis was performed to determine the discriminant and nomological validity of the grouping variables.
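
The final correlation step can be illustrated with a short Python sketch. It computes the Pearson correlation between two sets of factor scores and checks the two criteria the record reports under Findings: a positive direction and a coefficient below 0.70. The function names and the sample scores are hypothetical, and the sketch covers only this check, not the factor extraction itself.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def check_factor_pair(x, y, ceiling=0.70):
    """Report direction and distinctness for one pair of extracted factors."""
    r = pearson(x, y)
    return {"r": r, "positive": r > 0, "distinct": abs(r) < ceiling}

# Hypothetical factor scores for five respondents.
factor_a = [1, 2, 3, 4, 5]
factor_b = [3, 1, 4, 2, 5]
factor_c = [2, 1, 4, 3, 5]
pair_ab = check_factor_pair(factor_a, factor_b)
pair_ac = check_factor_pair(factor_a, factor_c)
```

Under this rule, a pair whose coefficient reaches or exceeds the ceiling would suggest that the two factors overlap too much to be treated as separate variables.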

Findings

The findings revealed that nine grouping variables were retrieved, with all items having a good value of factor loading and communalities, as well as an adequate degree of reliability. These extracted variables have good discriminant and nomological validity, as evidenced by correlation analysis, which confirmed that the directions of relationships among the extracted dimensions follow the expected theory (i.e. positive direction) and the correlation coefficient is less than 0.70.

Research limitations/implications

This study proposes a comprehensive set of questionnaires that measure the student's online learning response behavior. These questionnaires have been developed on the basis of an extensive literature review and have undergone a rigorous process of validity and reliability for the purpose of enhancing students' online learning response behavior.

Originality/value

This study's findings will aid academic practitioners in assessing the online learning response behavior of students, as well as enhancing the questionnaire's boost factor when administered in an online learning environment.

Details

Asian Association of Open Universities Journal, vol. 17 no. 2

Type: Research Article

ISSN: 1858-3431

Open Access

Article

Publication date: 31 July 2020

Aspect-based sentiment analysis using smart government review data

Omar Alqaryouti, Nur Siyam, Azza Abdel Monem and Khaled Shaalan

Downloads

7041

Abstract

Digital resources such as smart application reviews and online feedback information are important sources of customers’ feedback and input. This paper aims to help government entities gain insights into the needs and expectations of their customers. Towards this end, we propose an aspect-based sentiment analysis hybrid approach that integrates domain lexicons and rules to analyse the entities’ smart app reviews. The proposed model aims to extract the important aspects from the reviews and classify the corresponding sentiments. This approach adopts language processing techniques, rules and lexicons to address several sentiment analysis challenges and produce summarized results. According to the reported results, the aspect extraction accuracy improves significantly when the implicit aspects are considered. Also, the integrated classification model outperforms the lexicon-based baseline and the other rule combinations by 5% in terms of accuracy on average. Also, when using the same dataset, the proposed approach outperforms machine learning approaches that use a support vector machine (SVM). However, using these lexicons and rules as input features to the SVM model has achieved higher accuracy than other SVM models.
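
To make the idea of combining lexicons and rules concrete, here is a minimal Python sketch. It is not the authors' model: the lexicons, the three-token window rule and the negation rule are all hypothetical, and implicit aspects, which the abstract reports as important, are not handled.

```python
# Hypothetical domain lexicons: surface term -> aspect, opinion word -> polarity.
ASPECT_TERMS = {"login": "authentication", "password": "authentication",
                "payment": "payments", "fees": "payments"}
OPINION_WORDS = {"easy": 1, "fast": 1, "great": 1,
                 "slow": -1, "confusing": -1, "broken": -1}
NEGATIONS = {"not", "never", "no"}

def aspect_sentiments(review):
    """Map each explicitly mentioned aspect to a summed polarity score."""
    tokens = review.lower().replace(",", " ").replace(".", " ").split()
    results = {}
    for i, token in enumerate(tokens):
        aspect = ASPECT_TERMS.get(token)
        if aspect is None:
            continue
        # Rule 1: use the first opinion word within the next three tokens.
        for j in range(i + 1, min(i + 4, len(tokens))):
            polarity = OPINION_WORDS.get(tokens[j])
            if polarity is None:
                continue
            # Rule 2: a negation directly before the opinion word flips it.
            if tokens[j - 1] in NEGATIONS:
                polarity = -polarity
            results[aspect] = results.get(aspect, 0) + polarity
            break
    return results

scores = aspect_sentiments("The login is not easy, but payment is fast.")
```

The lexicons supply the vocabulary while the rules decide which opinion word attaches to which aspect and whether its polarity is reversed.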

Details

Applied Computing and Informatics, vol. 20 no. 1/2

Type: Research Article

ISSN: 2634-1964

Open Access

Article

Publication date: 12 June 2017

Mining medical related temporal information from patients’ self-description

Lichao Zhu, Hangzhou Yang and Zhijun Yan

Downloads

875

Abstract

Purpose

The purpose of this paper is to develop a new method to extract medical temporal information from online health communities.

Design/methodology/approach

The authors trained a conditional random field model for the extraction of temporal expressions. Temporal relation identification is treated as a classification task, and several support vector machine classifiers are built in the proposed method. For model training, the authors extracted high-level semantic features, including the co-reference relationships of medical concepts and the semantic similarity among words.

Findings

For the extraction of TIMEX, the authors find that well-formatted expressions are easy to recognize, and the main challenge is relative TIMEX such as “three days after onset”. The same difficulty appears in the normalization of absolute dates and well-formatted durations, whereas frequencies are easier to normalize. For the identification of DocTimeRel, the results are fairly good, although the relation is difficult to identify when it involves a relative TIMEX or a hypothetical concept.

Originality/value

The authors proposed a new method to extract temporal information from online clinical data and evaluated the usefulness of different levels of syntactic features in this task.

Details

International Journal of Crowd Science, vol. 1 no. 2

Type: Research Article

ISSN: 2398-7294

Open Access

Article

Publication date: 13 December 2022

A study on the coloration effectiveness of Chromolaena odorata on the worsted wool fabric

Chau Thi Ngoc Pham, Hung Ngoc Phan, Thao Thanh Hoang, Tien Thi Thuy Dao and Huong Mai Bui

Downloads

1181

Abstract

Purpose

The health and environmental hazards associated with synthetic dyes have led to a revival of natural dyes that are non-toxic, environmentally benign and coupled with various functions. The study aims to investigate and develop the potential of a popular herb called Chromolaena odorata (C. odorata) as a sustainable and stable dyestuff in textiles.

Design/methodology/approach

Natural colorant extracted from C. odorata leaves is used to dye worsted fabric, which is one of the premier end-uses of wool in fashion, via the padding method associated with pre-, simultaneous and post-mordanting with chitosan, tannic acid and copper sulfate pentahydrate. The effects of the extraction, dyeing and mordanting processes on the fabric’s color strength K/S and color difference ΔE_CMC are investigated via the International Commission on Illumination’s L*a*b* color space, Fourier transform infrared spectroscopy, scanning electron microscopy and color fastness to washing, rubbing, perspiration and light.
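
The abstract does not say how K/S is computed. Color strength is conventionally obtained from measured reflectance through the Kubelka-Munk relation K/S = (1 - R)^2 / (2R), and the minimal Python sketch below assumes that relation applies here. The function name and the reflectance values are illustrative, not data from the study.

```python
def color_strength(reflectance):
    """Kubelka-Munk color strength K/S for a reflectance fraction R in (0, 1]."""
    if not 0 < reflectance <= 1:
        raise ValueError("reflectance must be a fraction in (0, 1]")
    return (1 - reflectance) ** 2 / (2 * reflectance)

# Hypothetical reflectance readings at the wavelength of maximum absorption.
pale_sample = color_strength(0.5)   # weakly dyed fabric reflects more light
deep_sample = color_strength(0.1)   # strongly dyed fabric reflects less light
```

Lower reflectance gives a higher K/S, which is why K/S serves as a measure of how much colorant the fabric has taken up.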

Findings

The results obtained indicate that extraction with 90% ethanol at a solid/liquid ratio of 1:5 within 1 h, and coloration at a liquor ratio of 1:5 (pH 5) within 2 h under a padding pressure of 0.3 MPa, are the most effective for coloring worsted fabric.

Practical implications

The application of C. odorata as a highly effective dyestuff with good colorimetric effectiveness has expanded this herb's economic potential, contributing partly to economic growth and adding value to wool in the global supply chain.

Originality/value

C. odorata dyestuff has prevailed over other natural colorants because of its impressive color fastness to washing, rubbing and perspiration and, especially, its color stability under pH change.

Details

Research Journal of Textile and Apparel, vol. ahead-of-print no. ahead-of-print

Type: Research Article

ISSN: 1560-6074

Open Access

Article

Publication date: 30 July 2019

The research of traffic density extraction method under vehicular ad hoc network environment

Zhizhou Wu, Yiming Zhang, Guishan Tan and Jia Hu

Downloads

1410

Abstract

Purpose

Traffic density is one of the most important parameters to consider in the traffic operation field. Owing to limited data sources, traditional methods cannot extract traffic density directly. In the vehicular ad hoc network (VANET) environment, the vehicle-to-vehicle (V2V) and vehicle-to-infrastructure (V2I) interaction technologies create better conditions for collecting the whole time-space and refined traffic data, which provides a new approach to solving this problem.

Design/methodology/approach

On that basis, a real-time traffic density extraction method has been proposed, covering lane density, segment density and network density. Meanwhile, a two-way coupling VANET simulation platform was constructed, using SUMO and OMNeT++ as the traffic simulator and network simulator, respectively, and the Veins framework as middleware.
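
The abstract does not define the three density levels. The minimal Python sketch below assumes the simplest reading, vehicles per kilometre per lane aggregated at each level, to show how the levels relate. The function names, the lane-kilometre weighting and the sample counts are assumptions, not the paper's model.

```python
def lane_density(vehicle_count, lane_length_km):
    """Vehicles per kilometre on a single lane."""
    return vehicle_count / lane_length_km

def segment_density(lane_counts, segment_length_km):
    """Average vehicles per kilometre per lane over all lanes of a segment."""
    return sum(lane_counts) / (segment_length_km * len(lane_counts))

def network_density(segments):
    """Vehicles per lane-kilometre over a list of (lane_counts, length_km)."""
    vehicles = sum(sum(counts) for counts, _ in segments)
    lane_km = sum(length * len(counts) for counts, length in segments)
    return vehicles / lane_km

# Hypothetical vehicle counts, e.g. as reported over V2V/V2I messages.
single_lane = lane_density(12, 0.5)
two_lane_segment = segment_density([12, 8], 0.5)
small_network = network_density([([12, 8], 0.5), ([5], 1.0)])
```

Weighting by lane-kilometres keeps a short, congested segment from dominating the network figure.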

Findings

Based on the simulation platform, a simulated intersection in Shanghai was developed to investigate the adaptability of the model.

Originality/value

Most research studies use separate simulation methods, importing trace data obtained from the traffic simulation software into the communication simulation software. In this paper, the tight coupling simulation method is applied. Using real-time data and historical data, the research focuses on the establishment and validation of the traffic density extraction model.

Details

Journal of Intelligent and Connected Vehicles, vol. 2 no. 1

Type: Research Article

ISSN: 2399-9802

Open Access

Article

Publication date: 14 August 2020

A Mixed approach of Deep Learning method and Rule-Based method to improve Aspect Level Sentiment Analysis

Paramita Ray and Amlan Chakrabarti

Downloads

6405

Abstract

Social networks have changed communication patterns significantly. Information available from different social networking sites can be well utilized for the analysis of users' opinions. Hence, organizations would benefit from the development of a platform that can analyze public sentiment in social media about their products and services to add value to their business processes. Over the last few years, deep learning has become very popular in areas such as image classification and speech recognition. However, research on the use of deep learning methods in sentiment analysis is limited. It has been observed that in some cases the existing machine learning methods for sentiment analysis fail to extract some implicit aspects and might not be very useful. Therefore, we propose a deep learning approach for aspect extraction from text and analysis of users' sentiment corresponding to the aspect. A seven-layer deep convolutional neural network (CNN) is used to tag each aspect in the opinionated sentences. We have combined the deep learning approach with a set of rule-based approaches to improve the performance of the aspect extraction method as well as the sentiment scoring method. We have also tried to improve the existing rule-based approach to aspect extraction by categorizing aspects into a predefined set of aspect categories using a clustering method, and compared our proposed method with some of the state-of-the-art methods. It has been observed that the overall accuracy of our proposed method is 0.87, while that of other state-of-the-art methods such as the modified rule-based method and CNN is 0.75 and 0.80, respectively. The overall accuracy of our proposed method is thus 7–12 percentage points higher than that of the state-of-the-art methods.

Details

Applied Computing and Informatics, vol. 18 no. 1/2

Type: Research Article

ISSN: 2634-1964
