Search results

1 – 10 of 785
Article
Publication date: 15 August 2023

Yi-Hung Liu and Sheng-Fong Chen

Whether automatically generated summaries of health social media can assist users in appropriately managing their diseases and ensuring better communication with health…

Abstract

Purpose

Whether automatically generated summaries of health social media can assist users in appropriately managing their diseases and ensuring better communication with health professionals becomes an important issue. This paper aims to develop a novel deep learning-based summarization approach for obtaining the most informative summaries from online patient reviews accurately and effectively.

Design/methodology/approach

This paper proposes a framework to generate summaries that integrates a domain-specific pre-trained embedding model and a deep neural extractive summary approach by considering content features, text sentiment, review influence and readability features. Representative health-related summaries were identified, and user judgements were analysed.

Findings

Experimental results on the three real-world health forum data sets indicate that awarding sentences without incorporating all the adopted features leads to declining summarization performance. The proposed summarizer significantly outperformed the comparison baseline. User judgement through the questionnaire provides realistic and concrete evidence of crucial features that remarkably influence patient forum review summaries.

Originality/value

This study contributes to health analytics and management literature by exploring users’ expressions and opinions through the health deep learning summarization model. The research also developed an innovative mindset to design summarization weighting methods from user-created content on health topics.

Details

The Electronic Library , vol. 41 no. 5
Type: Research Article
ISSN: 0264-0473

Keywords

Article
Publication date: 29 November 2023

Tarun Jaiswal, Manju Pandey and Priyanka Tripathi

The purpose of this study is to investigate and demonstrate the advancements achieved in the field of chest X-ray image captioning through the utilization of dynamic convolutional…

Abstract

Purpose

The purpose of this study is to investigate and demonstrate the advancements achieved in the field of chest X-ray image captioning through the utilization of dynamic convolutional encoder–decoder networks (DyCNN). Typical convolutional neural networks (CNNs) are unable to capture both local and global contextual information effectively and apply a uniform operation to all pixels in an image. To address this, we propose an innovative approach that integrates a dynamic convolution operation at the encoder stage, improving image encoding quality and disease detection. In addition, a decoder based on the gated recurrent unit (GRU) is used for language modeling, and an attention network is incorporated to enhance consistency. This novel combination allows for improved feature extraction, mimicking the expertise of radiologists by selectively focusing on important areas and producing coherent captions with valuable clinical information.

Design/methodology/approach

In this study, we have presented a new report generation approach that utilizes dynamic convolution applied Resnet-101 (DyCNN) as an encoder (Verelst and Tuytelaars, 2019) and GRU as a decoder (Dey and Salemt, 2017; Pan et al., 2020), along with an attention network (see Figure 1). This integration innovatively extends the capabilities of image encoding and sequential caption generation, representing a shift from conventional CNN architectures. With its ability to dynamically adapt receptive fields, the DyCNN excels at capturing features of varying scales within the CXR images. This dynamic adaptability significantly enhances the granularity of feature extraction, enabling precise representation of localized abnormalities and structural intricacies. By incorporating this flexibility into the encoding process, our model can distil meaningful and contextually rich features from the radiographic data. While the attention mechanism enables the model to selectively focus on different regions of the image during caption generation. The attention mechanism enhances the report generation process by allowing the model to assign different importance weights to different regions of the image, mimicking human perception. In parallel, the GRU-based decoder adds a critical dimension to the process by ensuring a smooth, sequential generation of captions.

Findings

The findings of this study highlight the significant advancements achieved in chest X-ray image captioning through the utilization of dynamic convolutional encoder–decoder networks (DyCNN). Experiments conducted using the IU-Chest X-ray datasets showed that the proposed model outperformed other state-of-the-art approaches. The model achieved notable scores, including a BLEU_1 score of 0.591, a BLEU_2 score of 0.347, a BLEU_3 score of 0.277 and a BLEU_4 score of 0.155. These results highlight the efficiency and efficacy of the model in producing precise radiology reports, enhancing image interpretation and clinical decision-making.

Originality/value

This work is the first of its kind, which employs DyCNN as an encoder to extract features from CXR images. In addition, GRU as the decoder for language modeling was utilized and the attention mechanisms into the model architecture were incorporated.

Details

Data Technologies and Applications, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 16 December 2022

Kinjal Bhargavkumar Mistree, Devendra Thakor and Brijesh Bhatt

According to the Indian Sign Language Research and Training Centre (ISLRTC), India has approximately 300 certified human interpreters to help people with hearing loss. This paper…

Abstract

Purpose

According to the Indian Sign Language Research and Training Centre (ISLRTC), India has approximately 300 certified human interpreters to help people with hearing loss. This paper aims to address the issue of Indian Sign Language (ISL) sentence recognition and translation into semantically equivalent English text in a signer-independent mode.

Design/methodology/approach

This study presents an approach that translates ISL sentences into English text using the MobileNetV2 model and Neural Machine Translation (NMT). The authors have created an ISL corpus from the Brown corpus using ISL grammar rules to perform machine translation. The authors’ approach converts ISL videos of the newly created dataset into ISL gloss sequences using the MobileNetV2 model and the recognized ISL gloss sequence is then fed to a machine translation module that generates an English sentence for each ISL sentence.

Findings

As per the experimental results, pretrained MobileNetV2 model was proven the best-suited model for the recognition of ISL sentences and NMT provided better results than Statistical Machine Translation (SMT) to convert ISL text into English text. The automatic and human evaluation of the proposed approach yielded accuracies of 83.3 and 86.1%, respectively.

Research limitations/implications

It can be seen that the neural machine translation systems produced translations with repetitions of other translated words, strange translations when the total number of words per sentence is increased and one or more unexpected terms that had no relation to the source text on occasion. The most common type of error is the mistranslation of places, numbers and dates. Although this has little effect on the overall structure of the translated sentence, it indicates that the embedding learned for these few words could be improved.

Originality/value

Sign language recognition and translation is a crucial step toward improving communication between the deaf and the rest of society. Because of the shortage of human interpreters, an alternative approach is desired to help people achieve smooth communication with the Deaf. To motivate research in this field, the authors generated an ISL corpus of 13,720 sentences and a video dataset of 47,880 ISL videos. As there is no public dataset available for ISl videos incorporating signs released by ISLRTC, the authors created a new video dataset and ISL corpus.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 16 no. 3
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 13 December 2023

Sofia Martynovich

The interpretation of any emerging form or period in art history was never a trivial task. However, in the case of digital art, technology, becoming an integral part, multiplied…

Abstract

Purpose

The interpretation of any emerging form or period in art history was never a trivial task. However, in the case of digital art, technology, becoming an integral part, multiplied the complexity of describing, systematizing and evaluating it. This article investigates the most common metadata standards for the documentation of art as a broad category and suggests possible next steps toward an extended metadata standard for digital art.

Design/methodology/approach

Describing several techno-cultural phenomena formed in the last decade, manifesting the extendibility of digital art (its ability to be easily extended across multiple modalities), the article, at first, points to the long overdue need to re-evaluate the standards around it. Then it suggests a deeper analysis through a comparative study. In the scope of the study three artworks, The Arnolfini Portrait (Jan van Eyck), an iconic example of the early Renaissance, The World's First Collaborative Sentence (Douglas Davis), a classic example of early Internet art and Fake It Till You Make It (Maya Man), a prominent example of the blockchain art, are examined following the structure of the VRA Core 4.0 standard.

Findings

The comparative study demonstrates that digital art is more multi-semantic than traditional physical art, and requires new taxonomies as well as approaches for data acquisition.

Originality/value

Acknowledging that digital art simply has not yet evolved to the stage of being systematically collected by cultural institutions for documentation, curation and preservation, but otherwise, in the past few years, it has been at the front-center of social, economic and technological trends, the article suggests looking for hints on the future-proof extended metadata standard in some of those trends.

Details

Journal of Documentation, vol. 80 no. 2
Type: Research Article
ISSN: 0022-0418

Keywords

Book part
Publication date: 23 November 2023

Claudine Kuradusenge-McLeod

This chapter explores the dual, contentions spaces of consciousness the Rwandan diaspora communities navigate. First of which was created through the stories of trauma and…

Abstract

This chapter explores the dual, contentions spaces of consciousness the Rwandan diaspora communities navigate. First of which was created through the stories of trauma and displacement since the Rwandan genocide and is influenced by the current Rwandan government's control over narratives of identities and remembrance both socially and politically. The second originated from the younger generations' attempt to assimilate to the only country they have never lived in and personally known. In this second space, the younger generations were forced, consciously or unconsciously, to choose between their communities' attachment to the past or creating a new path or future. Most importantly, being in diaspora means accepting that the different generations will often remain at the periphery of the new country, like outsiders looking inward. This phenomenon of social exclusion is a result of different factors, such as social categorisation, collective trauma and the narratives of otherness, which shape the different generations' identity shifts and sense of belonging. Using a phenomenological research method, this study analysed how one event, the 1994 Rwandan genocide, changed the meaning of diaspora consciousness and divided the communities into social categories such as ‘victims’ and ‘perpetrators’. Using the experiences of Rwandan American diaspora communities, I explored the impact of the labels of ‘victim’ and ‘perpetrator’ and how they have not only created specific narratives around remembrance and accountability but also crystallised the normative ideas of who was harmed and who was responsible for inflicting that harm. This chapter analysed the Rwandan communities' social development and assimilation, their understanding of their pasts and their members' social and political engagements in addressing their roles in their communities and nations.

Details

Migrations and Diasporas
Type: Book
ISBN: 978-1-83797-147-3

Keywords

Content available
Book part
Publication date: 1 August 2023

Julie Stubbs, Sophie Russell, Eileen Baldry, David Brown, Chris Cunneen and Melanie Schwartz

Abstract

Details

Rethinking Community Sanctions
Type: Book
ISBN: 978-1-80117-641-5

Article
Publication date: 18 May 2023

Rongen Yan, Depeng Dang, Hu Gao, Yan Wu and Wenhui Yu

Question answering (QA) answers the questions asked by people in the form of natural language. In the QA, due to the subjectivity of users, the questions they query have different…

Abstract

Purpose

Question answering (QA) answers the questions asked by people in the form of natural language. In the QA, due to the subjectivity of users, the questions they query have different expressions, which increases the difficulty of text retrieval. Therefore, the purpose of this paper is to explore new query rewriting method for QA that integrates multiple related questions (RQs) to form an optimal question. Moreover, it is important to generate a new dataset of the original query (OQ) with multiple RQs.

Design/methodology/approach

This study collects a new dataset SQuAD_extend by crawling the QA community and uses word-graph to model the collected OQs. Next, Beam search finds the best path to get the best question. To deeply represent the features of the question, pretrained model BERT is used to model sentences.

Findings

The experimental results show three outstanding findings. (1) The quality of the answers is better after adding the RQs of the OQs. (2) The word-graph that is used to model the problem and choose the optimal path is conducive to finding the best question. (3) Finally, BERT can deeply characterize the semantics of the exact problem.

Originality/value

The proposed method can use word-graph to construct multiple questions and select the optimal path for rewriting the question, and the quality of answers is better than the baseline. In practice, the research results can help guide users to clarify their query intentions and finally achieve the best answer.

Details

Data Technologies and Applications, vol. 58 no. 1
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 19 January 2024

Meng Zhu and Xiaolong Xu

Intent detection (ID) and slot filling (SF) are two important tasks in natural language understanding. ID is to identify the main intent of a paragraph of text. The goal of SF is…

Abstract

Purpose

Intent detection (ID) and slot filling (SF) are two important tasks in natural language understanding. ID is to identify the main intent of a paragraph of text. The goal of SF is to extract the information that is important to the intent from the input sentence. However, most of the existing methods use sentence-level intention recognition, which has the risk of error propagation, and the relationship between intention recognition and SF is not explicitly modeled. Aiming at this problem, this paper proposes a collaborative model of ID and SF for intelligent spoken language understanding called ID-SF-Fusion.

Design/methodology/approach

ID-SF-Fusion uses Bidirectional Encoder Representation from Transformers (BERT) and Bidirectional Long Short-Term Memory (BiLSTM) to extract effective word embedding and context vectors containing the whole sentence information respectively. Fusion layer is used to provide intent–slot fusion information for SF task. In this way, the relationship between ID and SF task is fully explicitly modeled. This layer takes the result of ID and slot context vectors as input to obtain the fusion information which contains both ID result and slot information. Meanwhile, to further reduce error propagation, we use word-level ID for the ID-SF-Fusion model. Finally, two tasks of ID and SF are realized by joint optimization training.

Findings

We conducted experiments on two public datasets, Airline Travel Information Systems (ATIS) and Snips. The results show that the Intent ACC score and Slot F1 score of ID-SF-Fusion on ATIS and Snips are 98.0 per cent and 95.8 per cent, respectively, and the two indicators on Snips dataset are 98.6 per cent and 96.7 per cent, respectively. These models are superior to slot-gated, SF-ID NetWork, stack-Prop and other models. In addition, ablation experiments were performed to further analyze and discuss the proposed model.

Originality/value

This paper uses word-level intent recognition and introduces intent information into the SF process, which is a significant improvement on both data sets.

Details

Data Technologies and Applications, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 18 November 2022

Chaohua Huang, Shaoshuang Zhuang and Haiyan Ma

This study aims to examine the effects of pathos in sustainable brand stories featuring masculinity on brand masculinity and men’s sustainable brand attitude using Aristotle’s…

Abstract

Purpose

This study aims to examine the effects of pathos in sustainable brand stories featuring masculinity on brand masculinity and men’s sustainable brand attitude using Aristotle’s rhetoric theory.

Design/methodology/approach

Three independent online experiments (N = 398; N = 216; N = 247) were conducted to observe how participants responded to a sustainable brand story. Data collected through a post-experimental survey were used to test the proposed model. Research hypotheses were inspected using SPSS.

Findings

The authors reveal brand masculinity is influenced by varying degrees of pathos: participants who read stories with all three pathos elements (metaphor, humor and empathy) demonstrated the highest level of perceived brand masculinity. Male consumers showed more positive attitudes toward masculine sustainable brand stories than feminine ones. The authors also identify the moderating effect of consumer generation: Gen Z (versus Gen Y) consumers demonstrated stronger character identification with hybrid masculinity (versus hegemonic masculinity) sustainable brand stories, resulting in more favorable sustainable brand attitudes.

Originality/value

The study provides a new angle for exploring the relationship between gendered sustainable brand stories and sustainable brand attitudes. It is the first (to the authors’ knowledge) that links Aristotle’s rhetoric theory to brand gender research, and it empirically demonstrates how male consumers from different generational cohorts respond to different masculinity strategies used by sustainable brands.

Details

Asia Pacific Journal of Marketing and Logistics, vol. 35 no. 8
Type: Research Article
ISSN: 1355-5855

Keywords

1 – 10 of 785