Search results

1 – 10 of 118
Article
Publication date: 5 July 2024

Nouhaila Bensalah, Habib Ayad, Abdellah Adib and Abdelhamid Ibn El Farouk

The paper aims to enhance Arabic machine translation (MT) by proposing novel approaches: (1) a dimensionality reduction technique for word embeddings tailored for Arabic text…

Abstract

Purpose

The paper aims to enhance Arabic machine translation (MT) by proposing novel approaches: (1) a dimensionality reduction technique for word embeddings tailored for Arabic text, optimizing efficiency while retaining semantic information; (2) a comprehensive comparison of meta-embedding techniques to improve translation quality; and (3) a method leveraging self-attention and Gated CNNs to capture token dependencies, including temporal and hierarchical features within sentences, and interactions between different embedding types. These approaches collectively aim to enhance translation quality by combining different embedding schemes and leveraging advanced modeling techniques.

Design/methodology/approach

Recent works on MT in general and Arabic MT in particular often pick one type of word embedding model. In this paper, we present a novel approach to enhance Arabic MT by addressing three key aspects. Firstly, we propose a new dimensionality reduction technique for word embeddings, specifically tailored for Arabic text. This technique optimizes the efficiency of embeddings while retaining their semantic information. Secondly, we conduct an extensive comparison of different meta-embedding techniques, exploring the combination of static and contextual embeddings. Through this analysis, we identify the most effective approach to improve translation quality. Lastly, we introduce a novel method that leverages self-attention and Gated convolutional neural networks (CNNs) to capture token dependencies, including temporal and hierarchical features within sentences, as well as interactions between different types of embeddings. Our experimental results demonstrate the effectiveness of our proposed approach in significantly enhancing Arabic MT performance. It outperforms baseline models with a BLEU score increase of 2 points and achieves superior results compared to state-of-the-art approaches, with an average improvement of 4.6 points across all evaluation metrics.

Findings

The proposed approaches significantly enhance Arabic MT performance. The dimensionality reduction technique improves the efficiency of word embeddings while preserving semantic information. Comprehensive comparison identifies effective meta-embedding techniques, with the contextualized dynamic meta-embeddings (CDME) model showcasing competitive results. Integration of Gated CNNs with the transformer model surpasses baseline performance, leveraging both architectures' strengths. Overall, these findings demonstrate substantial improvements in translation quality, with a BLEU score increase of 2 points and an average improvement of 4.6 points across all evaluation metrics, outperforming state-of-the-art approaches.

Originality/value

The paper’s originality lies in its departure from simply fine-tuning the transformer model for a specific task. Instead, it introduces modifications to the internal architecture of the transformer, integrating Gated CNNs to enhance translation performance. This departure from traditional fine-tuning approaches demonstrates a novel perspective on model enhancement, offering unique insights into improving translation quality without solely relying on pre-existing architectures. The originality in dimensionality reduction lies in the tailored approach for Arabic text. While dimensionality reduction techniques are not new, the paper introduces a specific method optimized for Arabic word embeddings. By employing independent component analysis (ICA) and a post-processing method, the paper effectively reduces the dimensionality of word embeddings while preserving semantic information which has not been investigated before especially for MT task.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 17 no. 3
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 27 June 2023

Syihabuddin Syihabuddin, Nurul Murtadho, Yusring Sanusi Baso, Hikmah Maulani and Shofa Musthofa Khalid

Assessing whether a book is relevant or suitable for use in teaching materials is not an easy and haphazard matter, various methods and theories have been offered by researchers…

Abstract

Purpose

Assessing whether a book is relevant or suitable for use in teaching materials is not an easy and haphazard matter, various methods and theories have been offered by researchers in studying this matter. Taking a study of the context of textbooks, researchers found the urgency that textbooks are a foundation for education, socialization and transmission of knowledge and its construction. Researchers offer another approach, namely by using praxeology as a study tool so that the goals of the textbooks previously intended are fulfilled.

Design/methodology/approach

The researcher uses a qualitative approach through grounded theory. Grounded theory procedures are designed to develop a well-integrated set of concepts that provide a thorough theoretical explanation of the social phenomena under study. A grounded theory must explain as well as describe. It may also implicitly provide some degree of predictability, but only with respect to certain conditions (Corbin and Strauss, 1990). Document analysis in conducting this research study. Document analysis itself examines systematic procedures for reviewing or evaluating documents, both printed and electronic materials.

Findings

Two issues regarding gender acquisition have been investigated in L2 Arabic acquisition studies; the order in which L2 Arabic learners acquire certain grammatical features of the gender system and the effect of L1 on the acquisition of some grammatical features from L2 grammatical gender. Arabic has a two-gender system that classifies all nouns, animate and inanimate, as masculine or feminine. Verbs, nouns, adjectives, personal, demonstrative and relative pronouns related to nouns in the syntactic structure of sentences show gender agreement.

Research limitations/implications

In practice, as a book intended for non-speakers, the book is presented using a general view of linguistic theory. In relation to the gender agreement, the presentation of the book begins and is inserted with the concepts of nouns and verbs. Returning to the praxeology context, First, The Know How (Praxis) explains practice (i.e. the tasks performed and the techniques used). Second, To Know Why or Knowledge (logos) which explains and justifies practice from a technological and theoretical point of view. Answering the first concept, the exercise presented in the book is a concept with three clusters explained at the beginning of the discussion. And the second concept, explained with a task design approach which includes word categorization by separating masculine and feminine word forms.

Practical implications

Practically, this research obtains perspectives studied from a textbook, namely the Arabic gender agreement is presented with various examples of noun contexts; textbook authors present book concepts in a particular way with regard to curriculum features and this task design affects student performance, and which approach is more effective for developing student understanding. Empirically, the material is in line with the formulation of competency standards for non-Arabic speakers in Indonesia.

Originality/value

With this computational search, the researcher found a novelty that was considered accurate by taking the praxeology context as a review in the analysis of non-speaking Arabic textbooks, especially in the year 2022 (last data collection in September) there has been no study on this context. So then, the researcher finds other interests in that praxeology can examine more broadly parts of the task of the contents of the book with the approach of relevant linguistic theories.

Details

Journal of Applied Research in Higher Education, vol. 16 no. 4
Type: Research Article
ISSN: 2050-7003

Keywords

Open Access
Article
Publication date: 4 August 2020

Mohamed Boudchiche and Azzeddine Mazroui

We have developed in this paper a morphological disambiguation hybrid system for the Arabic language that identifies the stem, lemma and root of a given sentence words. Following…

Abstract

We have developed in this paper a morphological disambiguation hybrid system for the Arabic language that identifies the stem, lemma and root of a given sentence words. Following an out-of-context analysis performed by the morphological analyser Alkhalil Morpho Sys, the system first identifies all the potential tags of each word of the sentence. Then, a disambiguation phase is carried out to choose for each word the right solution among those obtained during the first phase. This problem has been solved by equating the disambiguation issue with a surface optimization problem of spline functions. Tests have shown the interest of this approach and the superiority of its performances compared to those of the state of the art.

Details

Applied Computing and Informatics, vol. 20 no. 3/4
Type: Research Article
ISSN: 2634-1964

Keywords

Article
Publication date: 10 November 2023

Wagdi Rashad Ali Bin-Hady, Arif Ahmed Mohammed Hassan Al-Ahdal and Samia Khalifa Abdullah

English as a foreign langauge (EFL) students find it difficult to apply the theoretical knowledge they acquire on translation in the practical world. Therefore, this study…

Abstract

Purpose

English as a foreign langauge (EFL) students find it difficult to apply the theoretical knowledge they acquire on translation in the practical world. Therefore, this study explored if training in pretranslation techniques (PTTs) (syntactic parsing) as suggested by Almanna (2018) could improve the translation proficiency of Yemeni EFL students. Moreover, the study also assessed which of the PTTs the intervention helped to develop.

Design/methodology/approach

The study adopted a primarily experimental pre- and posttests research design, and the sample comprised of an intake class with 16 students enrolled in the fourth year, Bachelor in Education (B.Ed), Hadhramout University. Six participants were also interviewed to gather the students' perceptions on using PTTs.

Findings

Results showed that students' performance in translation developed significantly (Sig. = 0.002). All the six PTTs showed development, though subject, tense and aspect developed more significantly (Sig. = 0.034, 0.002, 0.001 respectively). Finally, the study reported students' positive perceptions on the importance of using PTTs before doing any translation tasks.

Originality/value

One of the recurrent errors that can be noticed in Yemeni EFL students' production is their inability to transfer the grammatical elements of sentences from L1 (Arabic) into L2 (English) or the visa versa. The researchers thought though translation is more than the syntactic transmission of one language into another, analyzing the elements of sentences using syntactic and semantic parsing can help students to produce acceptable texts in the target language. These claims would be proved or refuted after analyzing the experiment result of the present study.

Details

Journal of Applied Research in Higher Education, vol. 16 no. 4
Type: Research Article
ISSN: 2050-7003

Keywords

Article
Publication date: 19 July 2024

Giulio Marchena Sekli

The aim of this study is to offer valuable insights to businesses and facilitate better understanding on transformer-based models (TBMs), which are among the widely employed…

Abstract

Purpose

The aim of this study is to offer valuable insights to businesses and facilitate better understanding on transformer-based models (TBMs), which are among the widely employed generative artificial intelligence (GAI) models, garnering substantial attention due to their ability to process and generate complex data.

Design/methodology/approach

Existing studies on TBMs tend to be limited in scope, either focusing on specific fields or being highly technical. To bridge this gap, this study conducts robust bibliometric analysis to explore the trends across journals, authors, affiliations, countries and research trajectories using science mapping techniques – co-citation, co-words and strategic diagram analysis.

Findings

Identified research gaps encompass the evolution of new closed and open-source TBMs; limited exploration across industries like education and disciplines like marketing; a lack of in-depth exploration on TBMs' adoption in the health sector; scarcity of research on TBMs' ethical considerations and potential TBMs' performance research in diverse applications, like image processing.

Originality/value

The study offers an updated TBMs landscape and proposes a theoretical framework for TBMs' adoption in organizations. Implications for managers and researchers along with suggested research questions to guide future investigations are provided.

Details

Kybernetes, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0368-492X

Keywords

Open Access
Article
Publication date: 18 April 2024

Joseph Nockels, Paul Gooding and Melissa Terras

This paper focuses on image-to-text manuscript processing through Handwritten Text Recognition (HTR), a Machine Learning (ML) approach enabled by Artificial Intelligence (AI)…

1343

Abstract

Purpose

This paper focuses on image-to-text manuscript processing through Handwritten Text Recognition (HTR), a Machine Learning (ML) approach enabled by Artificial Intelligence (AI). With HTR now achieving high levels of accuracy, we consider its potential impact on our near-future information environment and knowledge of the past.

Design/methodology/approach

In undertaking a more constructivist analysis, we identified gaps in the current literature through a Grounded Theory Method (GTM). This guided an iterative process of concept mapping through writing sprints in workshop settings. We identified, explored and confirmed themes through group discussion and a further interrogation of relevant literature, until reaching saturation.

Findings

Catalogued as part of our GTM, 120 published texts underpin this paper. We found that HTR facilitates accurate transcription and dataset cleaning, while facilitating access to a variety of historical material. HTR contributes to a virtuous cycle of dataset production and can inform the development of online cataloguing. However, current limitations include dependency on digitisation pipelines, potential archival history omission and entrenchment of bias. We also cite near-future HTR considerations. These include encouraging open access, integrating advanced AI processes and metadata extraction; legal and moral issues surrounding copyright and data ethics; crediting individuals’ transcription contributions and HTR’s environmental costs.

Originality/value

Our research produces a set of best practice recommendations for researchers, data providers and memory institutions, surrounding HTR use. This forms an initial, though not comprehensive, blueprint for directing future HTR research. In pursuing this, the narrative that HTR’s speed and efficiency will simply transform scholarship in archives is deconstructed.

Book part
Publication date: 24 June 2024

Fatma F. S. Said

For education systems to meet the demands of the knowledge economy and prepare their students to be adequately skilled for a more diversified economy in the Arabian Gulf, bold and…

Abstract

For education systems to meet the demands of the knowledge economy and prepare their students to be adequately skilled for a more diversified economy in the Arabian Gulf, bold and innovative initiatives must be taken in order to ensure that these skills contribute towards a sustainable knowledge economy. Gulf states have been preparing for a transition towards, what the World Bank calls ‘a knowledge economy’ (World Bank, 2013) where economies will be run by the skills and knowledge capital of their workforce with technology and its advancement playing a central role. Many governments have identified the education sector as a site in which such ambitions can be met and have therefore introduced models of education where English is the medium of instruction. The rationale behind such a decision is based on multiple reasons, mainly because English is considered by some as the language of science and discovery (see Crystal, 2003).

In all discussions surrounding the overhaul of education systems and the United Nations’ (UN) Sustainable Development Goals (SDGs) namely, goal number four (quality education), the notion of the language through which students learn is a neglected area of inquiry. English is increasingly becoming the language of instruction at the university and progressively at the school level too. This means that young students lose out on adequately learning their mother tongue. The chapter argues that only through forward, bold, and novel decisions to teach students in both Arabic and English can there be a guarantee of a more sustainable knowledge economy across the Gulf.

Details

Transformative Leadership and Sustainable Innovation in Education: Interdisciplinary Perspectives
Type: Book
ISBN: 978-1-83753-536-1

Keywords

Article
Publication date: 13 February 2024

John J. Sailors, Jamal A. Al-Khatib, Tarik Khzindar and Shaza Ezzi

The Islamic world spans many different languages with different language structures. This paper aims to explore one way in which language structure affects consumer response to…

Abstract

Purpose

The Islamic world spans many different languages with different language structures. This paper aims to explore one way in which language structure affects consumer response to the marketing of cobrands.

Design/methodology/approach

Two between subject experiments were conducted using samples of participants from Saudi Arabia and the USA. The first manipulated partner brand category similarity and brand name order, along with the structure of the language used to communicate with the market. The data for this study includes Arabic speakers in Saudi Arabia as well as English speakers in the USA. The second study explores how targeting a population fluent in multiple languages of varied structure nullifies the findings from the first study and uses Latino participants in the USA.

Findings

This study finds that when brands come from similar product categories, name order did not affect cobrand evaluations, but it did when the brands come from dissimilar product categories. Here, evaluations of the cobrand are enhanced when the invited brand is in the position that adjectives occupy in the participant’s language. The authors also find that being proficient in two languages, each with a different default order for adjectives and nouns, quashes the effect of name order otherwise seen when brands from dissimilar product categories engage in cobranding.

Originality/value

By examining the impact of language structure on the effects of cobrand evaluation and conducting studies among participants with differing dominant languages, this research can rule out simple primacy or recency effects.

Details

Journal of Islamic Marketing, vol. 15 no. 7
Type: Research Article
ISSN: 1759-0833

Keywords

Open Access
Article
Publication date: 19 September 2024

Loubna A. Youssef, Usama Elsayed, Sherif Shaheen and Nour Mahmoud Khalifa

This paper focuses on a project to work on the digital library of Arab children's culture for sustainable development (DLACSD).

Abstract

Purpose

This paper focuses on a project to work on the digital library of Arab children's culture for sustainable development (DLACSD).

Design/methodology/approach

This project claims to link the past, present, and future by creating a platform that can grow to include not only works by adults but by children who inspire adults with their imagination and the joys they bring to the world.

Findings

This project addresses in phases the different aspects of the problem of the lack of material for Egyptian/Arab children at different stages in Arabic on the internet (with copyright law in mind). It is time to fill this gap by having a rich repository of stories, plays, games and songs for children in Arabic in a digital library to enrich the life of the child and to inform the world that much that is worthwhile is available in Arabic for parents, teachers, and children to enjoy.

Research limitations/implications

Through reading samples of the works by Abdel-Tawab Youssef (1928–2015) by using the Dublin Core Elements, it will be informative to see how his writings address the United Nations Goals of Sustainable Development way before these Goals were discussed.

Practical implications

Writers for children, librarians, teachers, psychologists, literary critics, illustrators, and parents need a platform that makes material available to promote children’s culture in the Arab world and to introduce the world to what is of value for children in Arabic.

Social implications

Currently, communication brings the world together and although the social media and the new technology have introduced problems that are serious, to say the least, collaborators on all levels must play an active role in redressing the social wrongs, especially those affecting children.

Originality/value

This ongoing project by members of a team who believe in interdisciplinarity and multidisciplinarity has taken the first step to create and develop (DLACSD).

Details

Journal of Humanities and Applied Social Sciences, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2632-279X

Keywords

Article
Publication date: 18 August 2023

Gaurav Sarin, Pradeep Kumar and M. Mukund

Text classification is a widely accepted and adopted technique in organizations to mine and analyze unstructured and semi-structured data. With advancement of technological…

Abstract

Purpose

Text classification is a widely accepted and adopted technique in organizations to mine and analyze unstructured and semi-structured data. With advancement of technological computing, deep learning has become more popular among academicians and professionals to perform mining and analytical operations. In this work, the authors study the research carried out in field of text classification using deep learning techniques to identify gaps and opportunities for doing research.

Design/methodology/approach

The authors adopted bibliometric-based approach in conjunction with visualization techniques to uncover new insights and findings. The authors collected data of two decades from Scopus global database to perform this study. The authors discuss business applications of deep learning techniques for text classification.

Findings

The study provides overview of various publication sources in field of text classification and deep learning together. The study also presents list of prominent authors and their countries working in this field. The authors also presented list of most cited articles based on citations and country of research. Various visualization techniques such as word cloud, network diagram and thematic map were used to identify collaboration network.

Originality/value

The study performed in this paper helped to understand research gaps that is original contribution to body of literature. To best of the authors' knowledge, in-depth study in the field of text classification and deep learning has not been performed in detail. The study provides high value to scholars and professionals by providing them opportunities of research in this area.

Details

Benchmarking: An International Journal, vol. 31 no. 8
Type: Research Article
ISSN: 1463-5771

Keywords

1 – 10 of 118