Search results

1 – 10 of 25
Article
Publication date: 3 November 2020

Jagroop Kaur and Jaswinder Singh

Abstract

Purpose

Normalization is an important step in all natural language processing applications that handle social media text. Text from social media poses problems that are not present in regular text. Recently, a considerable amount of work has been done in this direction, but mostly for the English language. People who are not native English speakers code-mix text with their native language and post it on social media using the Roman script. This kind of text further aggravates the problem of normalization. This paper discusses the concept of normalization with respect to code-mixed social media text and proposes a model to normalize such text.

Design/methodology/approach

The system is divided into two phases: candidate generation and most probable sentence selection. The candidate generation task is treated as a machine translation task in which the Roman text is the source language and the Gurmukhi text is the target language. A character-based translation system is proposed to generate candidate tokens. Once candidates are generated, the second phase uses beam search to select the most probable sentence based on a hidden Markov model.
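The second phase described above can be illustrated with a minimal sketch. It is not the authors' implementation: the candidate lists, the emission probabilities, the bigram transition table and the names beam_search and transition are all hypothetical, chosen only to show how beam search keeps the few highest-scoring partial sentences while moving through HMM-style emission and transition scores.

```python
import math

def beam_search(candidates, transition, beam_width=3):
    """Pick the most probable token sequence.

    candidates: one list per input position, each holding
                (token, emission_probability) pairs.
    transition: function (previous_token, token) -> probability.
    Returns (best_sequence, log_probability).
    """
    beams = [([], 0.0)]  # (partial sequence, log score)
    for options in candidates:
        expanded = []
        for seq, score in beams:
            prev = seq[-1] if seq else "<s>"
            for token, emit_p in options:
                # Floor the product so log() is always defined.
                p = max(transition(prev, token) * emit_p, 1e-12)
                expanded.append((seq + [token], score + math.log(p)))
        # Keep only the beam_width best partial sentences.
        expanded.sort(key=lambda b: b[1], reverse=True)
        beams = expanded[:beam_width]
    return beams[0]

# Hypothetical candidates for the Roman input "main ghar".
candidates = [
    [("ਮੈਂ", 0.6), ("ਮੈਨ", 0.4)],
    [("ਘਰ", 0.7), ("ਗਰ", 0.3)],
]

# Hypothetical bigram probabilities; unseen pairs get a small default.
bigrams = {("<s>", "ਮੈਂ"): 0.5, ("ਮੈਂ", "ਘਰ"): 0.4}

def transition(prev, token):
    return bigrams.get((prev, token), 0.01)

best_seq, best_score = beam_search(candidates, transition, beam_width=2)
print(" ".join(best_seq), round(best_score, 4))
```

Working in log space avoids numeric underflow on long sentences, and the beam width trades search cost against the risk of pruning the best sentence early.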

Findings

Character error rate (CER) and bilingual evaluation understudy (BLEU) score are reported. The proposed system has been compared with Akhar software and the RB_R2G system, which are also capable of transliterating Roman text to Gurmukhi. The proposed system outperforms Akhar software. The CER and BLEU scores are 0.268121 and 0.6807939, respectively, for ill-formed text.
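For readers unfamiliar with the first metric, CER is commonly computed as the character-level edit distance between the system output and the reference, divided by the reference length. The sketch below assumes that common definition; the abstract does not state the exact formula the authors used, and the function names are illustrative.

```python
def levenshtein(ref, hyp):
    """Minimum number of insertions, deletions and substitutions
    needed to turn hyp into ref (row-by-row dynamic programming)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]

def cer(ref, hyp):
    """Character error rate: edit distance over reference length."""
    if not ref:
        raise ValueError("reference must not be empty")
    return levenshtein(ref, hyp) / len(ref)

print(cer("kitten", "sitting"))  # 3 edits over 6 reference characters
```

Python strings are compared code point by code point here, so a Gurmukhi vowel sign or other combining mark counts as its own character; a grapheme-based count would give different values.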

Research limitations/implications

It was observed that the system produces dialectal variations of a word, or the word with minor errors such as a missing diacritic. A spell checker can improve the output of the system by correcting these minor errors. Extensive experimentation is needed to optimize the language identifier, which will further help in improving the output. The language model also needs further exploration. Inclusion of wider context, particularly from social media text, is an important area that deserves further investigation.

Practical implications

The practical implications of this study are: (1) development of a parallel dataset containing Roman and Gurmukhi text; (2) development of a dataset annotated with language tags; (3) development of the normalizing system, which is the first of its kind, proposes a translation-based solution for normalizing noisy social media text from Roman to Gurmukhi and can be extended to any pair of scripts; and (4) use of the proposed system for better analysis of social media text. Theoretically, our study helps in better understanding text normalization in the social media context and opens the door to further research in multilingual social media text normalization.

Originality/value

Existing research focuses on normalizing monolingual text. This study contributes towards the development of a normalization system for multilingual text.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 13 no. 4
Type: Research Article
ISSN: 1756-378X

Article
Publication date: 17 May 2021

Maya Deori, Vinit Kumar and Manoj Kumar Verma

Abstract

Purpose

The purpose of the study is to evaluate certain characteristics of the videos about the software Koha and DSpace posted on YouTube. YouTube allows content creators to share their knowledge and experience through their content, which has become much more beneficial to information seekers. Nowadays, people very often search YouTube for answers to queries or for tutorial videos to gain a better understanding of a topic. Sentiment analysis of the viewers' opinions of the videos is another purpose of this study.

Design/methodology/approach

The dataset for evaluating the characteristics of the Koha and DSpace videos was extracted using Webometric Analyst with the YouTube API. Once data retrieval was completed, manual verification was carried out to filter out spam videos unrelated to the scope. After the authentic, relevant videos were confirmed, the comments on each video were extracted using Webometric Analyst, with the video's ID as the query. For opinion mining, the Parallel Dots API web service was used in Google Sheets as an add-on function. The sentiment, multilingual sentiment, emotion, intention and word frequency of the viewers' opinions were examined with the help of certain default functionalities.

Findings

Webometric Analyst extracted a total of 461 and 397 videos of Koha and DSpace, respectively, uploaded on the YouTube platform. The findings of the study indicate that the growth rate of videos on Koha is decreasing, while the number of videos uploaded on DSpace has gradually increased over the last 10 years. The highest number of videos fall in the 1–20 min duration category, are mostly in high definition (HD), carry the standard YouTube license and are predominantly in the English language. A total of 2043 and 862 comments were extracted from the Koha and DSpace videos, respectively. Sentiment analysis of these comments shows that “Positive” comments are the most common, that “Happy” is the most frequently detected emotion and that “Feedback” is the most common intention on both Koha and DSpace videos. The top word frequencies indicate that users of both software packages use the comments section of the videos on YouTube to ask for and provide troubleshooting help.

Research limitations/implications

The present study has some limitations. The dataset includes only those videos whose title, description or keywords sections contained the query terms “Koha” or “DSpace”, so some videos related to these software packages may have been left out of the dataset.

Originality/value

This is the first paper to evaluate the characteristics and sentiment of videos on both Koha and DSpace. Through this, the popularity, likes and dislikes and the impact of the contents of the uploaded videos will be disclosed. Creators can make improvements by referring to it, and information seekers will adapt to the use of correct and authentic information.

Details

Library Hi Tech, vol. 41 no. 3
Type: Research Article
ISSN: 0737-8831

Book part
Publication date: 17 September 2021

Alice Lam

Abstract

The experience of “misfit” between individuals’ professional identities and their work roles or work contexts is common in career transitions. In contrast to extant literature that focuses on the identity struggle of these people, this study examines how problematic identity dynamics associated with misfit motivate the shift toward the development of positive identities and induce creativity in meaning-making and change-oriented actions. It builds on the insights of Mead (1934) and Joas (1996) who view creativity as the most significant aspect of human agency, and the identity work literature that highlights the agentic process in identity construction. The study looks at a group of “pracademics” whose career trajectories deviate from the prototypical patterns in academia. It examines the identity work strategies that these people undertake to overcome misfit and shows how identity work liberates them from the limits of a particular identity, and facilitates new activities that alter aspects of their work contexts. The study advances our understanding of identity work as a creative human endeavor and sheds new light on the change-oriented agency of misfits.

Details

Organizing Creativity in the Innovation Journey
Type: Book
ISBN: 978-1-83982-874-4

Book part
Publication date: 15 November 2016

Kathleen M. Alley and Barbara J. Peterson

Abstract

Purpose

To review and synthesize findings from peer-reviewed research related to students’ sources of ideas for writing, and instructional dimensions that affect students’ development of ideas for composition in grades K-8.

Design/methodology/approach

The ideas or content expressed in written composition are considered critical to ratings of writing quality. We utilized a Systematic Mixed Studies Review (SMSR) methodological framework (Heyvaert, Maes, & Onghena, 2011) to explore K-8 students’ ideas and writing from a range of theoretical and methodological perspectives.

Findings

Students’ ideas for writing originate from a range of sources, including teachers, peers, literature, content area curriculum, autobiographical/life experiences, popular culture/media, drawing, and play. Intertextuality, copying, social dialogue, and playful peer interactions are productive strategies K-8 writers use to generate ideas for composing, in addition to strategies introduced through planned instruction. Relevant dimensions of instruction include motivation to write, idea planning and organization, as well as specific instructional strategies, techniques, and tools to facilitate idea generation and selection within the composition process.

Practical implications

A permeable curriculum and effective instructional practices are crucial to support students’ access to a full range of ideas and knowledge-based resources, and help them translate these into written composition. Instructional practices for idea development and writing: (a) connect reading and writing for authentic purposes; (b) include explicit modeling of strategies for planning and “online” generation of ideas throughout the writing process across genre; (c) align instructional focus across reading, writing, and other curricular activities; (d) allow for extended time to write; and (e) incorporate varied, flexible participation structures through which students can share ideas and receive teacher/peer feedback on writing.

Details

Writing Instruction to Support Literacy Success
Type: Book
ISBN: 978-1-78635-525-6

Article
Publication date: 3 August 2009

Sumathi Renganathan

Abstract

In this paper I explore the notion of reflexivity in two main domains. In the first, I explore my struggles as a trained objective, positivist researcher trying to embrace and appreciate subjective qualitative research practices. In the second section, I explore the dynamic relationship between myself, the researcher and my participants, focusing on issues related to ethnicity and power. Generally, research that explores ethnicity and power relationships commonly depicts the researcher as the privileged self compared with the participants as the marginalised other. However, in this paper I illustrate how this relationship in a multiethnic, multicultural and multilingual research context is much more complex and multifaceted than usually acknowledged. Moreover, this was further complicated by the researcher’s own experience in relation to the issue under investigation, which was different from that of the participants.

Details

Qualitative Research Journal, vol. 9 no. 2
Type: Research Article
ISSN: 1443-9883

Article
Publication date: 15 November 2018

Meg Gebhard and Holly Graham

Abstract

Purpose

This paper aims to analyze how middle schoolers developed a critical awareness of language while participating in a curricular unit informed by systemic functional linguistics (SFL). This unit was developed around understanding and taking action to protect a local bat population in the context of school reforms shaping teaching and learning in the USA. It was designed to support a heterogeneous class of seventh graders in learning to read scientific explanations, write letters to government officials and develop a functional metalanguage to support them in analyzing how language simultaneously constructs ideas, enacts power dynamics and manages the flow of information in disciplinary texts. The questions guiding this study are: How do students use SFL metalanguage in text production and interpretation practices? Do their uses of SFL metalanguage support critical language awareness and reflection? And, if so, in what ways?

Design/methodology/approach

This study uses ethnographic methods to conduct teacher action research. Data include classroom transcripts, student writing samples and interviews.

Findings

The findings illustrate how students engaged with SFL, often playfully, to create their own student-generated functional metalanguage in highly productive ways.

Research limitations/implications

This study contributes to a growing body of scholarship that suggests SFL metalanguage can provide teachers and students with a powerful semiotic toolkit that enables them to navigate the demands of teaching and learning in the context of the Standardization and Accountability movement.

Practical implications

This study has implications for teachers’ professional development and students’ disciplinary literacy development in the context of school reform.

Originality/value

To date, few studies have explored how students take up and transform SFL metalanguage into a tool for critical reflection, especially adolescents.

Details

English Teaching: Practice & Critique, vol. 17 no. 4
Type: Research Article
ISSN: 1175-8708

Article
Publication date: 19 October 2018

Hengyi Fu

Abstract

Purpose

With the increasing number of online multilingual resources, cross-language information retrieval (CLIR) has drawn much attention from the information retrieval (IR) research community. However, few studies have examined how and why multilingual searchers seek information in two or more languages, specifically how they switch and mix language in queries to get satisfying results. The purpose of this paper is to focus on Chinese–English bilinguals’ intra-sentential code-switching behaviors in online searches. The scenarios and reasons of code-switching, factors that may affect code-switching, the patterns of mixed language query formulation and reformulation and how current IR systems and other search tools can facilitate such information needs were examined.

Design/methodology/approach

In-depth semi-structured interviews were used as the research method. In total, 30 participants were recruited based on their English proficiency, location and profession, using a purposive sampling method.

Findings

Four scenarios and four reasons for using Chinese–English mixed language queries to cover information needs were identified, and results suggest that linguistic and cultural/social factors are of equivalent importance in code-switching behaviors. English terms and Chinese terms in queries play different roles in searches, and mixed language queries are irreplaceable by either single language queries or other search facilitating features. Findings also suggest current search engines and tools need greater emphasis in the user interface and more user education is required.

Originality/value

This study presents a qualitative analysis of bilinguals’ code-switching behaviors in online searches. Findings are expected to advance the theoretical understanding of bilingual users’ search strategies and interactions with IR systems, and provide insights for designing more effective IR systems and tools to discover multilingual online resources, including cross-language controlled vocabularies, personalized CLIR tools and mixed language query assistants.

Details

Aslib Journal of Information Management, vol. 71 no. 1
Type: Research Article
ISSN: 2050-3806

Open Access
Article
Publication date: 25 May 2023

Hafizah Hamdan

Abstract

Purpose

This paper aims to investigate how Bruneian secondary school students employ code-switching in peer interactions. The functions of students' code-switching were analysed using Reyes' (2004) and Appel and Muysken's (2005) typologies.

Design/methodology/approach

The data collected are based on audio-recorded group discussions designed to elicit students’ code-switched utterances.

Findings

The results indicate that the students used 11 functions of code-switching: referential, discourse marker, clarification, expressive, quotation imitation, turn accommodation, insistence, emphasis, question shift, situation shift and poetic.

Research limitations/implications

As the study only focusses on a specific secondary school, its results cannot be generalised to all secondary school students in Brunei.

Originality/value

This paper hopes to provide insight into how students' code-switching can be seen in a positive light. Moreover, understanding how students use code-switching in the classroom is essential for successful knowledge transfer and for cultivating competent bilinguals, which is what the country's education system aims for.

Details

Southeast Asia: A Multidisciplinary Journal, vol. 23 no. 1
Type: Research Article
ISSN: 1819-5091

Book part
Publication date: 30 November 2017

Wenyao (Will) Zhao

Abstract

This research explores two interconnected questions: (1) How do we approach stylistic features of multimodal rhetorical artifacts such as protest posters? (2) Do said artifacts designed for different purposes exhibit systematic stylistic differences? Drawing on Charles Sanders Peirce’s semiotic categorization, this study develops a framework for examining concision, one of the primary stylistic considerations for multimodal rhetorical artifacts such as protest posters. This paper illustrates the use of this framework by exploring the correlation between rhetorical purpose and concision in posters created and disseminated before and during the 2011–2012 Québécois student movement. This study fine-tunes our existing knowledge on multimodality with style sensitivity, and demonstrates how an economy-of-sign based semiotic approach could enrich the empirical examination of multimodal rhetorical artifacts by generating more controlled interpretations.

Details

Multimodality, Meaning, and Institutions
Type: Book
ISBN: 978-1-78743-330-4

Article
Publication date: 4 August 2020

Amr Kotb, Hany Elbardan and Hussein Halabi

Abstract

Purpose

This paper reviews the field of internal auditing (IA) post-Enron to develop insights into how IA research has developed, offer a critique of the research to date and identify ways that future research can help to advance IA.

Design/methodology/approach

A structured literature review (SLR) was used to analyse 471 papers from 64 journals published between 2005 and 2018 based on a number of criteria, namely author, journal type, journal location, year, theme, theory, nature of research, research setting, regional focus, method and citations.

Findings

The IA literature has not significantly contributed to knowledge of the internal audit function (IAF), and one still knows relatively little about the factors that contribute to making the impact of IA practice effective and measurable. The IA literature is US-dominated (authors and journals), focussed on the American context (publicly listed companies), reliant on positivist analyses and largely makes no explicit reference to theory. Central regions (emerging economies) and key organisational settings (private SMEs and not-for-profit organisations) are largely absent in prior IA research. This paper evaluates and identifies avenues through which future research can help to advance IA in order to address emerging challenges in the field.

Originality/value

This is the first comprehensive review to analyse IA research in the post-Enron period (2005–2018). The findings are relevant to researchers who are looking for appropriate research outlets and emerging scholars who wish to identify their own research directions.

Details

Accounting, Auditing & Accountability Journal, vol. 33 no. 8
Type: Research Article
ISSN: 0951-3574
