Sociolinguistic repositories as asset: challenges and difficulties in Brazil
Article publication date: 4 July 2022
Issue publication date: 29 November 2022
This paper aims to provide a context for Brazilian Portuguese language documentation and its data collection to establish linguistic repositories from a sociolinguistic overview.
The main sociolinguistic projects that have generated collections of Brazilian Portuguese language data are presented.
The comparison with another situation of repositories (seed vaults) and with the accounting concept of assets is evocated to map the challenges to be overcome in proposing a standardized and professional language repository to host the collections of linguistic data arising from the reported projects and others, in the accordance with the principles of the open science movement.
Thinking about the sustainability of projects to build linguistic documentation repositories, partnerships with the information technology area, or even with private companies, could minimize problems of obsolescence and safeguarding of data, by promoting the circulation and automation of analysis through natural language processing algorithms. These planning actions may help to promote the longevity of the linguistic documentation repositories of Brazilian sociolinguistic research.
The author would like to thank Coordination of Superior Level Staff Improvement (CAPES) and Foundation for Research and Technological Innovation of the State of Sergipe (FAPITEC) for supporting “Falares Sergipanos virtual: variedade, diversidade, contato e os direitos linguísticos” project (CAPES/FAPITEC 10/2016 PROMOB).
Meister Ko. Freitag, R. (2022), "Sociolinguistic repositories as asset: challenges and difficulties in Brazil", The Electronic Library, Vol. 40 No. 5, pp. 607-622. https://doi.org/10.1108/EL-02-2022-0025
Emerald Publishing Limited
Copyright © 2022, Emerald Publishing Limited