Creating a learner corpus infrastructure: Experiences from making learner corpora available

Jennifer-Carmen Frey; Alexander König; Darja Fišer

doi:10.1051/itmconf/20203303006

Open Access

Issue		ITM Web Conf. Volume 33, 2020 International Conference on ICT enhanced Social Sciences and Humanities (ICTeSSH 2020)


Article Number		03006
Number of page(s)		12
Section		Digital Tools and Infrastructures
DOI		https://doi.org/10.1051/itmconf/20203303006
Published online		14 August 2020

ITM Web of Conferences 33, 03006 (2020)

Creating a learner corpus infrastructure: Experiences from making learner corpora available

Jennifer-Carmen Frey¹^*, Alexander König² and Darja Fišer³^,4

¹ Eurac Research, Institute for Applied Linguistics 39100 Bolzano Italy
² CLARIN ERIC 3512 BS Utrecht The Netherlands
³ University of Ljubljana, Department of Translation 1000 Ljubljana Slovenia
⁴ Department of Knowledge Technologies, Jozef Stefan Institute 1000 Ljubljana Slovenia

^* Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.

Abstract

With language resources being collected in many - also small - projects in learner corpus research with considerate amounts of time and ef- fort spent in this activity, making these types of data available in a FAIR way, with standardized and reasoned methods, would contribute substan- tially to the advancement of the field. Additionally, it would answer current demands in transparency, replicability and reusability. In this article, we dis- cuss some of the challenges when making learner corpora FAIR and report from experiences in fulfilling this aim while creating a learner corpus infra- structure at a research institution hosting five different learner corpora.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.