ITM Web Conf.
Volume 33, 2020International Conference on ICT enhanced Social Sciences and Humanities (ICTeSSH 2020)
|Number of page(s)||12|
|Section||Digital Tools and Infrastructures|
|Published online||14 August 2020|
Creating a learner corpus infrastructure: Experiences from making learner corpora available
1 Eurac Research, Institute for Applied Linguistics 39100 Bolzano Italy
2 CLARIN ERIC 3512 BS Utrecht The Netherlands
3 University of Ljubljana, Department of Translation 1000 Ljubljana Slovenia
4 Department of Knowledge Technologies, Jozef Stefan Institute 1000 Ljubljana Slovenia
* Corresponding author: firstname.lastname@example.org
With language resources being collected in many - also small - projects in learner corpus research with considerate amounts of time and ef- fort spent in this activity, making these types of data available in a FAIR way, with standardized and reasoned methods, would contribute substan- tially to the advancement of the field. Additionally, it would answer current demands in transparency, replicability and reusability. In this article, we dis- cuss some of the challenges when making learner corpora FAIR and report from experiences in fulfilling this aim while creating a learner corpus infra- structure at a research institution hosting five different learner corpora.
© The Authors, published by EDP Sciences, 2020
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.