Overview of GUA-SPA at IberLEF 2023: Guarani-Spanish Code Switching Analysis

Document Type

Article

Publication Title

Procesamiento del Lenguaje Natural

Abstract

We present the first shared task for detecting and analyzing code-switching in Guarani and Spanish, GUA-SPA at IberLEF 2023. The challenge consisted of three tasks: identifying the language of a token, NER, and a novel task of classifying the way a Spanish span is used in the code-switched context. We annotated a corpus of 1500 texts extracted from news articles and tweets, around 25 thousand tokens, with the information for the tasks. Three teams took part in the evaluation phase, obtaining in general good results for Task 1, and more mixed results for Tasks 2 and 3.

First Page

321

Last Page

328

DOI

10.26342/2023-71-25

Publication Date

9-1-2023

Keywords

Code-switching, Guarani, NER, Spanish

Comments

IR conditions: non-described

Share

COinS