Argitalpenak

2025

BasqBBQ: A QA Benchmark for Assessing Social Biases in LLMs for Basque, a Low-Resource Language. Saralegi, X., & Zulaika, M. In Proceedings of the 31st International Conference on Computational Linguistics (pp. 4753-4767). 2025, urtarrila.

Generating Multiple-Choice Questions in Spanish and Basque using LLMs: A Comparative Manual Evaluation. López de Lacalle, M., Saralegi, X., & Saizar, A. Procesamiento del Lenguaje Natural, 74, 179-190. 2025.

Interfaces conversacionales, tecnolenguajes y tecnodesigualdades. RECERCA. Tabarés Gutiérrez, R. Revista De Pensament I Anàlisi, 30(1). 2025.

2024

XNLIeu: a dataset for cross-lingual NLI in Basque. Heredia, M., Etxaniz, J., Zulaika, M., Saralegi, X., Barnes, J., & Soroa, A. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) (pp. 4177–4188). NAACL 2024.

How Well Can BERT Learn the Grammar of an Agglutinative and Flexible-Order Language? The Case of Basque. Urbizu, G., Zulaika, M., Saralegi, X., Corral A. In proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). Torino, Italy. 2024.

GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction. Sainz, O., García-Ferrero, I., Agerri, R., de Lacalle, O. L., Rigau, G., & Agirre, E. (2024). ICLR 2024.

Plataformización, automatización y aceleración en los medios sociales. Tabarés Gutiérrez, R. Daimon Revista Internacional de Filosofia, (93), 137–152. 2024.

Mitigating Toxicity in Dialogue Agents through Adversarial Reinforcement Learning. Villate-Castillo, G., Sanz, B., & Del Ser, J. AEQUITAS@ECAI 2024.

2023

Strategies for bilingual intent classification for small datasets scenarios. López de Lacalle, M., Saralegi, X., Saizar, A., Urbizu, G. and Corral, A. Procesamiento del Lenguaje Natural, Revista nº 71, septiembre de 2023, pp. 137-147.

Scaling Laws for BERT in Low-Resource Settings. Urbizu, G., San Vicente, I., Saralegi, X., Agerri, R. and Soroa, A. In Findings of the Association for Computational Linguistics: ACL 2023, pages 7771–7789 July 9-14, 2023

Not Enough Data to Pre-train Your Language Model? MT to the Rescue! Urbizu, G., San Vicente, I., Saralegi,X., and Corral, A. In Findings of the Association for Computational Linguistics: ACL2023, pages 3826–3836 July 9-14, 2023