Aitor Soroa (ORCID 0000-0001-8573-2654)
Universidad del País Vasco
H-index: 33
Europe-Spain
Top articles of Aitor Soroa (ORCID 0000-0001-8573-2654)
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset | arXiv preprint arXiv:2403.00587 | Ander Salaberria Gorka Azkune Oier Lopez de Lacalle Aitor Soroa Eneko Agirre | 2024/3/1 |
IKER-GAITU: research on language technology for Basque and other low-resource languages | | Eneko Agirre Itziar Aldabe Xabier Arregi Mikel Artetxe Unai Atutxa | 2024 |
XNLIeu: a dataset for cross-lingual NLI in Basque | arXiv preprint arXiv:2404.06996 | Maite Heredia Julen Etxaniz Muitze Zulaika Xabier Saralegi Jeremy Barnes | 2024/4/10 |
Latxa: An open language model and evaluation suite for Basque | arXiv preprint arXiv:2403.20266 | Julen Etxaniz Oscar Sainz Naiara Perez Itziar Aldabe German Rigau | 2024/3/29 |
Deep Dive Text Analytics and Natural Language Understanding | | Jose Manuel Gómez-Pérez Andrés García-Silva Cristian Berrio German Rigau Aitor Soroa | 2023/6/7 |
State-of-the-Art in Language Technology and Language-centric Artificial Intelligence | | Rodrigo Agerri Eneko Agirre Itziar Aldabe Nora Aranberri Jose Maria Arriola | 2023/6/7 |
Image captioning for effective use of language models in knowledge-based visual question answering | Expert Systems with Applications | Ander Salaberria Gorka Azkune Oier Lopez de Lacalle Aitor Soroa Eneko Agirre | 2023/2/1 |
Bloom: A 176b-parameter open-access multilingual language model | | Teven Le Scao Angela Fan Christopher Akiki Ellie Pavlick Suzana Ilić | 2023/11/20 |
Do Multilingual Language Models Think Better in English? | arXiv preprint arXiv:2308.01223 | Julen Etxaniz Gorka Azkune Aitor Soroa Oier Lopez de Lacalle Mikel Artetxe | 2023/8/2 |
Scaling Laws for BERT in Low-Resource Settings | | Gorka Urbizu Iñaki San Vicente Xabier Saralegi Rodrigo Agerri Aitor Soroa | 2023/7 |
Noisy Channel for Automatic Text Simplification | arXiv preprint arXiv:2211.03152 | Oscar M Cumbicus-Pineda Iker Gutiérrez-Fandiño Itziar Gonzalez-Dios Aitor Soroa | 2022/11/6 |
Project European Language Equality (ELE) Grant agreement no. LC-01641480–101018166 ELE Coordinator Prof. Dr. Andy Way (DCU) Co-coordinator Prof. Dr. Georg Rehm (DFKI) Start … | | Kepa Sarasola Itziar Aldabe Arantza Diaz de Ilarraza Reviewers Annika Grützner-Zahn Maria Giagkou | 2022/2/28 |
Documenting geographically and contextually diverse data sources: The bigscience catalogue of language data and resources | arXiv preprint arXiv:2201.10066 | Angelina McMillan-Major Zaid Alyafeai Stella Biderman Kimbo Chen Francesco De Toni | 2022/1/25 |
BasqueGLUE: A natural language understanding benchmark for Basque | | Gorka Urbizu Iñaki San Vicente Xabier Saralegi Rodrigo Agerri Aitor Soroa | 2022/6 |
KIDE4I: A generic semantics-based task-oriented dialogue system for human-machine interaction in industry 5.0 | Applied Sciences | Cristina Aceta Izaskun Fernández Aitor Soroa | 2022/1/24 |
PoeLM: A meter- and rhyme-controllable language model for unsupervised poetry generation | arXiv preprint arXiv:2205.12206 | Aitor Ormazabal Mikel Artetxe Manex Agirrezabal Aitor Soroa Eneko Agirre | 2022/5/24 |
KIDE4Assistant: an Ontology-Driven Dialogue System Adaptation for Assistance in Maintenance Procedures. | | Cristina Aceta Patricia Casla Izaskun Fernandez Aitor Soroa | 2022 |
The bigscience roots corpus: A 1.6 tb composite multilingual dataset | Advances in Neural Information Processing Systems | Hugo Laurençon Lucile Saulnier Thomas Wang Christopher Akiki Albert Villanova del Moral | 2022/12/6 |
Principled paraphrase generation with parallel corpora | arXiv preprint arXiv:2205.12213 | Aitor Ormazabal Mikel Artetxe Aitor Soroa Gorka Labaka Eneko Agirre | 2022/5/24 |
IrekiaLFes: a new open benchmark and baseline systems for Spanish automatic text simplification | | Itziar Gonzalez-Dios Iker Gutiérrez-Fandiño Oscar M Cumbicus-Pineda Aitor Soroa | 2022/12 |