Aitor Soroa (ORCID 0000-0001-8573-2654)

Aitor Soroa (ORCID 0000-0001-8573-2654)

Universidad del País Vasco

H-index: 33

Europe-Spain

About Aitor Soroa (ORCID 0000-0001-8573-2654)

Aitor Soroa (ORCID 0000-0001-8573-2654), With an exceptional h-index of 33 and a recent h-index of 20 (since 2020), a distinguished researcher at Universidad del País Vasco, specializes in the field of Natural Language Processing, Computational Linguistics, Artificial Intelligence, Computer Science.

His recent articles reflect a diverse array of research interests and contributions to the field:

Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset

IKER-GAITU: research on language technology for Basque and other low-resource languages

XNLIeu: a dataset for cross-lingual NLI in Basque

Latxa: An open language model and evaluation suite for Basque

Deep Dive Text Analytics and Natural Language Understanding

State-of-the-Art in Language Technology and Language-centric Artificial Intelligence

Image captioning for effective use of language models in knowledge-based visual question answering

Bloom: A 176b-parameter open-access multilingual language model

Aitor Soroa (ORCID 0000-0001-8573-2654) Information

University

Position

Associate professor

Citations(all)

6246

Citations(since 2020)

2900

Cited By

4264

hIndex(all)

33

hIndex(since 2020)

20

i10Index(all)

64

i10Index(since 2020)

41

Email

University Profile Page

Universidad del País Vasco

Google Scholar

View Google Scholar Profile

Aitor Soroa (ORCID 0000-0001-8573-2654) Skills & Research Interests

Natural Language Processing

Computational Linguistics

Artificial Intelligence

Computer Science

Top articles of Aitor Soroa (ORCID 0000-0001-8573-2654)

Title

Journal

Author(s)

Publication Date

Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset

arXiv preprint arXiv:2403.00587

Ander Salaberria

Gorka Azkune

Oier Lopez de Lacalle

Aitor Soroa

Eneko Agirre

...

2024/3/1

IKER-GAITU: research on language technology for Basque and other low-resource languages

Eneko Agirre

Itziar Aldabe

Xabier Arregi

Mikel Artetxe

Unai Atutxa

...

2024

XNLIeu: a dataset for cross-lingual NLI in Basque

arXiv preprint arXiv:2404.06996

Maite Heredia

Julen Etxaniz

Muitze Zulaika

Xabier Saralegi

Jeremy Barnes

...

2024/4/10

Latxa: An open language model and evaluation suite for Basque

arXiv preprint arXiv:2403.20266

Julen Etxaniz

Oscar Sainz

Naiara Perez

Itziar Aldabe

German Rigau

...

2024/3/29

Deep Dive Text Analytics and Natural Language Understanding

Jose Manuel Gómez-Pérez

Andrés García-Silva

Cristian Berrio

German Rigau

Aitor Soroa

...

2023/6/7

State-of-the-Art in Language Technology and Language-centric Artificial Intelligence

Rodrigo Agerri

Eneko Agirre

Itziar Aldabe

Nora Aranberri

Jose Maria Arriola

...

2023/6/7

Image captioning for effective use of language models in knowledge-based visual question answering

Expert Systems with Applications

Ander Salaberria

Gorka Azkune

Oier Lopez de Lacalle

Aitor Soroa

Eneko Agirre

2023/2/1

Bloom: A 176b-parameter open-access multilingual language model

Teven Le Scao

Angela Fan

Christopher Akiki

Ellie Pavlick

Suzana Ilić

...

2023/11/20

Do Multilingual Language Models Think Better in English?

arXiv preprint arXiv:2308.01223

Julen Etxaniz

Gorka Azkune

Aitor Soroa

Oier Lopez de Lacalle

Mikel Artetxe

2023/8/2

Scaling Laws for BERT in Low-Resource Settings

Gorka Urbizu

Iñaki San Vicente

Xabier Saralegi

Rodrigo Agerri

Aitor Soroa

2023/7

Noisy Channel for Automatic Text Simplification

arXiv preprint arXiv:2211.03152

Oscar M Cumbicus-Pineda

Iker Gutiérrez-Fandiño

Itziar Gonzalez-Dios

Aitor Soroa

2022/11/6

Project European Language Equality (ELE) Grant agreement no. LC-01641480–101018166 ELE Coordinator Prof. Dr. Andy Way (DCU) Co-coordinator Prof. Dr. Georg Rehm (DFKI) Start …

Kepa Sarasola

Itziar Aldabe

Arantza Diaz de Ilarraza

Reviewers Annika Grützner-Zahn

Maria Giagkou

2022/2/28

Documenting geographically and contextually diverse data sources: The bigscience catalogue of language data and resources

arXiv preprint arXiv:2201.10066

Angelina McMillan-Major

Zaid Alyafeai

Stella Biderman

Kimbo Chen

Francesco De Toni

...

2022/1/25

BasqueGLUE: A natural language understanding benchmark for Basque

Gorka Urbizu

Iñaki San Vicente

Xabier Saralegi

Rodrigo Agerri

Aitor Soroa

2022/6

KIDE4I: A generic semantics-based task-oriented dialogue system for human-machine interaction in industry 5.0

Applied Sciences

Cristina Aceta

Izaskun Fernández

Aitor Soroa

2022/1/24

PoeLM: A meter-and rhyme-controllable language model for unsupervised poetry generation

arXiv preprint arXiv:2205.12206

Aitor Ormazabal

Mikel Artetxe

Manex Agirrezabal

Aitor Soroa

Eneko Agirre

2022/5/24

KIDE4Assistant: an Ontology-Driven Dialogue System Adaptation for Assistance in Maintenance Procedures.

Cristina Aceta

Patricia Casla

Izaskun Fernandez

Aitor Soroa

2022

The bigscience roots corpus: A 1.6 tb composite multilingual dataset

Advances in Neural Information Processing Systems

Hugo Laurençon

Lucile Saulnier

Thomas Wang

Christopher Akiki

Albert Villanova del Moral

...

2022/12/6

Principled paraphrase generation with parallel corpora

arXiv preprint arXiv:2205.12213

Aitor Ormazabal

Mikel Artetxe

Aitor Soroa

Gorka Labaka

Eneko Agirre

2022/5/24

IrekiaLFes: a new open benchmark and baseline systems for Spanish automatic text simplification

Itziar Gonzalez-Dios

Iker Gutiérrez-Fandiño

Oscar M Cumbicus-Pineda

Aitor Soroa

2022/12

See List of Professors in Aitor Soroa (ORCID 0000-0001-8573-2654) University(Universidad del País Vasco)