LLMs4EU

Large Language Models for the European Union


March 2025 - February 2028


Summary

Funding

"Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or the European Commission. Neither the European Union nor the granting authority can be held responsible for them."

Funded by the European Union

Call

DIGITAL-2024-AI-B-06

Topic

DIGITAL-2024-AI-B-06-LANGUAGE

Type of action

DIGITAL Simple Grants

Granting authority

European Commission-EU

Project number

101198470

Consortium

  • Coordinator: Alliance for Language Technologies (ALT-EDIC)
  • 66 member institutions from more then 20 countries
  • Romania is represented in the consortium by ICIA as a research institution and CertSign SA as an industry partner

Partners and Data Providers

Project summary

The LLMs4EU project, coordinated by the Alliance for Language Technologies (ALT-EDIC), aims to preserve European linguistic and cultural diversity in the digital age through cooperation between economic and academic actors. Indeed, some European languages are threatened to be left aside from generative AI development due to the lack of resources to train language models. The project brings together Europe’s leading players in the field of generative AI to ensure that European companies and especially SMEs have access to the tools and resources to become competitive regarding language technologies and especially Large Language Models (LLMs). The goal is to make LLMs and all the tools necessary for their exploitation in all EU languages available in open data by capitalizing on existing European programs and competencies. The tools that will be made accessible to European companies will cover all the steps from training LLMs to ensuring their conformity to European legislation (AI Act, GDPR, etc.). The consortium created around ALT-EDIC includes organizations working in more than 20 countries, which ensures good geographical and linguistic coverage. The project will develop different relevant use cases to demonstrate the capacity of European actors to work together to create adapted tools for different economic sectors, and the coverage of all EU languages will be ensured through the creation and acquisition of the necessary datasets by the project.

Publications

Multimodal Romanian language resources and tools: challenges and perspectives

Mitrofan, M., Irimia, E. & Păiş, V.

2025 Discover Data 3, 26, Springer Nature
link

RADAR: Raman Spectral Analysis Using Deep Learning for Artifact Removal

J. Sjöberg, N. Siminea, A. Păun, A. Lita, M. Larion, I. Petre

Adv. Optical Mater. 2025, 2500736
link

Team

I.C.I.A. Team Members

Dr. Vasile Florian PĂIȘ

Principal Investigator

Scientific Researcher II

Acad. Dan Ioan TUFIȘ


Scientific Researcher I

Dr. Paul-Andrei PĂUN


Scientific Researcher I

Dr. Verginica BARBU MITITELU


Scientific Researcher II

Dr. Radu ION


Scientific Researcher II

Dr. Elena IRIMIA


Scientific Researcher III

Dr. Maria CARP


Scientific Researcher III

Eric CUREA


Scientific Researcher III