The LLMs4EU project, coordinated by the Alliance for Language Technologies (ALT-EDIC), aims to preserve European linguistic and cultural diversity in the digital age through cooperation between economic and academic actors. Indeed, some European languages are threatened to be left aside from generative AI development due to the lack of resources to train language models.
The project brings together Europe’s leading players in the field of generative AI to ensure that European companies and especially SMEs have access to the tools and resources to become competitive regarding language technologies and especially Large Language Models (LLMs).
The goal is to make LLMs and all the tools necessary for their exploitation in all EU languages available in open data by capitalizing on existing European programs and competencies. The tools that will be made accessible to European companies will cover all the steps from training LLMs to ensuring their conformity to European legislation (AI Act, GDPR, etc.).
The consortium created around ALT-EDIC includes organizations working in more than 20 countries, which ensures good geographical and linguistic coverage. The project will develop different relevant use cases to demonstrate the capacity of European actors to work together to create adapted tools for different economic sectors, and the coverage of all EU languages will be ensured through the creation and acquisition of the necessary datasets by the project.
XLAB contributes expertise and skills on the field of language data processing, transformation and standardization. It brings experience and technical know-how on LLMs, Data Spaces and Data Science.