The race to make AI as multilingual as Europe

Europe´s pursuit of truly multilingual Artificial Intelligence is heating up, as researchers and startups tackle language bias in leading language models.

Europe stands out as one of the most linguistically diverse continents, with the European Union alone recognizing 24 official languages, countless unofficial ones, and hundreds more when including dialects and languages brought by migrants. Despite this diversity, English overwhelmingly dominates the digital landscape, the result of early technological leadership and cultural export from the United States. This is reflected in the startling fact that half of all web content is in English, a language natively spoken by just 6% of the global population, creating a cycle in which language models trained on internet data inherit this English-centric bias.

As Artificial Intelligence increasingly powers communications and productivity tools, the language imbalance is being baked into large language models (LLMs), further marginalizing low-resource European languages. European startups and research consortia are working to counter this. Hugging Face, with French roots, has played a pivotal role through its open model hosting, the BLOOM project, and partnerships with Meta to advance translation technologies. The French company Mistral has evolved its models to explicitly support multiple European languages, notably addressing issues after their tools failed to respond in non-English languages. EuroLLM, a project led by Portuguese-based Unbabel, aims to support every official EU language as well as widely spoken migrant and trade languages, leveraging data sources like Europarl to improve coverage for lesser-used tongues.

Several other initiatives amplify Europe´s push for linguistic inclusiveness in Artificial Intelligence. OpenLLM Europe fosters a developer community for low- and medium-resource language models, while OpenEuroLLM and the Lumi and Silo projects focus on scalable, open source support—especially for Nordic languages. Training data scarcity remains a chief obstacle for these efforts, as does ensuring that cross-lingual techniques don´t result in stilted or non-native outputs. There are also ongoing debates over the true openness of models labeled as such. Research and anecdotal evidence suggest that while many Europeans still default to English for work-related tasks, the adoption of multilingual Artificial Intelligence tools is on the rise, with platforms like Hugging Face seeing growing activity in non-English models. The ultimate vision is a new digital era where no language—or its speakers—is left behind, making Artificial Intelligence accessible and effective for all corners of Europe and beyond.

73

Impact Score

UK and EU Artificial Intelligence regulatory outlook for May 2026

The UK is moving ahead with targeted Artificial Intelligence measures in policing, online safety, cyber security and copyright policy, while the EU is refining how the EU Artificial Intelligence Act will apply in practice. Consultations, new offences and implementation deadlines are shaping the next phase of compliance on both sides.

Germany sets out national implementation of the Artificial Intelligence Act

Germany has published a draft law to implement the European Artificial Intelligence Act through new supervisory structures, clearer institutional responsibilities, and measures designed to support innovation. The proposal puts the Federal Network Agency at the center of enforcement while preserving sector-specific oversight in sensitive fields.

ECB warns banks about new Artificial Intelligence security risks

The European Central Bank has called major banks to an emergency meeting over cybersecurity risks tied to advanced Artificial Intelligence models. Regulators want banks to speed up security updates as newer tools make it easier to find and exploit vulnerabilities.

Anthropic keeps Mythos restricted after vulnerability findings

Anthropic says its cybersecurity model Mythos is powerful at uncovering software flaws but remains too risky for broad release. Early testing found large numbers of vulnerabilities across major software and open source projects, while fixes have lagged far behind discoveries.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.