Apertus: Swiss teams release fully open multilingual large language model

EPFL, ETH Zurich and the Swiss National Supercomputing Centre have released Apertus, a fully open multilingual large language model with its architecture, weights and training recipes published. The model is intended to support research, commercial adoption and public oversight of Artificial Intelligence.

EPFL, ETH Zurich and the Swiss National Supercomputing Centre (CSCS) have released Apertus, a large-scale, open, multilingual large language model. The project name Apertus is Latin for open and the team says the entire development process is openly accessible. That includes model architecture, model weights, intermediate checkpoints, training data and training recipes, all accompanied by comprehensive documentation and source code.

The release is positioned as a tool for research, industry and public examination rather than a conventional technology transfer to a single product. Thomas Schulthess, director of CSCS and professor at ETH Zurich, frames the project as a driver of innovation and a way to strengthen expertise across research, society and industry. Imanol Schlag, technical lead of the LLM project and research scientist at ETH Zurich, says Apertus is built for the public good and designed around multilingualism, transparency and compliance.

Access will be provided through several routes. Swisscom will offer a dedicated interface for participants at upcoming Swiss Artificial Intelligence Weeks hackathons and make the model available to Swisscom business customers via its sovereign Swiss artificial intelligence platform. For users outside Switzerland, access will be offered through the Public Artificial Intelligence Inference Utility. The research team released the model under a permissive open-source license that also allows commercial use and says future work will expand the model family, improve efficiency, explore domain-specific adaptations for law, climate, health and education, and add capabilities while maintaining reproducibility and transparency.

75

Impact Score

NVIDIA and Doosan broaden physical Artificial Intelligence partnership

NVIDIA and Doosan Group are expanding work across robotics, autonomous equipment, power infrastructure and advanced materials. The partnership links NVIDIA accelerated computing platforms with Doosan businesses serving industrial automation, energy systems and data center hardware.

Chatbot liability suits test Artificial Intelligence safety law

A Florida lawsuit targeting ChatGPT’s maker signals a new product liability threat for Artificial Intelligence companies. The fight could turn on unsettled questions about platform immunity, speech protections, causation, and federal safety rules.

Canada pushes Artificial Intelligence sovereignty strategy

Canada has unveiled an Artificial Intelligence for All strategy focused on reducing reliance on foreign cloud and Artificial Intelligence providers. The plan mirrors the EU’s new sovereignty push and sets targets for adoption, infrastructure and jobs.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.