Google and DeepMind unveil VaultGemma, largest open-source Artificial Intelligence LLM

Google and DeepMind released VaultGemma, a one-billion-parameter Artificial Intelligence language model built from scratch with differential privacy and available on Hugging Face and Kaggle. It is aimed at research and real-world use in sensitive sectors such as healthcare and finance.

Google and DeepMind have unveiled VaultGemma, a one-billion-parameter Artificial Intelligence language model built from scratch with differential privacy. The teams describe VaultGemma as the largest open-source model of its kind and say it is available for free on Hugging Face and Kaggle. The model is positioned for research and practical deployments in sensitive areas such as healthcare and finance, where keeping training and user data private is a primary concern.

VaultGemma runs on Googleu2019s Gemma architecture and uses Multi-Query Attention with a 1,024-token context window, a configuration intended to balance inference speed with privacy protections. Training was conducted across 2,048 TPUv6e chips and followed new privacy scaling rules. According to the teams, the training process matched predicted accuracy targets while preserving stronger privacy guarantees than models that apply privacy measures only after pre-training.

On standard benchmarks such as ARC-C and TriviaQA, VaultGemma scores roughly on par with non-private models from five years ago, a performance level described as solid though not state of the art. The crucial distinction is that VaultGemma bakes differential privacy into pre-training rather than applying it only at the end, reducing the risk of memorizing or leaking training data. Google and DeepMind frame the release as a new standard for privacy-first Artificial Intelligence models that could be especially valuable for developers and organizations that must meet strict data protection requirements.

70

Impact Score

Saudi Artificial Intelligence startup launches Arabic LLM

Misraj Artificial Intelligence unveiled Kawn, an Arabic large language model, at AWS re:Invent and launched Workforces, a platform for creating and managing Artificial Intelligence agents for enterprises and public institutions.

Introducing Mistral 3: open artificial intelligence models

Mistral 3 is a family of open, multimodal and multilingual Artificial Intelligence models that includes three Ministral edge models and a sparse Mistral Large 3 trained with 41B active and 675B total parameters, released under the Apache 2.0 license.

NVIDIA and Mistral Artificial Intelligence partner to accelerate new family of open models

NVIDIA and Mistral Artificial Intelligence announced a partnership to optimize the Mistral 3 family of open-source multilingual, multimodal models across NVIDIA supercomputing and edge platforms. The collaboration highlights Mistral Large 3, a mixture-of-experts model designed to improve efficiency and accuracy for enterprise artificial intelligence deployments starting Tuesday, Dec. 2.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.