Sarvam-M: Indian Startup Unveils Powerful Homegrown Language Model

Indian firm Sarvam has launched Sarvam-M, a large language model built for Indian languages and education, drawing both attention and debate in the AI space.

Indian tech startup Sarvam has launched Sarvam-M, its flagship large language model, aiming to fill a niche in the global AI landscape by focusing on Indian languages, mathematics, and programming tasks. The model, which is built on Mistral Small and has 24 billion parameters, is designed to handle complex queries and deliver conversational, multilingual support tailored for Indian users. Sarvam-M supports ten Indian languages, including Hindi, Bengali, and Gujarati, and is tuned for tasks such as math problem-solving, programming, and machine translation.

The development of Sarvam-M followed a multi-stage training process. Supervised fine-tuning came first, using high-quality, culturally sensitive data to keep the model relevant for both everyday conversation and advanced reasoning tasks. Reinforcement learning with verifiable rewards followed, further improving instruction-following, logical reasoning, and programming skills. The final phase, inference optimisation, focused on speeding up the model through techniques such as FP8 quantisation, although handling high-traffic deployments remains a challenge. Sarvam-M is positioned for use in conversational AI tools, educational platforms, virtual assistants, and machine translation services.
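For readers unfamiliar with the term, a "verifiable reward" is one that a program can check automatically, such as whether the final number in a model's answer matches a known solution. The Python sketch below illustrates the general idea only; it is not Sarvam's training code, and the function names `extract_final_answer` and `verifiable_reward` are hypothetical.

```python
import re
from fractions import Fraction
from typing import Optional


def extract_final_answer(completion: str) -> Optional[str]:
    """Return the last number-like token in the text, or None if there is none."""
    # Matches integers, decimals, and simple fractions such as 42, -3.5, or 7/2.
    matches = re.findall(r"-?\d+(?:\.\d+)?(?:/\d+)?", completion)
    return matches[-1] if matches else None


def verifiable_reward(completion: str, reference: str) -> float:
    """Illustrative reward: 1.0 if the final number equals the reference, else 0.0."""
    answer = extract_final_answer(completion)
    if answer is None:
        return 0.0
    try:
        # Fraction compares values exactly, so "0.5" and "1/2" count as equal.
        return 1.0 if Fraction(answer) == Fraction(reference) else 0.0
    except (ValueError, ZeroDivisionError):
        # Malformed numbers (for example "1/0") earn no reward.
        return 0.0
```

Because the check looks only at the final number, the same function can score an answer written in English or in romanised Hindi, for example `verifiable_reward("Uttar 7/2 hai", "3.5")`.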

Benchmark tests showed strong performance in Indian languages and reasoning: Sarvam-M outperformed Meta’s Llama-4 Scout and was comparable to larger models such as Llama 3.3 70B and Google’s Gemma 3 27B. In particular, it showed an improvement of more than 86 percent on queries that combine math with romanised Indian languages. Its English knowledge scores, however, were slightly lower than those of some counterparts. Despite these technical achievements, the model met a muted reception on release, with just 334 downloads on Hugging Face in the first two days, prompting criticism from parts of the developer and investor community. Sarvam AI’s founders and supporters defended the model’s benchmarks and training methodology, emphasising its significance for India’s sovereign AI ambitions. Industry voices, including Zoho’s Sridhar Vembu, called for patience and continued innovation, framing Sarvam-M as both a milestone and a foundation for further advances in locally relevant AI technology.


