Indian tech startup Sarvam has launched Sarvam-M, its flagship large language model, aiming to fill a unique niche in the global Artificial Intelligence landscape by focusing on Indian languages, mathematics, and programming tasks. The model, which is based on the Mistral Small architecture and scaled up to 24 billion parameters, is designed to handle complex queries and deliver conversational, multilingual support tailored for Indian users. Sarvam-M supports ten Indian languages—including Hindi, Bengali, and Gujarati—and excels in tasks like math problem-solving, programming, and machine translation.
The development of Sarvam-M followed a rigorous, multi-stage training process. Initial supervised fine-tuning provided high-quality, culturally sensitive data to ensure the model’s relevance for daily conversation and advanced reasoning tasks. This was complemented by reinforcement learning with verifiable rewards, further enhancing instruction-following, logical thinking, and programming skills. The final phase, inference optimisation, focused on increasing the model’s performance and speed through techniques like FP8 quantisation, albeit with ongoing challenges in handling high-traffic deployments. Sarvam-M is positioned for use in conversational Artificial Intelligence tools, educational platforms, virtual assistants, and machine translation services.
Benchmark tests revealed Sarvam-M’s strong performance in Indian languages and reasoning: it outperformed Meta’s Llama-4 Scout and matched larger models such as Llama 3.3 70B and Google’s Gemma 3 27B. Particularly, it demonstrated major improvements—over 86 percent—in handling hybrid math and romanised Indian language queries. However, its English knowledge scores were slightly lower than some counterparts. Despite technical achievements, the model faced muted reception upon release, with just 334 downloads on Hugging Face in the first two days, prompting criticism from parts of the developer and investor community. Sarvam AI’s founders and supporters defended the model’s benchmarks and training methodology, emphasizing its developmental significance for India’s sovereign Artificial Intelligence ambitions. Industry voices, including Zoho’s Sridhar Vembu, highlighted the need for patience and ongoing innovation, framing Sarvam-M as both a milestone and a foundation for further advancements in locally relevant Artificial Intelligence technology.