Sarvam-M: Indian Startup Unveils Powerful Homegrown Language Model

Indian firm Sarvam has launched Sarvam-M, a large language model built for Indian languages and education, sparking attention and debate in the Artificial Intelligence space.

Indian tech startup Sarvam has launched Sarvam-M, its flagship large language model, aiming to fill a unique niche in the global Artificial Intelligence landscape by focusing on Indian languages, mathematics, and programming tasks. The model, which is based on the Mistral Small architecture and scaled up to 24 billion parameters, is designed to handle complex queries and deliver conversational, multilingual support tailored for Indian users. Sarvam-M supports ten Indian languages—including Hindi, Bengali, and Gujarati—and excels in tasks like math problem-solving, programming, and machine translation.

The development of Sarvam-M followed a rigorous, multi-stage training process. Initial supervised fine-tuning provided high-quality, culturally sensitive data to ensure the model’s relevance for daily conversation and advanced reasoning tasks. This was complemented by reinforcement learning with verifiable rewards, further enhancing instruction-following, logical thinking, and programming skills. The final phase, inference optimisation, focused on increasing the model’s performance and speed through techniques like FP8 quantisation, albeit with ongoing challenges in handling high-traffic deployments. Sarvam-M is positioned for use in conversational Artificial Intelligence tools, educational platforms, virtual assistants, and machine translation services.

Benchmark tests revealed Sarvam-M’s strong performance in Indian languages and reasoning: it outperformed Meta’s Llama-4 Scout and matched larger models such as Llama 3.3 70B and Google’s Gemma 3 27B. Particularly, it demonstrated major improvements—over 86 percent—in handling hybrid math and romanised Indian language queries. However, its English knowledge scores were slightly lower than some counterparts. Despite technical achievements, the model faced muted reception upon release, with just 334 downloads on Hugging Face in the first two days, prompting criticism from parts of the developer and investor community. Sarvam AI’s founders and supporters defended the model’s benchmarks and training methodology, emphasizing its developmental significance for India’s sovereign Artificial Intelligence ambitions. Industry voices, including Zoho’s Sridhar Vembu, highlighted the need for patience and ongoing innovation, framing Sarvam-M as both a milestone and a foundation for further advancements in locally relevant Artificial Intelligence technology.

67

Impact Score

House panel advances export controls after China report

The House Foreign Affairs Committee moved export control legislation after a House Select Committee report detailed China’s use of illegal means to build its Artificial Intelligence and semiconductor sectors. The measure is aimed at chip smuggling and Artificial Intelligence model theft.

Intel repurposes scrap dies to expand CPU supply

Intel is repurposing wafer-edge and lower-yield silicon that would normally be discarded into sellable CPUs as industry demand outpaces supply. The strategy reflects a market where customers are willing to buy lower-tier parts to secure any available capacity.

The missing step between Artificial Intelligence hype and profit

Artificial Intelligence companies have built powerful systems and promised sweeping change, but the path from technical progress to real business value remains unclear. Conflicting studies, weak workplace performance, and poor transparency are leaving a critical gap between hype and evidence.

Samsung workers leaked secrets into ChatGPT

Samsung employees reportedly exposed confidential company information while using ChatGPT for coding help and meeting note generation. The incidents highlight the risk of feeding sensitive data into public Artificial Intelligence tools that retain user inputs.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.