"The model has been enhanced through a three-step process: Supervised Fine-Tuning (SFT), Reinforcement Learning with Verifiable Rewards (RLVR), and Inference Optimisations." (Sarvam AI Official Blog)

Key Facts
- Sarvam AI, an Indian startup, has developed Sarvam-M, a 24-billion-parameter open-weights hybrid reasoning language model built on Mistral Small.
- Sarvam-M underwent a rigorous three-step enhancement process including Supervised Fine-Tuning (SFT), Reinforcement Learning with Verifiable Rewards (RLVR), and Inference Optimisations.
- Sarvam-M has set new performance standards in mathematics, programming tasks, and Indian language understanding.
- On combined Indian language and math tasks such as the romanised GSM-8K benchmark, Sarvam-M demonstrated an impressive +86% improvement.
- Sarvam-M is now accessible via Sarvam's API and is available for download on Hugging Face for experimentation and integration.
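For quick experimentation, the open weights can be loaded with the Hugging Face `transformers` library. The snippet below is a minimal sketch, not an official recipe: the repository id `sarvamai/sarvam-m` and the presence of a chat template are assumptions, so verify both against the model card before use.

```python
# Minimal sketch: loading Sarvam-M from Hugging Face with transformers.
# Assumption: the repo id is "sarvamai/sarvam-m" and the tokenizer ships
# a chat template; confirm on the model card.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "sarvamai/sarvam-m"  # assumed repository id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # pick a dtype appropriate for your hardware
    device_map="auto",    # requires `accelerate`; shards the 24B weights across devices
)

# Build a chat-formatted prompt and generate a short completion.
messages = [{"role": "user", "content": "Solve step by step: 12 * 7 = ?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
# Decode only the newly generated tokens, not the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

Note that a 24-billion-parameter model requires substantial GPU memory; `device_map="auto"` lets transformers spread the weights across whatever accelerators are available.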
Key Stats at a Glance
- Model size of Sarvam-M: 24 billion parameters
- Performance improvement on the romanised GSM-8K benchmark: +86%
