How are Indian firms training LLMs?
How Are Indian Firms Training LLMs?
UPSC Prelims + Mains Study Note | GS-III: Science & Technology / Economy
1. At a Glance
- Large Language Models (LLMs) are neural networks trained on vast text corpora to power generative AI services; training requires massive GPU clusters, electricity, and curated data — all scarce in India relative to the US/China. [S1]
- The IndiaAI Mission (approved March 2024, ₹10,372 crore) is the primary state instrument enabling Indian firms to train LLMs domestically by subsidising compute access. [S2]
- Sarvam AI (Bengaluru) is the flagship case study — it won a Government of India tender and released India's first sovereign foundational LLMs trained entirely on IndiaAI Mission compute in February 2026. [S3]
- UPSC relevance: cuts across GS-III (Science & Tech, Economy), GS-II (Government Policy), and India's digital sovereignty / strategic autonomy discourse.
2. Why in the News
- AI Impact Summit, New Delhi (19 February 2026): Prime Minister Modi met Sarvam AI co-founder Pratyush Kumar at Bharat Mandapam. Sarvam released two open-sourced foundation LLMs — Sarvam 30B and Sarvam 105B — both trained exclusively on IndiaAI Mission infrastructure. [S4]
- IndiaAI Mission's common compute capacity crossed 34,000 GPUs (later expanding to 38,000+), a milestone underscoring India's domestic AI infrastructure build-out. [S1]
- April 2025: Government formally selected Sarvam AI to build India's sovereign LLM — capable of reasoning, designed for voice, fluent in Indian languages, intended for population-scale secure deployment. [S3]
3. Background & Evolution
| Year | Milestone |
|---|---|
| 2023 | Global LLM race accelerates post-ChatGPT; India lacks domestic GPU compute and curated Indian-language data |
| Mar 2024 | Union Cabinet approves IndiaAI Mission — ₹10,372 crore outlay over multiple years under MeitY [S2] |
| Mid-2024 | IndiaAI Compute Portal goes live; empanels AI service providers to offer shared GPU access at subsidised rates [S1] |
| Late 2024 | Common compute capacity reaches 18,693 GPUs; 14 AI service providers empanelled; data centres in Mumbai, Navi Mumbai, Hyderabad, Bengaluru, Noida, Jamnagar [S1] |
| Apr 2025 | Sarvam AI selected via competitive tender under IndiaAI Mission for sovereign LLM mandate [S3] |
| Feb 2026 | Sarvam 30B and 105B released at AI Impact Summit; trained on IndiaAI Mission compute; use Mixture of Experts (MoE) architecture [S4] |
| 2026 | Common compute capacity crosses 38,000 GPUs + 1,050 TPUs; 20,000 additional GPUs under procurement [S1] |
4. Core Static Facts
Implementing Body - Ministry of Electronics and Information Technology (MeitY) — nodal ministry for IndiaAI Mission [S2] - IndiaAI (dedicated implementation entity under MeitY) manages compute portal, model development calls, and startup support [S2]
Budget - ₹10,372 crore (≈ $1.1 billion) approved by Union Cabinet, March 2024 [S2]
Compute Infrastructure - 38,000+ GPUs empanelled across 14 AI service providers [S1] - 1,050 TPUs also available [S1] - Subsidised rate: ₹65–₹100/hour vs global market rate >₹200/hour [S1] - Data centre locations: Mumbai, Navi Mumbai, Hyderabad, Bengaluru, Noida, Jamnagar [S1] - Eligible users: startups, researchers, academic institutions, government organisations [S1]
Sarvam AI Models (Feb 2026) - Sarvam 30B — 32-billion-parameter Mixture of Experts (MoE), 65K context window; speed-optimised (comparable to Gemini Flash-Lite / GPT-5 mini tier) [S3][S4] - Sarvam 105B — 106-billion-parameter MoE, 128K context window; complex reasoning tasks (comparable to Gemini Pro / GPT-5 tier); wins ~90% of Indian-language benchmark comparisons vs GPT-4 [S3][S4] - Both models: open-sourced, trained 100% on IndiaAI Mission infrastructure [S3]
Key Terminology - LLM: Large Language Model — neural network trained on billions of text tokens, foundation for generative AI - Parameter: numerical weight in a neural network; more parameters ≈ greater model capacity - Token: smallest unit of text processed by an LLM; Indian-language text requires more tokens than equivalent English, raising inference cost [S4] - MoE (Mixture of Experts): architecture where only a subset of model parameters ("experts") are activated per inference pass — lowers compute cost without proportional loss in capability [S4] - GPU (Graphics Processing Unit): primary hardware for LLM training/inference - TPU (Tensor Processing Unit): Google-designed chip for ML workloads; available on IndiaAI portal
5. Multi-Dimensional Analysis
Economic
- India's domestic LLM training market was entirely dependent on foreign cloud providers (AWS, Google, Azure) pre-IndiaAI Mission; subsidised compute reduces this dependency. [S1]
- ₹10,372 crore public investment aimed at catalysing private AI ecosystem; expected to reduce per-GPU-hour costs by 50%+ for Indian startups. [S1][S2]
- MoE architecture reduces training and inference costs substantially — enabling Indian firms with smaller capital bases to compete with global labs. [S4]
Scientific / Technological
- Dual bottleneck for Indian LLMs: (a) scarcity of high-quality Indian-language training data (English/East Asian languages dominate internet corpora); (b) capital-intensive GPU clusters costing millions of dollars per training run. [S4]
- Indian-language tokens are structurally less efficient: sentences must often be translated to English for inference, consuming extra tokens and raising costs. [S4]
- MoE innovation addresses compute bottleneck; curated Indian-language datasets (being built under IndiaAI's Data Management Unit) address the data bottleneck. [S2]
- Sarvam 105B achieving ~90% win-rate over GPT-4 on Indian-language benchmarks signals narrowing of the capability gap with frontier global models. [S3]
Geopolitical / Strategic
- Digital sovereignty: training LLMs on Indian soil with Indian data reduces dependence on US/China AI stacks for critical government applications. [S3]
- Sovereign LLM mandate explicitly targets "secure population-scale deployment" — implying use for government services (Aadhaar, health, welfare) without data leaving India. [S3]
- Aligns with India's G20 AI principles (2023) and NITI Aayog's AI strategy emphasising "AI for all." [S2]
Administrative / Governance
- IndiaAI Mission uses a competitive tender model — transparent procurement for sovereign LLM rather than directed allocation; Sarvam selected after open call. [S3]
- 14 empanelled private AI service providers supply GPU capacity — public-private partnership model for infrastructure without full state ownership. [S1]
- Eligible institutions include academic researchers and government bodies, not just startups — broad access design. [S1]
Ethical / Governance
- Open-sourcing Sarvam models (30B, 105B) enables public scrutiny, reduces proprietary lock-in, and allows downstream adaptation by Indian researchers. [S3]
- Risk: concentrated state selection of a single "sovereign LLM" vendor could create governance/monopoly concerns if procurement transparency lapses.
Social
- Indian-language capability of LLMs has direct equity implications: 90%+ of India's population is not English-dominant; English-centric LLMs exclude them from AI benefits. [S4]
- Voice-first design of Sarvam's sovereign mandate targets low-literacy users — aligns with Digital India's last-mile inclusion goals. [S3]
6. Recent Developments (Last 12–18 Months)
- April 2025: Sarvam AI formally selected by Government of India under IndiaAI Mission to build India's sovereign LLM. [S3]
- September 2025: Government announced 500 AI Data Labs with ₹988 crore investment to boost AI capabilities. [S5]
- February 19, 2026: PM Modi meets Sarvam co-founder Pratyush Kumar at AI Impact Summit, Bharat Mandapam, New Delhi. [S4]
- February 26, 2026: Sarvam AI officially releases Sarvam 30B and Sarvam 105B — both open-sourced, trained on IndiaAI Mission compute. [S4]
- 2026: IndiaAI common compute capacity crosses 38,000 GPUs and 1,050 TPUs; 20,000 additional GPUs under active procurement. [S1]
- IndiaAI Mission expanded with additional startup support mechanisms and affordable compute extensions beyond initial mandate. [S1]
7. Prelims Hooks
- The IndiaAI Mission was approved by Union Cabinet in March 2024 with an outlay of ₹10,372 crore. [S2]
- Nodal ministry for IndiaAI Mission: Ministry of Electronics and Information Technology (MeitY). [S2]
- IndiaAI common compute portal provides GPUs at ₹65–₹100/hour vs global rates exceeding ₹200/hour. [S1]
- As of 2026, IndiaAI has empanelled 14 AI service providers offering 38,000+ GPUs and 1,050 TPUs. [S1]
- Sarvam AI is the Bengaluru-based startup selected by Government of India as the sovereign LLM developer under IndiaAI Mission. [S3]
- Sarvam's Sarvam 30B has 32 billion parameters with a 65K context window; Sarvam 105B has 106 billion parameters with a 128K context window. [S3]
- Both Sarvam models use Mixture of Experts (MoE) architecture — key reason they are less compute-intensive than comparable dense models. [S4]
- In MoE architecture, only a subset of parameters (experts) is activated per inference, reducing computational cost. [S4]
- Sarvam 105B wins approximately 90% of comparisons against GPT-4 on Indian-language benchmarks. [S3]
- Both Sarvam LLMs were trained entirely on IndiaAI Mission infrastructure and are open-sourced. [S3]
- IndiaAI data centres are located in: Mumbai, Navi Mumbai, Hyderabad, Bengaluru, Noida, and Jamnagar. [S1]
- Indian-language text costs more in LLM inference because it requires more tokens — including translation overhead to English. [S4]
- 500 AI Data Labs announced in September 2025 with ₹988 crore investment to expand India's AI data infrastructure. [S5]
- The AI Impact Summit where Sarvam models were unveiled was held at Bharat Mandapam, New Delhi, on 19 February 2026. [S4]
8. Mains Relevance
GS Papers: - GS-III: Science & Technology (AI/ML); Indian Economy (Start-up ecosystem, Digital Infrastructure) - GS-II: Government Policies and Interventions; e-Governance
Syllabus Headings: - GS-III: "Awareness in IT, Space, Computers, Robotics, Nano-technology, Bio-technology and issues relating to Intellectual Property Rights" - GS-III: "Indian Economy and issues relating to growth, development and employment" (Digital Economy) - GS-II: "Government policies and interventions for development in various sectors and issues arising out of their design and implementation"
Plausible Mains Questions: 1. "Critically examine the challenges faced by Indian firms in training Large Language Models domestically, and evaluate the role of the IndiaAI Mission in addressing them." (GS-III, 15 marks) 2. "The development of a sovereign LLM is not merely a technological achievement but a matter of strategic autonomy. Discuss in the context of India's IndiaAI Mission." (GS-III/GS-II, 15 marks) 3. "What is the Mixture of Experts (MoE) architecture, and why is it particularly significant for AI development in resource-constrained economies like India?" (GS-III, 10 marks)
9. Related Topics to Study Next
| Topic | Connection |
|---|---|
| IndiaAI Mission (full scope) | Parent policy — covers Data Management, Application Development, FutureSkills, not just compute |
| National Data Governance Framework | Governs the Indian-language data that feeds LLM training |
| Digital India Programme | Foundational infrastructure (broadband, DigiLocker, UPI) on which AI deployment scales |
| Semiconductor Mission (India) | Long-term solution to GPU import dependency; chip fabrication is the upstream bottleneck |
| Global AI Governance (G20, UN AI Panel) | India's position on AI regulation, safety standards, and sovereign AI norms |
| Open-Source vs Proprietary AI | Policy debate on whether government-funded models should be open-sourced — directly relevant to Sarvam case |
| National Language Technology Mission | Earlier predecessor initiative for Indian-language NLP; provides historical context |
| Start-up India & Deep-Tech Policy | Sarvam AI's growth is embedded in this broader start-up ecosystem support framework |
10. Common Errors / Trap Areas
- Wrong ministry: IndiaAI Mission is under MeitY, NOT NITI Aayog (NITI Aayog authored earlier AI strategy papers but does not implement the Mission).
- Parameter confusion: Sarvam 30B is described as "35 billion parameters" in some reports and "32 billion" in others — the MoE model has 32B active parameters but 35B total; exam questions will likely use the model name (30B/105B) rather than exact count.
- MoE ≠ smaller model: Sarvam 105B has 106B total parameters — it is NOT a small model; MoE makes it cheaper to run because only a fraction of parameters activate per token, not because it has fewer parameters overall.
- Confusing Sarvam AI with other Indian AI initiatives: Krutrim (Ola), Hanooman (SML India), BharatGPT are separate LLM projects — Sarvam is the one with the government sovereign LLM mandate.
- Year of IndiaAI Mission approval: March 2024, not 2023 (the National AI Strategy/NITI Aayog paper was 2018; these are different documents separated by 6 years).
11. Sources
- [S1] India's Common Compute Capacity Crosses 34,000 GPUs — https://www.pib.gov.in/PressReleasePage.aspx?PRID=2132817 — (Tier 1)
- [S2] Cabinet Approves Over ₹10,300 Crore for IndiaAI Mission — https://www.pib.gov.in/PressReleasePage.aspx?PRID=2012375 — (Tier 1)
- [S3] Sarvam AI — Wikipedia / IndiaAI Mission sovereign LLM selection — https://en.wikipedia.org/wiki/Sarvam_AI — (Tier 3/reference)
- [S4] "How are Indian firms training LLMs?" — The Hindu / Hindu BusinessLine, 26 February 2026 (article excerpt supplied as primary source) — https://www.thehindu.com/todays-paper/2026-02-26/th_international/articleGGCFL07NJ-13661862.ece — (Tier 4)
- [S5] India to set up 500 Data Labs, boost AI capabilities with ₹988 crore investment — https://www.newsonair.gov.in/india-to-set-up-500-data-labs-boost-ai-capabilities-with-988-crore-investment — (Government of India / AIR, Tier 1-adjacent)
- [S6] IndiaAI Mission Expands AI Ecosystem with Affordable Compute and Startup Support — https://www.pib.gov.in/PressReleasePage.aspx?PRID=2245069 — (Tier 1)