How are Indian firms training LLMs?


How Are Indian Firms Training LLMs?

UPSC Prelims + Mains Study Note | GS-III: Science & Technology / Economy


1. At a Glance


2. Why in the News


3. Background & Evolution

Year Milestone
2023 Global LLM race accelerates post-ChatGPT; India lacks domestic GPU compute and curated Indian-language data
Mar 2024 Union Cabinet approves IndiaAI Mission — ₹10,372 crore outlay over multiple years under MeitY [S2]
Mid-2024 IndiaAI Compute Portal goes live; empanels AI service providers to offer shared GPU access at subsidised rates [S1]
Late 2024 Common compute capacity reaches 18,693 GPUs; 14 AI service providers empanelled; data centres in Mumbai, Navi Mumbai, Hyderabad, Bengaluru, Noida, Jamnagar [S1]
Apr 2025 Sarvam AI selected via competitive tender under IndiaAI Mission for sovereign LLM mandate [S3]
Feb 2026 Sarvam 30B and 105B released at AI Impact Summit; trained on IndiaAI Mission compute; use Mixture of Experts (MoE) architecture [S4]
2026 Common compute capacity crosses 38,000 GPUs + 1,050 TPUs; 20,000 additional GPUs under procurement [S1]

4. Core Static Facts

Implementing Body - Ministry of Electronics and Information Technology (MeitY) — nodal ministry for IndiaAI Mission [S2] - IndiaAI (dedicated implementation entity under MeitY) manages compute portal, model development calls, and startup support [S2]

Budget - ₹10,372 crore (≈ $1.1 billion) approved by Union Cabinet, March 2024 [S2]

Compute Infrastructure - 38,000+ GPUs empanelled across 14 AI service providers [S1] - 1,050 TPUs also available [S1] - Subsidised rate: ₹65–₹100/hour vs global market rate >₹200/hour [S1] - Data centre locations: Mumbai, Navi Mumbai, Hyderabad, Bengaluru, Noida, Jamnagar [S1] - Eligible users: startups, researchers, academic institutions, government organisations [S1]

Sarvam AI Models (Feb 2026) - Sarvam 30B — 32-billion-parameter Mixture of Experts (MoE), 65K context window; speed-optimised (comparable to Gemini Flash-Lite / GPT-5 mini tier) [S3][S4] - Sarvam 105B — 106-billion-parameter MoE, 128K context window; complex reasoning tasks (comparable to Gemini Pro / GPT-5 tier); wins ~90% of Indian-language benchmark comparisons vs GPT-4 [S3][S4] - Both models: open-sourced, trained 100% on IndiaAI Mission infrastructure [S3]

Key Terminology - LLM: Large Language Model — neural network trained on billions of text tokens, foundation for generative AI - Parameter: numerical weight in a neural network; more parameters ≈ greater model capacity - Token: smallest unit of text processed by an LLM; Indian-language text requires more tokens than equivalent English, raising inference cost [S4] - MoE (Mixture of Experts): architecture where only a subset of model parameters ("experts") are activated per inference pass — lowers compute cost without proportional loss in capability [S4] - GPU (Graphics Processing Unit): primary hardware for LLM training/inference - TPU (Tensor Processing Unit): Google-designed chip for ML workloads; available on IndiaAI portal


5. Multi-Dimensional Analysis

Economic

Scientific / Technological

Geopolitical / Strategic

Administrative / Governance

Ethical / Governance

Social


6. Recent Developments (Last 12–18 Months)


7. Prelims Hooks

  1. The IndiaAI Mission was approved by Union Cabinet in March 2024 with an outlay of ₹10,372 crore. [S2]
  2. Nodal ministry for IndiaAI Mission: Ministry of Electronics and Information Technology (MeitY). [S2]
  3. IndiaAI common compute portal provides GPUs at ₹65–₹100/hour vs global rates exceeding ₹200/hour. [S1]
  4. As of 2026, IndiaAI has empanelled 14 AI service providers offering 38,000+ GPUs and 1,050 TPUs. [S1]
  5. Sarvam AI is the Bengaluru-based startup selected by Government of India as the sovereign LLM developer under IndiaAI Mission. [S3]
  6. Sarvam's Sarvam 30B has 32 billion parameters with a 65K context window; Sarvam 105B has 106 billion parameters with a 128K context window. [S3]
  7. Both Sarvam models use Mixture of Experts (MoE) architecture — key reason they are less compute-intensive than comparable dense models. [S4]
  8. In MoE architecture, only a subset of parameters (experts) is activated per inference, reducing computational cost. [S4]
  9. Sarvam 105B wins approximately 90% of comparisons against GPT-4 on Indian-language benchmarks. [S3]
  10. Both Sarvam LLMs were trained entirely on IndiaAI Mission infrastructure and are open-sourced. [S3]
  11. IndiaAI data centres are located in: Mumbai, Navi Mumbai, Hyderabad, Bengaluru, Noida, and Jamnagar. [S1]
  12. Indian-language text costs more in LLM inference because it requires more tokens — including translation overhead to English. [S4]
  13. 500 AI Data Labs announced in September 2025 with ₹988 crore investment to expand India's AI data infrastructure. [S5]
  14. The AI Impact Summit where Sarvam models were unveiled was held at Bharat Mandapam, New Delhi, on 19 February 2026. [S4]

8. Mains Relevance

GS Papers: - GS-III: Science & Technology (AI/ML); Indian Economy (Start-up ecosystem, Digital Infrastructure) - GS-II: Government Policies and Interventions; e-Governance

Syllabus Headings: - GS-III: "Awareness in IT, Space, Computers, Robotics, Nano-technology, Bio-technology and issues relating to Intellectual Property Rights" - GS-III: "Indian Economy and issues relating to growth, development and employment" (Digital Economy) - GS-II: "Government policies and interventions for development in various sectors and issues arising out of their design and implementation"

Plausible Mains Questions: 1. "Critically examine the challenges faced by Indian firms in training Large Language Models domestically, and evaluate the role of the IndiaAI Mission in addressing them." (GS-III, 15 marks) 2. "The development of a sovereign LLM is not merely a technological achievement but a matter of strategic autonomy. Discuss in the context of India's IndiaAI Mission." (GS-III/GS-II, 15 marks) 3. "What is the Mixture of Experts (MoE) architecture, and why is it particularly significant for AI development in resource-constrained economies like India?" (GS-III, 10 marks)


9. Related Topics to Study Next

Topic Connection
IndiaAI Mission (full scope) Parent policy — covers Data Management, Application Development, FutureSkills, not just compute
National Data Governance Framework Governs the Indian-language data that feeds LLM training
Digital India Programme Foundational infrastructure (broadband, DigiLocker, UPI) on which AI deployment scales
Semiconductor Mission (India) Long-term solution to GPU import dependency; chip fabrication is the upstream bottleneck
Global AI Governance (G20, UN AI Panel) India's position on AI regulation, safety standards, and sovereign AI norms
Open-Source vs Proprietary AI Policy debate on whether government-funded models should be open-sourced — directly relevant to Sarvam case
National Language Technology Mission Earlier predecessor initiative for Indian-language NLP; provides historical context
Start-up India & Deep-Tech Policy Sarvam AI's growth is embedded in this broader start-up ecosystem support framework

10. Common Errors / Trap Areas

  1. Wrong ministry: IndiaAI Mission is under MeitY, NOT NITI Aayog (NITI Aayog authored earlier AI strategy papers but does not implement the Mission).
  2. Parameter confusion: Sarvam 30B is described as "35 billion parameters" in some reports and "32 billion" in others — the MoE model has 32B active parameters but 35B total; exam questions will likely use the model name (30B/105B) rather than exact count.
  3. MoE ≠ smaller model: Sarvam 105B has 106B total parameters — it is NOT a small model; MoE makes it cheaper to run because only a fraction of parameters activate per token, not because it has fewer parameters overall.
  4. Confusing Sarvam AI with other Indian AI initiatives: Krutrim (Ola), Hanooman (SML India), BharatGPT are separate LLM projects — Sarvam is the one with the government sovereign LLM mandate.
  5. Year of IndiaAI Mission approval: March 2024, not 2023 (the National AI Strategy/NITI Aayog paper was 2018; these are different documents separated by 6 years).

11. Sources