How are Indian firms training LLMs?

How Are Indian Firms Training LLMs?

UPSC Prelims + Mains Study Note | GS-III: Science & Technology / Economy

1. At a Glance

Large Language Models (LLMs) are neural networks trained on vast text corpora to power generative AI services; training requires massive GPU clusters, electricity, and curated data — all scarce in India relative to the US/China. [S1]
The IndiaAI Mission (approved March 2024, ₹10,372 crore) is the primary state instrument enabling Indian firms to train LLMs domestically by subsidising compute access. [S2]
Sarvam AI (Bengaluru) is the flagship case study — it won a Government of India tender and released India's first sovereign foundational LLMs trained entirely on IndiaAI Mission compute in February 2026. [S3]
UPSC relevance: cuts across GS-III (Science & Tech, Economy), GS-II (Government Policy), and India's digital sovereignty / strategic autonomy discourse.

2. Why in the News

AI Impact Summit, New Delhi (19 February 2026): Prime Minister Modi met Sarvam AI co-founder Pratyush Kumar at Bharat Mandapam. Sarvam released two open-sourced foundation LLMs — Sarvam 30B and Sarvam 105B — both trained exclusively on IndiaAI Mission infrastructure. [S4]
IndiaAI Mission's common compute capacity crossed 34,000 GPUs (later expanding to 38,000+), a milestone underscoring India's domestic AI infrastructure build-out. [S1]
April 2025: Government formally selected Sarvam AI to build India's sovereign LLM — capable of reasoning, designed for voice, fluent in Indian languages, intended for population-scale secure deployment. [S3]

3. Background & Evolution

Year	Milestone
2023	Global LLM race accelerates post-ChatGPT; India lacks domestic GPU compute and curated Indian-language data
Mar 2024	Union Cabinet approves IndiaAI Mission — ₹10,372 crore outlay over multiple years under MeitY [S2]
Mid-2024	IndiaAI Compute Portal goes live; empanels AI service providers to offer shared GPU access at subsidised rates [S1]
Late 2024	Common compute capacity reaches 18,693 GPUs; 14 AI service providers empanelled; data centres in Mumbai, Navi Mumbai, Hyderabad, Bengaluru, Noida, Jamnagar [S1]
Apr 2025	Sarvam AI selected via competitive tender under IndiaAI Mission for sovereign LLM mandate [S3]
Feb 2026	Sarvam 30B and 105B released at AI Impact Summit; trained on IndiaAI Mission compute; use Mixture of Experts (MoE) architecture [S4]
2026	Common compute capacity crosses 38,000 GPUs + 1,050 TPUs; 20,000 additional GPUs under procurement [S1]

4. Core Static Facts

Implementing Body - Ministry of Electronics and Information Technology (MeitY) — nodal ministry for IndiaAI Mission [S2] - IndiaAI (dedicated implementation entity under MeitY) manages compute portal, model development calls, and startup support [S2]

Budget - ₹10,372 crore (≈ $1.1 billion) approved by Union Cabinet, March 2024 [S2]

Compute Infrastructure - 38,000+ GPUs empanelled across 14 AI service providers [S1] - 1,050 TPUs also available [S1] - Subsidised rate: ₹65–₹100/hour vs global market rate >₹200/hour [S1] - Data centre locations: Mumbai, Navi Mumbai, Hyderabad, Bengaluru, Noida, Jamnagar [S1] - Eligible users: startups, researchers, academic institutions, government organisations [S1]

Sarvam AI Models (Feb 2026) - Sarvam 30B — 32-billion-parameter Mixture of Experts (MoE), 65K context window; speed-optimised (comparable to Gemini Flash-Lite / GPT-5 mini tier) [S3][S4] - Sarvam 105B — 106-billion-parameter MoE, 128K context window; complex reasoning tasks (comparable to Gemini Pro / GPT-5 tier); wins ~90% of Indian-language benchmark comparisons vs GPT-4 [S3][S4] - Both models: open-sourced, trained 100% on IndiaAI Mission infrastructure [S3]

Key Terminology - LLM: Large Language Model — neural network trained on billions of text tokens, foundation for generative AI - Parameter: numerical weight in a neural network; more parameters ≈ greater model capacity - Token: smallest unit of text processed by an LLM; Indian-language text requires more tokens than equivalent English, raising inference cost [S4] - MoE (Mixture of Experts): architecture where only a subset of model parameters ("experts") are activated per inference pass — lowers compute cost without proportional loss in capability [S4] - GPU (Graphics Processing Unit): primary hardware for LLM training/inference - TPU (Tensor Processing Unit): Google-designed chip for ML workloads; available on IndiaAI portal

5. Multi-Dimensional Analysis

Economic

India's domestic LLM training market was entirely dependent on foreign cloud providers (AWS, Google, Azure) pre-IndiaAI Mission; subsidised compute reduces this dependency. [S1]
₹10,372 crore public investment aimed at catalysing private AI ecosystem; expected to reduce per-GPU-hour costs by 50%+ for Indian startups. [S1][S2]
MoE architecture reduces training and inference costs substantially — enabling Indian firms with smaller capital bases to compete with global labs. [S4]

Scientific / Technological

Dual bottleneck for Indian LLMs: (a) scarcity of high-quality Indian-language training data (English/East Asian languages dominate internet corpora); (b) capital-intensive GPU clusters costing millions of dollars per training run. [S4]
Indian-language tokens are structurally less efficient: sentences must often be translated to English for inference, consuming extra tokens and raising costs. [S4]
MoE innovation addresses compute bottleneck; curated Indian-language datasets (being built under IndiaAI's Data Management Unit) address the data bottleneck. [S2]
Sarvam 105B achieving ~90% win-rate over GPT-4 on Indian-language benchmarks signals narrowing of the capability gap with frontier global models. [S3]

Geopolitical / Strategic

Digital sovereignty: training LLMs on Indian soil with Indian data reduces dependence on US/China AI stacks for critical government applications. [S3]
Sovereign LLM mandate explicitly targets "secure population-scale deployment" — implying use for government services (Aadhaar, health, welfare) without data leaving India. [S3]
Aligns with India's G20 AI principles (2023) and NITI Aayog's AI strategy emphasising "AI for all." [S2]

Administrative / Governance

IndiaAI Mission uses a competitive tender model — transparent procurement for sovereign LLM rather than directed allocation; Sarvam selected after open call. [S3]
14 empanelled private AI service providers supply GPU capacity — public-private partnership model for infrastructure without full state ownership. [S1]
Eligible institutions include academic researchers and government bodies, not just startups — broad access design. [S1]

Ethical / Governance

Open-sourcing Sarvam models (30B, 105B) enables public scrutiny, reduces proprietary lock-in, and allows downstream adaptation by Indian researchers. [S3]
Risk: concentrated state selection of a single "sovereign LLM" vendor could create governance/monopoly concerns if procurement transparency lapses.

Social

Indian-language capability of LLMs has direct equity implications: 90%+ of India's population is not English-dominant; English-centric LLMs exclude them from AI benefits. [S4]
Voice-first design of Sarvam's sovereign mandate targets low-literacy users — aligns with Digital India's last-mile inclusion goals. [S3]

6. Recent Developments (Last 12–18 Months)

April 2025: Sarvam AI formally selected by Government of India under IndiaAI Mission to build India's sovereign LLM. [S3]
September 2025: Government announced 500 AI Data Labs with ₹988 crore investment to boost AI capabilities. [S5]
February 19, 2026: PM Modi meets Sarvam co-founder Pratyush Kumar at AI Impact Summit, Bharat Mandapam, New Delhi. [S4]
February 26, 2026: Sarvam AI officially releases Sarvam 30B and Sarvam 105B — both open-sourced, trained on IndiaAI Mission compute. [S4]
2026: IndiaAI common compute capacity crosses 38,000 GPUs and 1,050 TPUs; 20,000 additional GPUs under active procurement. [S1]
IndiaAI Mission expanded with additional startup support mechanisms and affordable compute extensions beyond initial mandate. [S1]

7. Prelims Hooks

The IndiaAI Mission was approved by Union Cabinet in March 2024 with an outlay of ₹10,372 crore. [S2]
Nodal ministry for IndiaAI Mission: Ministry of Electronics and Information Technology (MeitY). [S2]
IndiaAI common compute portal provides GPUs at ₹65–₹100/hour vs global rates exceeding ₹200/hour. [S1]
As of 2026, IndiaAI has empanelled 14 AI service providers offering 38,000+ GPUs and 1,050 TPUs. [S1]
Sarvam AI is the Bengaluru-based startup selected by Government of India as the sovereign LLM developer under IndiaAI Mission. [S3]
Sarvam's Sarvam 30B has 32 billion parameters with a 65K context window; Sarvam 105B has 106 billion parameters with a 128K context window. [S3]
Both Sarvam models use Mixture of Experts (MoE) architecture — key reason they are less compute-intensive than comparable dense models. [S4]
In MoE architecture, only a subset of parameters (experts) is activated per inference, reducing computational cost. [S4]
Sarvam 105B wins approximately 90% of comparisons against GPT-4 on Indian-language benchmarks. [S3]
Both Sarvam LLMs were trained entirely on IndiaAI Mission infrastructure and are open-sourced. [S3]
IndiaAI data centres are located in: Mumbai, Navi Mumbai, Hyderabad, Bengaluru, Noida, and Jamnagar. [S1]
Indian-language text costs more in LLM inference because it requires more tokens — including translation overhead to English. [S4]
500 AI Data Labs announced in September 2025 with ₹988 crore investment to expand India's AI data infrastructure. [S5]
The AI Impact Summit where Sarvam models were unveiled was held at Bharat Mandapam, New Delhi, on 19 February 2026. [S4]

8. Mains Relevance

GS Papers: - GS-III: Science & Technology (AI/ML); Indian Economy (Start-up ecosystem, Digital Infrastructure) - GS-II: Government Policies and Interventions; e-Governance

Syllabus Headings: - GS-III: "Awareness in IT, Space, Computers, Robotics, Nano-technology, Bio-technology and issues relating to Intellectual Property Rights" - GS-III: "Indian Economy and issues relating to growth, development and employment" (Digital Economy) - GS-II: "Government policies and interventions for development in various sectors and issues arising out of their design and implementation"

Plausible Mains Questions: 1. "Critically examine the challenges faced by Indian firms in training Large Language Models domestically, and evaluate the role of the IndiaAI Mission in addressing them." (GS-III, 15 marks) 2. "The development of a sovereign LLM is not merely a technological achievement but a matter of strategic autonomy. Discuss in the context of India's IndiaAI Mission." (GS-III/GS-II, 15 marks) 3. "What is the Mixture of Experts (MoE) architecture, and why is it particularly significant for AI development in resource-constrained economies like India?" (GS-III, 10 marks)

9. Related Topics to Study Next

Topic	Connection
IndiaAI Mission (full scope)	Parent policy — covers Data Management, Application Development, FutureSkills, not just compute
National Data Governance Framework	Governs the Indian-language data that feeds LLM training
Digital India Programme	Foundational infrastructure (broadband, DigiLocker, UPI) on which AI deployment scales
Semiconductor Mission (India)	Long-term solution to GPU import dependency; chip fabrication is the upstream bottleneck
Global AI Governance (G20, UN AI Panel)	India's position on AI regulation, safety standards, and sovereign AI norms
Open-Source vs Proprietary AI	Policy debate on whether government-funded models should be open-sourced — directly relevant to Sarvam case
National Language Technology Mission	Earlier predecessor initiative for Indian-language NLP; provides historical context
Start-up India & Deep-Tech Policy	Sarvam AI's growth is embedded in this broader start-up ecosystem support framework

10. Common Errors / Trap Areas

Wrong ministry: IndiaAI Mission is under MeitY, NOT NITI Aayog (NITI Aayog authored earlier AI strategy papers but does not implement the Mission).
Parameter confusion: Sarvam 30B is described as "35 billion parameters" in some reports and "32 billion" in others — the MoE model has 32B active parameters but 35B total; exam questions will likely use the model name (30B/105B) rather than exact count.
MoE ≠ smaller model: Sarvam 105B has 106B total parameters — it is NOT a small model; MoE makes it cheaper to run because only a fraction of parameters activate per token, not because it has fewer parameters overall.
Confusing Sarvam AI with other Indian AI initiatives: Krutrim (Ola), Hanooman (SML India), BharatGPT are separate LLM projects — Sarvam is the one with the government sovereign LLM mandate.
Year of IndiaAI Mission approval: March 2024, not 2023 (the National AI Strategy/NITI Aayog paper was 2018; these are different documents separated by 6 years).

11. Sources

[S1] India's Common Compute Capacity Crosses 34,000 GPUs — https://www.pib.gov.in/PressReleasePage.aspx?PRID=2132817 — (Tier 1)
[S2] Cabinet Approves Over ₹10,300 Crore for IndiaAI Mission — https://www.pib.gov.in/PressReleasePage.aspx?PRID=2012375 — (Tier 1)
[S3] Sarvam AI — Wikipedia / IndiaAI Mission sovereign LLM selection — https://en.wikipedia.org/wiki/Sarvam_AI — (Tier 3/reference)
[S4] "How are Indian firms training LLMs?" — The Hindu / Hindu BusinessLine, 26 February 2026 (article excerpt supplied as primary source) — https://www.thehindu.com/todays-paper/2026-02-26/th_international/articleGGCFL07NJ-13661862.ece — (Tier 4)
[S5] India to set up 500 Data Labs, boost AI capabilities with ₹988 crore investment — https://www.newsonair.gov.in/india-to-set-up-500-data-labs-boost-ai-capabilities-with-988-crore-investment — (Government of India / AIR, Tier 1-adjacent)
[S6] IndiaAI Mission Expands AI Ecosystem with Affordable Compute and Startup Support — https://www.pib.gov.in/PressReleasePage.aspx?PRID=2245069 — (Tier 1)