Unified data stack: The missing link in India’s AI ambitions
India has data at scale, but it is not integrated. A unified data stack could be the only way out to achieve successfully the country’s quest for AI sovereignty


India’s next big leap in AI will come not from GPUs or algorithms—but from how it unifies its data.
India’s ambition to build home-grown large language models (LLMs) is entering a decisive phase. With the IndiaAI Mission approved by the Cabinet in 2024, and initiatives such as AI Kosh (IndiaAI Datasets Platform) launched to democratise access to datasets, the country’s potential is vast.
Yet amid the focus on model architectures and GPU clusters, a quieter challenge threatens to slow the revolution — India’s fragmented data ecosystem. If LLMs are the engines of intelligence, data is the fuel. And while India has plenty of it, that fuel is scattered, inconsistent and under-curated.
However, many operate as vertical silos, with differing metadata, consent and access frameworks. The National Data Governance Framework Policy (Draft, May 2022), issued by the Ministry of Electronics and Information Technology (MeitY), recognised this explicitly — noting that government data “is currently managed… in differing and inconsistent ways” across agencies.
Consider healthcare: ABDM has digitised over 300 million health records, yet their interoperability across states remains limited. A unified data backbone could help securely share such data—enabling AI for diagnostics, disease surveillance and health copilots suited to Indian realities.
India is taking steps in this direction. The IndiaAI Datasets Platform aims to provide AI-ready datasets to startups and researchers.
Key components include:
By contrast, China shows the power of coordination and governance in its national data efforts—for example, through the National AI Data Resource Center and municipal data exchanges. India can blend these strengths—US-style openness plus China-style coordination—designing a model that is democratic in spirit, disciplined in execution and uniquely Indian in design.
This stack should now stand alongside semiconductor and compute policies as a core pillar of the IndiaAI Mission. The sooner India unifies its data, the faster it will unify its intelligence—moving from being merely data-rich to truly knowledge-sovereign.
Alexy Thomas, Partner, Technology Consulting, EY India
First Published: Jan 02, 2026, 13:08
Subscribe Now