IIT Bombay ignites AI revolution in Bharat with AIKOSH

IIT Bombay has released 16 culturally significant and diverse AI datasets on AIKOSH, the Government of India’s official AI repository, developed under the aegis of the Ministry of Electronics and Information Technology (MeitY).

A Bharat-Centric AI Push: “AI by India, for India”

This initiative isn’t just about data—it’s about asserting India’s AI sovereignty in a world dominated by western-centric datasets. The 16 datasets released by IIT Bombay reflect India’s unique diversity, with a sharp focus on language, script, document processing, and multimodal understanding.

Key components of the release include:

Handwritten and printed Indian scripts: Enabling advanced OCR and NLP for native languages.
Scanned table data from Indian documents: Facilitating document digitisation and automation for governance and legal sectors.
Multilingual Indian audio datasets: Enhancing speech recognition and synthesis systems for underserved languages.
Drone surveillance imagery: Boosting AI capabilities in smart agriculture, disaster management, and border surveillance.
Visual Question-Answering (VQA) datasets contextualised for India: Enabling intelligent systems that understand Indian imagery and cultural cues.

This initiative is being hailed as a pivotal step towards responsible and contextual AI development, essential for unlocking the true potential of technology in Indian ecosystems—from rural Bharat to high-tech urban centers.

AIKOSH: India’s secure data arsenal for the future

AIKOSH is India’s first-of-its-kind AI repository—a digital fort of datasets, pre-trained models, toolkits, and real-world use cases. Envisioned as a self-reliant ecosystem to fuel AI research and innovation, the platform empowers Indian developers, researchers, startups, and institutions to build solutions tailored for Indian realities.

With this release, IIT Bombay emerges as a trailblaser in aligning academic R&D with national AI priorities, contributing not just to data availability but also to the ethics and accountability of AI development in India.

AI systems are only as inclusive and intelligent as the data they are trained on. Until now, much of the global AI landscape has been trained on western-centric datasets, often neglecting non-English languages, diverse scripts, and culturally specific content. India, with its 122+ major languages, diverse scripts, and multilingual populations, cannot afford to remain dependent on such skewed data foundations.

The datasets released today represent a reclamation of India’s data identity, empowering researchers to:

Train models that understand regional languages like Marathi, Tamil, Assamese, Bhojpuri, Sanskrit, and more.
Solve local problems—from automatic processing of handwritten Indian forms to speech interfaces for rural populations.
Build inclusive AI models that reflect and serve India’s socio-cultural realities, not just borrowed global paradigms.

This move is part of a broader national vision to make India a global leader in responsible AI, with a strong focus on open data, transparency, and indigenous development. With IIT Bombay’s contribution, AIKOSH now grows into a powerhouse of innovation, signaling to the world that India is not just catching up—it is leading the AI transformation with its own voice, data, and values.

The released datasets and tools can be accessed by researchers, developers, students, startups, and government institutions at aikosh.indiaai.gov.in. The initiative also aligns with MeitY’s broader mission of Digital India, Make in AI, and Bharat Gen, aimed at democratising access to AI technologies and resources across all sectors.

With this release, IIT Bombay isn’t just powering AI innovation—it’s scripting a new digital future for Bharat. A future where AI speaks our languages, sees through our lenses, and solves problems rooted in our soil.

Spearheading AI in revolution Bharat: IIT Bombay unleashes 16 culturally rooted AI datasets on AIKOSH

IIT Bombay has released 16 culturally rooted AI datasets on AIKOSH, enabling responsible, India-centric AI development. The datasets empower innovation in Indian languages, documents, audio, and drone imagery, marking a bold step in Bharat’s AI sovereignty

A journey from Rio to Rishikesh: The heart-touching story of Padma Shri Awardee and vedic teacher Acharya Jonas Masetti

From Vedas to Virtual: DU introduces ‘Computer Applications for Sanskrit’ to merge ancient language with modern tech

Related News

Uttar Pradesh: Lucknow University opens first Green Skills & Applied AI Centre, aims to train 5,000 youth in first year

Declaration of 15th BRICS Trade Union Forum calls for human-centric AI, universal social security & labour cooperation

BRICS trade union forum Bhartiya Majdoor Sangh charts roadmap for human-centric at national Conference in DU

Indo-Japan Summit: Tech & AI are the pillars of partnership; MoUs inked on defence, critical minerals & clean energy

Central Sanskrit University launches India’s 1st AICTE-approved AI engineering programme with Indian knowledge systems

Mann Ki Baat: PM Modi highlights indigenous innovation, defence strength, yoga & people’s participation

Latest News

Punjab: Police resort to lathi-charge, teargas, and water cannons to disperse protesting MGNREGA workers

Balochistan Declares Independence from Pakistan: Unveils flag & anthem; Claims control of 85% province & minerals

Offensive Video on Hindu Gods: GAC slams Dhruv Rathee’s video titled “Can Hindus eat Beef?”; Orders takedown

Beyond NEET: The bigger debate around Sonam Wangchuk, foreign funding networks and the George Soros connection

MMA fighter Sangram Singh knocks Pakistan’s Ali out in 80 seconds to win STRIKE Asia Championship title

Andy Burnham to assume power as UK Prime Minister; Seventh PM in 10 years vows economic revival & political stability

Fact Check: Pakistan-based social media falsely labels Sonam Wangchuk’s court-ordered medical transfer as ‘CRPF Arrest’

Assam floods wreak havoc, more than 600 villages submerged affecting more than 2 lakhs people

Anti-India nexus of CJP: Anjali Bharadwaj & Shabana Azmi, funded by Ford, Soros & deep state spotted at Delhi protest

Kargil Vijay Diwas: Victory and the visionary reforms