
GenomeIndia Project: India Unlocks National Genetic Resource for Research and Innovation

The GenomeIndia Project, spearheaded by the Department of Biotechnology (DBT), has reached a milestone by completing Whole Genome Sequencing (WGS) of over 10,000 individuals across India. With the national data archived at the Indian Biological Data Centre (IBDC), the government has now opened access to researchers under established protocols, reinforcing its commitment to scientific advancement and sustainable development.

Why in the news?

On April 30, 2025, the Government of India officially announced that the GenomeIndia data is now accessible to Indian researchers. This follows the Prime Minister’s dedication of the GenomeIndia dataset on January 9, 2025, during the Genomics Data Conclave.

Purpose

  • To build a national genetic database representing India’s population diversity.
  • To democratize access to genomic data for Indian researchers.

Aim

  • Facilitate translational research in genomics and health.
  • Advance precision medicine and healthcare policy formulation.
  • Enable discoveries in biology, disease, and heritage through data-driven science.

Significance

  • One of the largest whole genome databases in Asia.
  • Crucial for addressing India-specific diseases, drug response patterns, and ancestry studies.
  • Boosts India’s scientific capabilities, healthcare R&D, and global genomic standing.
  • Supports Sustainable Development Goals (SDGs) such as SDG 3 (Good Health & Well-being) and SDG 9 (Industry, Innovation and Infrastructure).

Background

  • IBDC (Indian Biological Data Centre) was set up in March 2020 with advanced computational infrastructure.
  • Biotech-PRIDE Guidelines (2021) and FeED Protocols were formulated for ethical, transparent data sharing.
  • WGS completed for 9,772 samples; phenotypic data cleaned for 9,330 samples.

Important Details

  • Funded by: Department of Biotechnology (DBT), Government of India
  • Prime Minister Narendra Modi dedicated the data on Jan 9, 2025.
  • President Droupadi Murmu emphasized its national significance during her Republic Day eve address.

Dataset includes:

  • FASTQ files (700 TB)
  • gVCF files (35 TB)
  • Phenotypic data for 9,330 individuals
  • 27 critical phenotype variables (e.g., Hemoglobin, Cholesterol, Fasting Glucose)
  • Anthropometric data (e.g., Age, Gender, Height, Weight, Body Fat)

Other Points

  • Researchers can apply for access even outside the formal “Call for Proposals”.
  • FASTQ files are not downloadable due to size and security concerns; access is provided via a cloud interface.
  • This model aligns with international data-sharing practices.
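
To put the size concern in perspective, the short Python sketch below divides the dataset totals quoted above by the number of sequenced samples. It is only a back-of-the-envelope estimate: it assumes decimal units (1 TB = 1,000 GB) and assumes the quoted totals cover exactly the 9,772 sequenced samples, neither of which the announcement states. The constant and function names are illustrative.

```python
# Figures quoted in the article.
FASTQ_TOTAL_TB = 700       # raw sequencing reads
GVCF_TOTAL_TB = 35         # per-sample variant call files
SEQUENCED_SAMPLES = 9772   # samples with completed WGS

# Assumption: decimal units (1 TB = 1,000 GB); the source does not say.
GB_PER_TB = 1000


def average_gb_per_sample(total_tb: float, samples: int) -> float:
    """Return the average data volume per sample in gigabytes."""
    return total_tb * GB_PER_TB / samples


# Assumption: the totals cover exactly the sequenced samples.
fastq_gb = average_gb_per_sample(FASTQ_TOTAL_TB, SEQUENCED_SAMPLES)
gvcf_gb = average_gb_per_sample(GVCF_TOTAL_TB, SEQUENCED_SAMPLES)

print(f"FASTQ: about {fastq_gb:.0f} GB per sample")
print(f"gVCF:  about {gvcf_gb:.1f} GB per sample")
```

Under these assumptions, the raw FASTQ data comes to roughly 72 GB per individual against under 4 GB for the gVCF files, which helps explain why the raw reads are served through a cloud interface rather than as downloads.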

Summary/Static Details

  • Why in the news?: GenomeIndia data opened to researchers
  • Purpose: Build a national genomic data repository
  • Aim: Enable research, innovation, and personalized medicine
  • Significance: National-level scientific infrastructure; supports health R&D
  • Launch/Key Date: PM Modi dedicated data on Jan 9, 2025
  • Key Stakeholders: DBT, IBDC, PMO, researchers, health & biotech ecosystem
