NASDAQ: $ILMN
Genomic Research Reaches New Heights with Expansion and Proteomic Integration
The landscape of genomic discovery is undergoing a significant transformation with major developments announced by the Alliance for Genomic Discovery (AGD), a collaboration spearheaded by Illumina, Inc. and Nashville Biosciences, LLC (NashBio). Two key announcements signal a massive leap forward: the addition of the Regeneron Genetics Center® (RGC®) as the tenth member, and the launch of a new initiative incorporating proteomic data alongside whole-genome sequencing.
Dataset Grows to Over 312,000 Whole Genomes
With the integration of the RGC, the AGD’s core dataset now surpasses 312,000 whole genomes. This collection is distinguished not just by its sheer size, but by the associated depth of clinical information. The AGD dataset stands out as one of the world’s largest repositories pairing whole-genome sequences with extensive, longitudinal clinical data derived from electronic health records (EHRs). This de-identified, deep phenotypic data is crucial for precisely defining disease cohorts, particularly those enriched for advanced or complex conditions.
Rami Mehio, Senior Vice President and General Manager of BioInsight at Illumina, emphasized the impact of this scale: “AGD has already enabled disease-impacting discoveries in autoimmune disease and obesity, with many more such studies underway, and continued expansion accelerates this progress. Integrating high-quality clinical and genomic data with advanced AI will help pharma translate discoveries into meaningful advances for patients.”
Regeneron Genetics Center Brings Significant Scale and Expertise
Regeneron Genetics Center, a subsidiary of Regeneron, utilizes one of the industry’s most robust genetic databases, boasting nearly 3 million sequenced exomes and associated de-identified EHRs. Aris Baras, MD, Senior Vice President and Head of RGC, expressed enthusiasm about joining the Alliance. “We are thrilled to join the Alliance alongside so many of our longtime partners, including Vanderbilt with their exceptional biobank and population-scale genetics program — one of the most impressive in the world,” said Dr. Baras. “Together, this alliance brings together an extraordinarily large and rich dataset, and we cannot wait to see the discoveries that lie ahead.”
RGC’s core mission aligns perfectly with the Alliance’s goals: uncovering large-effect protective genetic factors that can illuminate the next generation of high-confidence drug targets. Regeneron plans to leverage this expanded data across its entire drug discovery pipeline, from target identification through clinical trial design and patient access.
New Multiomic Frontier: Pairing Genomics with Proteomics
Moving beyond genomics alone, the AGD is initiating a critical second phase: the integration of proteomic data. Multiomic information adds vital layers of biological context, potentially accelerating drug discovery research significantly.
GSK is among the initial participants in this expansion, helping to establish a dataset featuring 50,000 whole genomes paired with corresponding proteomic data generated using Illumina® Protein Prep. This paired information is specifically designed to facilitate faster, more efficient target discovery and therapy development by aiding in the understanding of molecular mechanisms underlying disease-associated genetic variation.
Leeland Ekstrom, PhD, CEO of NashBio, noted the importance of this evolution: “We are thrilled to have GSK on board as we move to the next evolution of AGD. There is proven value in the integration of proteomics and comprehensive datasets, evidenced by the boon of large-scale studies showing promise in pinpointing drug targets linked to human disease.” Illumina’s recent acquisition of SomaLogic proteomics technologies further bolsters the Alliance’s multiomics capabilities in areas like cancer, cardiometabolic, and immunologic diseases.
Alliance Momentum and Infrastructure
The AGD network currently includes major pharmaceutical leaders such as AbbVie, Alnylam, Amgen, AstraZeneca, Bayer, Bristol Myers Squibb, GSK, Merck, and Novo Nordisk, in addition to the newest member, RGC.
The project, which began sequencing in January 2023, is recognized as one of the fastest large-scale genomics initiatives to date. This rapid progress is underpinned by NashBio and Vanderbilt University Medical Center’s foundational work with the BioVU biobank and the operational efficiency of utilizing tools like the DRAGEN Iterative gVCF Genotyper for enhanced variant calling accuracy across diverse populations. This expanded effort, coupled with initiatives like Illumina’s Billion Cell Atlas, underscores a growing industry trend toward leveraging massive, integrated datasets to redefine therapeutic understanding and development.
benefit of modern cell therapies.
