AgenticGenomics: pharmacogenomics benchmark dataset for "Trustworthy agentic genomics through versioned skill libraries"
AgenticGenomics benchmark dataset. Controlled three-arm comparison of nine frontier large language models on pharmacogenomic interpretation: Claude Opus 4, Claude Sonnet 4, GPT-5.2, GPT-4.1, o3, o4-mini, Gemini 2.5 Flash, DeepSeek V3 and Mistral Large 2, evaluated on 110 CPIC Level A cases across three population contexts (European, admixed Latin American, East African) under three conditions (free-prompted; retrieval-augmented from the CPIC guideline corpus; constrained by a versioned SKILL.md-format specification executed as a contract), three replicates per cell, 26,730 evaluations. The archive contains raw model responses, baseline and clinical-equivalence rescorer outputs, the locked merged three-arm dataset (26,730 rows), the forward and reverse adversarial scrambled-specification experiments, the (gene, drug)-keyed chunking control, the drug-recommendation regression classification, and figure-input tables. Records are scored on phenotype identification (A1), drug recommendation (A2) and lethal-class safety action (A3). All data derive from public CPIC Level A guidelines and PharmGKB annotations; genotypes are synthetic canonical cases; no patient data and no new sequencing data are included. Analysis code: https://github.com/manuelcorpas/24-AGENTIC-PGX-BENCHMARK (commit dc2c849e). Companion to the Perspective "Agentic Genomics: From Pipeline Automation to Autonomous Validation".
Keywords
agentic genomics| Item Type | Dataset |
|---|---|
| Resource Type |
Resource Type Resource Description Software Python script |
| Capture method | Other |
| Date | 6 June 2026 |
| Language(s) of written materials | English |
| Creator(s) |
Corpas, M; Iacoangeli, A; Bourdenx, M; Skene, N; Aldraimli, M; Fatumo, S |
| LSHTM Faculty/Department | MRC/UVRI and LSHTM Uganda Research Unit |
| Participating Institutions | London School of Hygiene & Tropical Medicine, London, United Kingdom |
| Date Deposited | 08 Jun 2026 09:01 |
| Last Modified | 08 Jun 2026 09:01 |
| Publisher | Zenodo |