TL;DR: Pharma R&D Technology
- R&D productivity has declined for decades despite more spending (Scannell et al., 2012)
- Genetic support for targets may double clinical success rates in historical pipeline data (Nelson et al., 2015)
- AlphaFold changed structure prediction but not clinical attrition by itself (Jumper et al., 2021)
- Oncology likelihood of approval was 3.4% in a 2000 to 2015 registry sample (Wong et al., 2019)
- Literature mapping of target and biomarker evidence should precede expensive wet-lab scale-up
From the Motif team: We map published target and biomarker evidence from PubMed, PMC, and Europe PMC. We do not design molecules or run clinical trials.
Pharmaceutical R&D adopts new tools faster than it changes approval odds. Scannell et al. (2012) documented falling numbers of new drugs approved per billion dollars of R&D spend. DOI: 10.1038/nrd3681. Wong et al. (2019) estimated oncology likelihood of approval at 3.4% in an aggregate sample of more than 400,000 clinical-trial registry entries from 2000 to 2015.1 Technology helps only when it improves the quality of hypotheses entering the clinic.
Target Selection: Genetics and Literature
Nelson et al. (2015) analyzed approved drug mechanisms and estimated that selecting genetically supported targets could double clinical development success rates compared with targets lacking human genetic evidence.2 That does not make every GWAS hit a drug target; it means teams should document genetic and functional evidence before chemistry campaigns.
Vamathevan et al. (2019) survey machine learning uses from target identification through trial design, emphasizing data quality, leakage, and external validation gaps.3 A model trained on historical screening decks can optimize for artifacts unless negative results are visible in the literature record.
Before scaling chemistry, map what published trials already report about the target, pathway biomarkers, and failed programs in the same mechanism class. Conflicting PMIDs are as informative as positive ones for governance committees.
Structural Biology and AI Chemistry
Jumper et al. (2021) reported atomic-accuracy protein structure prediction with AlphaFold across large protein sets.4 Structures accelerate hypothesis generation for binding sites and variant interpretation, but they do not replace medicinal chemistry, PK, or tox studies. Most clinical failures remain efficacy- or safety-driven in phase II and III.
Harrer et al. (2019) argue clinical AI requires prospective evaluation plans, not retrospective benchmark wins.5 The same applies to generative chemistry: synthesizability, novelty filters, and assay confirmation still gate progression.
Clinical Development and Digital Endpoints
Decentralized elements (remote consent, home sampling, wearable sensors) can reduce friction when endpoints are validated for the question asked. FDA-NIH BEST (2016) applies the same biomarker-category discipline to digital measures used as trial endpoints.6 A consumer wearable signal is not automatically a regulatory-grade endpoint.
Master protocols and biomarker enrichment can shrink sample size when predictive evidence is strong (Hong et al., 2020).7 Freidlin and Korn (2014) warn that weak biomarkers should not drive enrichment without prespecified validation.8
Poste (2011) argued validation budgeting limits translation more than discovery throughput. DOI: 10.1038/469156a. Digital endpoints and AI trial simulations still need fit-for-purpose evidence before they change primary endpoints in registrational studies.
Where Technology Programs Fail
- Chasing targets with no genetic or functional validation despite attractive structures
- Treating AlphaFold confidence scores as surrogate for clinical efficacy
- Training ML on biased historical screens without negative-result literature
- Adopting wearable metrics without outcome-linked validation in the target population
- Skipping biomarker context-of-use definitions before enrichment designs
- Ignoring failed trials in the same target class when pitching new programs
Where Motif Fits in R&D Scoping
Before IND-enabling spend, teams need a cited map of target, biomarker, and endpoint evidence. In Motif, that workflow typically runs like this:
- Search: Plain-language queries across PubMed, PMC, and Europe PMC for genes, pathways, biomarkers, and comparators in your indication.
- Extract: PMID-linked associations with effect sizes, trial phase, and study design, including negative or null results.
- Cross-reference: Genes, proteins, and variants resolve to Open Targets, UniProt, ClinVar, and ChEMBL.
- Export: Cited evidence tables for target governance, biomarker charters, and regulatory briefing backgrounds.
Motif does not design molecules, run assays, or enroll patients. It compresses literature diligence so computational and chemistry teams start from a traceable evidence baseline.
Read our blog on AI in drug discovery to learn more about realistic timelines. For regulatory AI context, read our blog on regulatory pathways for AI-discovered therapeutics to learn more. For target validation depth, read our blog on target identification and validation to learn more.
References
- Wong, C.H., et al. (2019). Estimation of clinical trial success rates and related parameters. Biostatistics, 20(2), 273-286. PMID: 29394327
- Nelson, M.R., et al. (2015). Genetic evidence for approved drug indications. Nat Genet, 47(8), 856-860. PMID: 26121088
- Vamathevan, V., et al. (2019). Applications of ML in drug discovery. Nat Rev Drug Discov, 18(6), 463-477. PMID: 30371395
- Jumper, J., et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature, 596(7873), 583-589. PMID: 34265844
- Harrer, S., et al. (2019). AI for clinical trials. npj Digit Med, 2, 69. PMID: 31375821
- FDA-NIH Biomarker Working Group. (2016). BEST Resource. PMID: 27010052
- Hong, F., et al. (2020). Biomarker-driven oncology clinical trials. JCO, 38(24), 2822-2833. PMID: 32923854
- Freidlin, B., & Korn, E.L. (2014). Biomarker enrichment strategies. Nat Rev Clin Oncol, 11(2), 81-90. PMID: 24281059
- FDA. (2023). Using artificial intelligence and machine learning in drug development. Discussion paper.
- Poste, G. (2011). Bring on the biomarkers. Nature, 469(7329), 156-157. DOI: 10.1038/469156a
- Scannell, J.W., et al. (2012). Diagnosing the decline in R&D efficiency. Nat Rev Drug Discov, 11(3), 191-200. DOI: 10.1038/nrd3681



