Skip to main content

Benchmarking Active Learning for Molecular Design and Discovery

Reference number CCPR175

Funding
Funded
Study mode
Full-time
Part-time
Apply by
Start date
Subject area
Chemistry

Join us at our Postgraduate Open Events

Meet us on campus or online in March 2026 to find out more about master’s degrees and research opportunities at Liverpool.

Change country or region

We’re currently showing entry requirements and other information for applicants with qualifications from United Kingdom.

Please select from our list of commonly chosen countries below or choose your own.

If your country or region isn’t listed here, please contact us with any questions about studying with us.

Overview

This PhD will develop rigorous benchmarks for active learning in molecular design, testing when and why AL methods outperform baselines under realistic data and experimental constraints. Fragment screening and DNA-encoded libraries will serve as contrasting real-world application domains for prospective validation.

About this opportunity

Active learning (AL) is widely promoted as a route to more efficient molecular discovery, yet there is limited consensus on which AL strategies work, under what conditions, and why. Reported successes are often based on retrospective simulations, favourable datasets, or bespoke experimental setups, making it difficult to compare methods or assess their robustness. The ALGeNeM project addresses this gap by developing principled, transparent benchmarks for AL in chemistry, grounded in realistic data regimes and prospective experimental feedback.

Aim.

The central aim of this PhD is to systematically benchmark active learning strategies for molecular design, quantifying their benefits and failure modes across different data modalities, objectives, and uncertainty regimes. Rather than proposing a single new AL algorithm, the project will focus on understanding performance trade-offs between acquisition functions, surrogate models, and representations, and on defining best practices for deploying AL in real discovery settings.

What the candidate will do.

Define benchmarking frameworks: Design evaluation protocols for AL that reflect realistic constraints (small data, noisy measurements, batch selection, delayed feedback, and model misspecification). Establish strong non-AL baselines (random, diversity-based, greedy optimisation).

Compare AL strategies: Benchmark commonly used acquisition functions (e.g. uncertainty sampling, expected improvement, information-theoretic criteria) across multiple surrogate model classes and molecular representations.

Uncertainty and realism: Investigate how uncertainty estimation quality, distribution shift, and dataset bias impact AL performance, including cases where AL provides no benefit or is actively harmful.

Prospective and semi-prospective validation: Apply the benchmarking framework in two contrasting experimental contexts:

Fragment-based and structure-enabled optimisation, where data are sparse but information-rich.

DNA-encoded library (DEL) or large library screening, where data are abundant but noisy and biased.

These will serve as test cases to stress-test conclusions, not as the primary focus of the thesis.

Training and collaboration.

The student will receive interdisciplinary training across the Materials Innovation Factory (MIF) and the Department of Chemistry, covering machine learning for molecular data, statistical decision-making, experimental design, and reproducible research software engineering. Through ALGeNeM collaborations, the student will interact with experimental scientists to understand how algorithmic choices translate into real cost, time, and risk trade-offs.

Project structure.

Year 1: Core training; literature review on AL theory and chemical applications; definition of benchmarking criteria and datasets; implementation of baseline pipelines.

Year 2: Systematic benchmarking of AL strategies across simulated and historical datasets; analysis of uncertainty, bias, and failure modes; first methods/benchmarking publication.

Year 3: Application of the framework to fragment screening and DEL-style datasets with prospective or semi-prospective evaluation; synthesis of general design principles for AL in chemistry.

Final period: Thesis completion, dissemination of open benchmarking tools, and submission of final publications.

This project will produce actionable guidance for the community on when active learning is worth using in molecular discovery — and when it is not — alongside reusable benchmarking infrastructure aligned with ALGeNeM’s broader goals.

Back to top

Who is this for?

Candidates will have, or be due to obtain, a Master’s Degree or equivalent in a relevant subject. Exceptional candidates with a First Class Bachelor’s Degree in an appropriate field or significant relevant experience will also be considered. This degree is for students with a strong quantitative background who are motivated to work at the interface of machine learning and molecular science. Suitable candidates will hold (or be completing) a degree in chemistry, chemical biology, computational chemistry, computer science, data science, physics, engineering, or a closely related discipline, and will be interested in applying active learning methods to real-world molecular discovery problems.

Back to top

How to apply

  1. 1. Contact supervisors

    Candidates wishing to apply should complete the University of Liverpool application form to apply for a PhD in Chemistry.

    Please review our guide on How to apply for a PhD | Postgraduate research | University of Liverpool carefully and complete the online postgraduate research application form to apply for this PhD project.

    Please ensure you include the project title and reference number CCPR175 when applying.

    Supervisor  Email address
    Anthony Bradley anthony.bradley@liverpool.ac.uk
  2. 2. Prepare your application documents

    You may need the following documents to complete your online application:

    • A research proposal (this should cover the research you’d like to undertake)
    • University transcripts and degree certificates to date
    • Passport details (international applicants only)
    • English language certificates (international applicants only)
    • A personal statement
    • A curriculum vitae (CV)
    • Contact details for two proposed supervisors
    • Names and contact details of two referees.
  3. 3. Apply

    Finally, register and apply online. You'll receive an email acknowledgment once you've submitted your application. We'll be in touch with further details about what happens next.

Back to top

Funding your PhD

This UKRI funded Studentship will cover full tuition fees (for 2025-26 this is £5,006 pa.) and pay a maintenance grant for 3.5 years, at the UKRI standard rates (for 2025-26 this is £20,780 pa.) The Studentship also comes with access to additional funding in the form of a Research Training Support Grant to fund consumables, conference attendance, etc.

UKRI Studentships are available to any prospective student wishing to apply including both home and international students. While UKRI funding will not cover international fees, a limited number of scholarships to meet the fee difference will be available to support outstanding international students.

We want all our Staff and Students to feel that Liverpool is an inclusive and welcoming environment that actively celebrates and encourages diversity. We are committed to working with students to make all reasonable project adaptations including supporting those with caring responsibilities, disabilities or other personal circumstances. For example, if you have a disability you may be entitled to a Disabled Students Allowance on top of your studentship to help cover the costs of any additional support that a person studying for a doctorate might need as a result. We believe everyone deserves an excellent education and encourage students from all backgrounds and personal circumstances to apply.

Back to top

Contact us

Have a question about this research opportunity or studying a PhD with us? Please get in touch with us, using the contact details below, and we’ll be happy to assist you.

Back to top