Build Big meets Build Smart to Explore the Universe

Our Liverpool Virtual Seminar Series on Data Intensive Science will continue on Tuesday 14th October at 15:00 BST. The seminar will be given by Carolina Cuesta-Lazaro of the NSF Institute for Artificial Intelligence and Fundamental Interactions (IAIFI) at MIT, who will present “Build Big meets Build Smart to Explore the Universe”.

Seminars in this series cover R&D outside the core research areas of the Data Intensive Science CDT and give an insight into cutting-edge research in those fields. At the end of the talk there will be a Q&A session with the speaker.

About the talk

Modern cosmology exemplifies the synergy between two complementary approaches in machine learning: scaling up with large models and datasets ("build big") and incorporating targeted inductive biases for specific problems ("build smart"). Rather than choosing between these strategies, the most promising advances emerge from combining both. This talk presents three projects that demonstrate this principle in action.

First, I show how foundation models for science benefit from incorporating both simulated and observed data. By learning shared representations across these domains through alignment losses, we achieve robust simulation-based inference that remains reliable even under model misspecification.

Second, I demonstrate scale-dependent anomaly detection using machine learning with cosmological inductive biases. By incorporating physical knowledge about scale dependence into the model architecture, we can detect deviations from standard models across different cosmological scales non-parametrically. This approach leverages both large observational datasets and physically motivated architectural choices to identify potential new physics.

Third, I explore using large language models for automated hypothesis generation in cosmology. Through a systematic evaluation framework, I show that LLMs can autonomously propose novel dark energy theories and implement them in existing physics codes such as CLASS. While the approach shows promise, it also reveals current limitations, including implementation challenges for complex models and the tendency to improve fits through additional parameters rather than fundamental insights.

Each project illustrates how the future of scientific discovery lies not in choosing between computational scale and inductive biases, but in thoughtfully combining both.

About the speaker

Carolina Cuesta-Lazaro works at the intersection of astrophysics and machine learning. She is interested in developing robust and interpretable models that can guide us towards future discoveries in physics.

Carolina received her PhD in Physics and Data Science from the Institute of Computational Cosmology at Durham University, UK. Alongside her PhD, she has been a research collaborator with the United Nations (UN) Global Pulse and the UK’s National Health Service (NHS), developing epidemiological simulations, and a research intern at Amazon’s Alexa team. She was also a postdoctoral fellow at the NSF Institute for Artificial Intelligence and Fundamental Interactions (IAIFI) at MIT and the Center for Astrophysics at Harvard. Next year she will join NYU to take up a faculty appointment.

How to attend

Participation is free, but you need to register to attend this and other webinars in the series. For more information and to register, please follow this link. Once registered, you will receive the Zoom connection details on the morning of the online seminar.

The seminar details

Speaker: Carolina Cuesta-Lazaro (IAIFI)

Seminar title: “Build Big meets Build Smart to Explore the Universe”

Date/Time: Tuesday 14th October at 15:00 BST