Welcome to Bootcamp for Data Chemistry

Overview

Dates: September 22 – 26, 2025
Location: University of Notre Dame


LOGO

Introduction

In today’s research landscape, chemistry is evolving from a discipline driven mainly by experimental intuition to one powered by data-centric discovery. Advances in machine learning, high-throughput simulation, and automated lab instrumentation are generating vast datasets that can reveal patterns no human could spot unaided. Harnessing these data effectively—whether to predict molecular properties, design greener reactions, or accelerate materials discovery—demands a new skill set at the intersection of chemistry and artificial intelligence. Our Data-Chemistry Bootcamp is designed to equip chemistry students with exactly this toolkit: foundational AI concepts, hands-on coding experience, and domain-specific case studies that show how algorithms translate into real scientific breakthroughs. By bridging chemical insight with modern data methods, participants will be prepared to drive the next wave of innovation in both academia and industry.


Please register in advance so we can reserve seating and meals.

  • Complete the form here
  • Registration deadline: 1 September 2025 (or until all places are filled)

We look forward to welcoming you to Notre Dame for a week of data-driven chemistry!


Invitees

  • 5 external faculty (Jackson – UIUC; Zimmerman – U Mich; Ferguson – UC; Anstine – Mich State; Lindsey – Mich)
  • Industry scientists (AbbVie, Dow, Lilly, 3M, P&G) – attend selected sessions

Agenda at a Glance

Sunday (Sep 21)

  • 18:00 – 20:00 – Welcome reception & registration; workshop overview; networking

Day 1 – Monday: Representing Chemistry to ML

  • Keynotes on molecular representations & GNNs
  • Tutorial: RDKit fundamentals
  • Hands-on: Building a molecular property-prediction pipeline

Day 2 – Tuesday: Design of Experiments & Active Learning

  • Talks on Bayesian optimisation & active learning
  • Tutorial: scikit-learn for chemistry
  • Hands-on: Active-learning pipeline; poster session

Day 3 – Wednesday: Machine-Learning Potentials

  • Talks on neural network potentials & coarse-graining
  • Tutorial: ASE basics
  • Hands-on: Deploying ML potentials as DFT surrogates

Day 4 – Thursday: AI Agents for Chemistry

  • Talks on LLMs for scientific discovery
  • Panel: Future of AI in chemistry
  • Workshop: Building an AI research assistant for chemistry

Acknowledgement

We gratefully acknowledge the generous support of the ND SAI and C-CAS. Their commitment to fostering interdisciplinary collaboration and advancing data-driven chemistry has been instrumental in making this bootcamp possible, enabling us to bring together students, researchers, and industry partners for a productive week of learning and discovery.