Protein Design & Molecular Structure
Applying generative modeling to protein design, molecular structure prediction, and crystal structure prediction using SE(3)-invariant flow matching and all-atom diffusion models.
We apply generative modeling to protein design and molecular structure prediction. Our work on SE(3) stochastic flow matching (FoldFlow) pioneered the use of flow matching on the SE(3) manifold for protein backbone generation, enabling the generation of novel protein structures with desirable properties.
Current projects span the full spectrum of molecular design: sequence-structure co-design of proteins around arbitrary biomolecules (DISCO), topological guidance for macrocycle generation (MacroGuide), and all-atom diffusion models for organic crystal structure prediction (OXtal). We also develop methods for protein-ligand docking and conformational generation using geometric deep learning.
Our goal is to build generative models that can design novel biomolecules—proteins, macrocycles, and crystalline materials—with specified functional properties, bridging the gap between machine learning and experimental validation.
Key Directions
- Protein Backbone Generation: SE(3)-stochastic flow matching and sequence-augmented flow matching for novel protein structures
- Sequence-Structure Codesign: Joint optimization of protein sequences and 3D structures (DISCO) for enzyme design
- Macrocycle Generation: Topological guidance via persistent homology to steer diffusion toward ring-shaped drug candidates (MacroGuide)
- Crystal Structure Prediction: All-atom diffusion models for organic crystal structure prediction (OXtal)
- Molecular Docking: Geometric deep learning for protein-ligand docking and pocket conformation generation
People
Selected Publications
-
MacroGuide: Topological Guidance for Macrocycle Generation
Alicja Maksymiuk, Alexandre Duplessis, Michael Bronstein, Alexander Tong, Fernanda Duarte, İsmail İlkan Ceylan
In ICML 2026
July 2026
-
OXtal: An All-Atom Diffusion Model for Organic Crystal Structure Prediction
Emily Jin, Andrei Cristian Nica, Mikhail Galkin, Jarrid Rector-Brooks, Kin Long Kelvin Lee, Santiago Miret, Frances H. Arnold, Michael Bronstein, Avishek Joey Bose, Alexander Tong, Chenghao Liu
In ICLR 2026
May 2026
-
Flow matching for generative modelling in bioinformatics and computational biology
Alex Morehead, Lazar Atanackovic, Akshata Hegde, Yanli Wang, Frimpong Boadu, Joel Selvaraj, Alexander Tong, Aditi Krishnapriyan, Jianlin Cheng
In Nature Machine Intelligence
April 2026
-
General Multimodal Protein Design Enables DNA-Encoding of Chemistry
Jarrid Rector-Brooks, Théophile Lambert, Marta Skreta, Daniel Roth, Yueming Long, Zi-Qi Li, Xi Zhang, Miruna Cretu, Francesca-Zhoufan Li, Tanvi Ganapathy, Emily Jin, Avishek Joey Bose, Jason Yang, Kirill Neklyudov, Yoshua Bengio, Alexander Tong, Frances H. Arnold, Cheng-Hao Liu
Preprint
April 2026
-
Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation
Guillaume Huguet, James Vuckovic, Kilian Fatras, Eric Thibodeau-Laufer, Pablo Lemos, Riashat Islam, Cheng-Hao Liu, Jarrid Rector-Brooks, Tara Akhound-Sadegh, Michael Bronstein, Alexander Tong, Avishek Joey Bose
In NeurIPS 2024
December 2024
-
SE(3)-Stochastic Flow Matching for Protein Backbone Generation
Avishek Joey Bose, Tara Akhound-Sadegh, Kilian Fatras, Guillaume Huguet, Jarrid Rector-Brooks, Cheng-Hao Liu, Andrei Cristian Nica, Maksym Korablyov, Michael Bronstein, Alexander Tong
In ICLR 2024 (Spotlight)
May 2024