Today we are making RFdiffusion, our artificial intelligence (AI) program that can generate novel proteins with potential applications in medicine, vaccines, and advanced materials, free for both non-profit and for-profit use under a governed license. The software, which has been tested in the lab, is much faster and more capable than prior protein design tools.
Researchers can access RFdiffusion through the open-source online platform ColabFold, which is part of the cloud-based Google Colaboratory. The code is also available for download from GitHub.
“With RFdiffusion, the power of AI can be harnessed to create useful proteins in a matter of seconds. We invite researchers from around the world to join us in this exciting journey of scientific discovery,” said Joseph Watson, a lead developer of RFdiffusion and postdoctoral scholar in the lab.
RFdiffusion outperforms existing protein design methods across a broad range of problems, including topology-constrained protein monomer design, protein binder design, symmetric oligomer design, enzyme active site scaffolding, and symmetric motif scaffolding for therapeutic and metal-binding protein design.
RFdiffusion was developed by a team of computational biologists from UW Medicine, Columbia University, and MIT. Sergey Ovchinnikov, a John Harvard Distinguished Science Fellow at Harvard University, implemented RFdiffusion onto the ColabFold platform.
David Juergens, a lead developer of the software and doctoral student in the lab, said, “We’re thrilled to share RFdiffusion with the global research community, and we can’t wait to see the diverse and innovative ways scientists will apply this powerful AI tool.”
Drawing inspiration from AI image generation tools like DALL-E, the team developed RFdiffusion as a guided diffusion model for protein design. Proteins are composed of amino acid chains, and the sequence of these building blocks determines a protein’s structure and function, such as transporting oxygen in the blood, digesting food in the stomach, or sending signals in the brain.
To date, we have used RFdiffusion to create proteins that bind to target molecules, assemble into complex structures, and more. In one project, we created proteins that bind to hormones more tightly than any prior protein or small molecule designed on a computer. This breakthrough in creating proteins with enhanced binding properties has promising implications for the development of more effective diagnostics and therapies.
“We are making RFdiffusion freely available to empower researchers around the world. We believe it can help unlock new solutions to important challenges in medicine and beyond,” said Nate Bennett, a lead developer of RFdiffusion and doctoral student in the lab. Proteins made via RFdiffusion have the potential to prevent infections, combat cancer, reverse autoimmune disorders, and serve as key components in advanced materials.
The development and initial testing of RFdiffusion is described in the preprints “Broadly applicable and accurate protein design by integrating structure prediction networks and diffusion generative models” and “De novo design of high-affinity protein binders to bioactive helical peptides.” Both manuscripts are currently under peer review. Because of the considerable demand for the code, we are making it available prior to journal publication.