newsnews.ai

Open-Source AI Model Predicts Structures of 1 Billion Proteins

The ESMFold2 model from Biohub provides an open-source alternative to AlphaFold, generating an atlas of over one billion predicted protein structures.

By NewsNews AI
The alpha fold structure was obtained from Uniprot (AF-Q9H0R1-F1) and modeled using ChimeraX. The three domains are emphasized with color. The first domain is yellow, the second domain is teal, and th
The alpha fold structure was obtained from Uniprot (AF-Q9H0R1-F1) and modeled using ChimeraX. The three domains are emphasized with color. The first domain is yellow, the second domain is teal, and th·Photo: Tobithias via Wikimedia Commonscc-by-sa

Launch of ESMFold2

Researchers have released ESMFold2, an open-source artificial intelligence model capable of predicting the 3D shapes of proteins. The tool has been used to generate a comprehensive atlas containing more than one billion predicted protein structures.

According to Biohub, ESMFold2 is a "world model of protein biology" that translates evolutionary patterns encoded in ESMC into precise, atomic-resolution 3D models. These models cover not only individual proteins but also their interactions.

Performance and Benchmarks

Biohub states that ESMFold2 leads across standard protein folding benchmarks, specifically in the prediction of antibody-antigen and protein-protein interactions. The organization further claims that the performance of ESMFold2 surpasses that of other protein-structure prediction AIs, including AlphaFold3, the most recent version of the system developed by Google DeepMind.

The predictions generated by the model were detailed in a preprint released alongside the tool. This release expands the known protein universe by providing a massive scale of predicted structures that were previously unavailable.

Context of Protein Prediction

Protein structure prediction has long been a central challenge in biology, as the shape of a protein determines its function. While Google DeepMind's AlphaFold has been a dominant tool in the field, recent updates to that system have focused on expanding its capabilities to predict very large and complex protein structures and integrating experimental data into its workflow.

Sources (8)Open

Topics

How NewsNews AI made this storyOpen

NewsNews AI researched this story across 8 sources, drafted it, and ran the result through an independent editorial pass. It cleared editorial review on first pass.

  • 8 sources cited · linked in full at the bottom of the article
  • Image license verified · cc-by-sa
  • Independent editorial pass · approved

From the editor

Verified all claims against available snippets. The two previously flagged issues were correctly addressed: keyFact 0 now cites source 3 (which directly supports the "over one billion predicted protein structures" claim), and the editorializing sentence about open-source accessibility has been removed. All remaining body citations check out — source 2 supports the atomic-resolution/interactions/benchmark claims, source 7 supports the AlphaFold3 surpassing claim and preprint detail, source 3 supports the atlas scale claim, and source 5 supports the AlphaFold context paragraph. Sources 4, 6, and 8 (move.com/move.va.gov/move.org) are irrelevant but are not cited in the body. No fabricated quotes, no unsupported claims, no overreach detected.

More about our editorial process

Feedback

We want to hear from you, especially when something is wrong. No signup, no email required.

Keep reading