A newer ‘all atom’ iteration of RFdiffusion 5 allows designers to computationally shape proteins around non-protein targets such as DNA, small molecules and even metal ions. For example, Baker’s team is using RFdiffusion to engineer novel proteins that can form snug interfaces with targets of interest, yielding designs that “just conform perfectly to the surface,” Baker says. RFdiffusion software 3 developed by Baker’s lab and the Chroma tool by Generate Biomedicines in Somerville, Massachusetts 4, exploit this strategy to remarkable effect. These algorithms are initially trained to remove computer-generated noise from large numbers of real structures by learning to discriminate realistic structural elements from noise, they gain the ability to form biologically plausible, user-defined structures. Some of the most sophisticated of these use ‘diffusion’ models, which also underlie image-generating tools such as DALL-E. ‘Structure based’ approaches are better for this, and 2023 saw notable progress in this type of protein-design algorithm, too. Sequence-based approaches can build on and adapt existing protein features to form new frameworks, but they’re less effective for the bespoke design of structural elements or features, such as the ability to bind specific targets in a predictable fashion. Although worth monitoring, these tools need time to mature and to establish their broader role in the scientific world. Furthermore, ChatGPT’s persistent issuing of either misleading or fabricated responses was the leading concern of more than two-thirds of survey respondents. However, many of these applications represent labour-saving gains rather than transformations of the research process. Such tools are also proving valuable from an equity perspective, helping those for whom English isn’t their first language to refine their prose and thereby ease their paths to publication and career growth. Respondents to a Nature survey in September (see go./45232vd) cited ChatGPT as the most useful AI-based tool and were enthusiastic about its potential for coding, literature reviews and administrative tasks. ChatGPT and its ilk seem poised to become part of many researchers’ daily routines and were feted as part of the 2023 Nature’s 10 round-up (see go./3trp7rg). But one such tool did not make the final cut: the much-hyped artificial-intelligence (AI)-powered chatbots. Readers might detect a theme in this year’s technologies to watch: the outsized impact of deep-learning methods. Another tool co-developed by Ferruz, called ZymCTRL, draws on sequence and functional data to design members of naturally occurring enzyme families 2. In 2022, her team developed an algorithm called ProtGPT2 that consistently comes up with synthetic proteins that fold stably when produced in the laboratory 1. “They really learn the hidden grammar,” says Noelia Ferruz, a protein biochemist at the Molecular Biology Institute of Barcelona, Spain. By treating protein sequences like documents comprising polypeptide ‘words’, these algorithms can discern the patterns that underlie the architectural playbook of real-world proteins. ‘Sequence based’ strategies use the large language models (LLMs) that power tools such as the chatbot ChatGPT (see ‘ChatGPT? Maybe next year’). But sophisticated methods of deep learning, a form of artificial intelligence (AI), have also been essential. Much of that progress comes down to increasingly massive data sets that link protein sequence to structure. “Things that were impossible a year and a half ago - now you just do it.” “It’s hugely empowering,” says Neil King, a biochemist at the University of Washington who collaborates with Baker’s team to design protein-based vaccines and vehicles for drug delivery. Today, de novo protein design has matured into a practical tool for generating made-to-order enzymes and other proteins. ‘Top7’ folded as predicted, but it was inert: it performed no meaningful biological functions. Two decades ago, David Baker at the University of Washington in Seattle and his colleagues achieved a landmark feat: they used computational tools to design an entirely new protein from scratch. From protein engineering and 3D printing to detection of deepfake media, here are seven areas of technology that Nature will be watching in the year ahead.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |